p3-sort¶

Small File Multi-Column Sort¶

p3-sort.pl [options] col1 col2 ... colN

This is a sort script variant that sorts a single small file in memory with the ability to specify multiple columns. It assumes the file has a header, and the columns are tab-delimited. If no columns are specified, it sorts by the first column only.

Parameters¶

The positional parameters are the indices (1-based) or names of the key columns. Columns to be sorted numerically are indicated by a slash-n (/n) at the end of the column index or name. So,

p3-sort genome.genome_id feature.start/n

Would indicate two key columns, the second of which is to be sorted numerically.

Use /p to sort in PEG order, which means the column contains FIG feature IDs.

To sort in reverse, add /r for reverse string sort and /nr for reverse numeric sort.

The standard input can be overridden using the options in Input Options.

The following additional options are suppported.

count

If specified, the output will consist only of the key fields with a count column added.

nonblank

If specified, records with at least one empty key field will be discarded.

unique

Only include one output line for each key value. This option is mutually exclusive with --dups.

dups

Only include lines with duplicate keys in the output. This option is mutually exclusive with --unique.

nohead

If specified, the input file has no headers.

verbose

Write progress messages to STDERR.