p3-genome-kmer-hits¶
Count KMER Hits in Genomes¶
p3-genome-kmer-hits.pl [options] kmerDB
This script takes as input a list of genome IDs and outputs a table of the number of kmer hits by group in each genome. The output file will be tab-delimited, with the genomeID, the sequence ID, the group ID, the group name, and the kmer hit count.
Parameters¶
The positional parameter is the file name of the kmer database. This is a json-format KmerDb object.
The standard input can be overridden using the options in Input Options.
Additional command-line options are those given in Column Options (to choose the genome ID column) plus the following options.
prot
If specified, the kmers are assumed to be protein kmers.
pegs
If specified, the kmer hits will be counted against the genome’s proteins, not the genome itself. This implies
--prot
.
verbose
If specified, progress messages will be written to STDERR.