Command Line Interface
Root command group
gambit [OPTIONS] COMMAND [ARGS]...
Some top-level options are set at the root command group, and should be specified before the name of the subcommand to run.
Options
- -d, --db DIR
Path to the directory containing reference database files. Required by most subcommands. As an alternative you can specify the database location with the
GAMBIT_DB_PATH
environment variable.
Environment variables
Querying the database
query
gambit query [OPTIONS] (-s SIGFILE | -l LIST | GENOMES...)
Predict taxonomy of microbial samples from genome sequences.
GENOMES
are one or more FASTA files containing assembled query genomes. Alternatively
a file containing pre-calculated signatures may be used with the --sigfile
option. The
reference database must be specified from the root command group.
Options
- -s, --sigfile FILE
Path to file containing query signatures.
- -o, --output FILE
File to write output to. If omitted will write to stdout.
- -f, --outfmt {csv|json|archive}
Results format (see next section).
Result Formats
CSV
A .csv file with one row per query. Contains the following columns:
query.*
- Query genome.query.name
- Name of query.query.path
- Path to query file, if any.
predicted.*
- Predicted taxon.predicted.name
predicted.rank
predicted.ncbi_id
- Numeric ID in NCBI taxonomy database.predicted.threshold
closest.*
- Reference genome closest to query.closest.distance
- Distance to closest genome.closest.decription
- Text description.
next.*
- Next most specific taxon for which the classification threshold was not met.next.name
next.rank
next.ncbi_id
next.threshold
JSON
A machine-readable format meant to be used in pipelines.
Todo
Document schema
Archive
A more verbose JSON-based format used for testing and development.
Generating and inspecting k-mer signatures
signatures info
gambit signatures info [OPTIONS] FILE
Print information about a GAMBIT signatures file. Defaults to a basic human-readable format.
Options
- -j, --json
Print information in JSON format. Includes more information than standard output.
- -p, --pretty
Prettify JSON output to make it more human-readable.
- -i, --ids
Print IDs of all signatures in file.
signatures create
gambit signatures create [OPTIONS] GENOMES
Calculate GAMBIT signatures of GENOMES
and write to file.
The -k
and --prefix
options may be omitted if a reference database is specified through the
root command group, in which case the parameters of the database will be used.
Options
- -o, --output FILE
Path to write file to (required).
- -k INTEGER
Length of k-mers to find (does not include length of prefix).
- -p, --prefix STRING
K-mer prefix to match, a non-empty string of DNA nucleotide codes.
- -i, --ids FILE
File containing IDs to assign to signatures in file metadata. Should contain one ID per line.
- -m, --meta-json FILE
JSON file containing metadata to attach to file.
Todo
Document metadata schema