Annotation of bacterial genomes (isolates, MAGs) and plasmids
Input
name:type
description
pattern
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
fasta
:file
FASTA file to be annotated. Has to contain at least a non-empty string dummy value.
*.{fa,fas,fna,fasta}
db
:file
Path to the Bakta database directory. Must have amrfinderplus database directory already installed within it (in a directory called ‘amrfinderplus-db/’).
proteins
:file
FASTA/GenBank file of trusted proteins to first annotate from (optional)
*.{fa,fas,fna,fasta,faa}
prodigal_tf
:file
Training file to use for Prodigal for CDS prediction(optional)
*.{tf,trn}
regions
:file
GFF3 or GenBank file of pre-annotated regions (optional)
*.{gbff,gff3}
hmms
:file
HMM database file for custom annotation (optional)
*.hmm
Output
name:type
description
pattern
embl
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.embl
:file
annotations & sequences in (multi) EMBL format
*.embl
faa
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.faa
:file
CDS/sORF amino acid sequences as FASTA
*.faa
ffn
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.ffn
:file
feature nucleotide sequences as FASTA
*.ffn
fna
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.fna
:file
replicon/contig DNA sequences as FASTA
*.fna
gbff
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.gbff
:file
annotations & sequences in (multi) GenBank format
*.gbff
gff
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.gff3
:file
annotations & sequences in GFF3 format
*.gff3
hypotheticals_tsv
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.hypotheticals.tsv
:file
additional information on hypothetical protein CDS as simple human readable tab separated values
*.hypotheticals.tsv
hypotheticals_faa
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.hypotheticals.faa
:file
hypothetical protein CDS amino acid sequences as FASTA
*.hypotheticals.faa
tsv
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.tsv
:file
annotations as simple human readable tab separated values
*.tsv
txt
meta
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
${prefix}.txt
:file
genome statistics and annotation summary
*.txt
versions
versions.yml
:file
File containing software versions
versions.yml
Tools
bakta
GPL v3
Rapid & standardized annotation of bacterial genomes, MAGs & plasmids.