Description

Annotation of bacterial genomes (isolates, MAGs) and plasmids

Input

name:type
description
pattern

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

fasta

:file

FASTA file to be annotated. Has to contain at least a non-empty string dummy value.

*.{fa,fas,fna,fasta}

db

:file

Path to the Bakta database directory. Must have amrfinderplus database directory already installed within it (in a directory called ‘amrfinderplus-db/’).

proteins

:file

FASTA/GenBank file of trusted proteins to first annotate from (optional)

*.{fa,fas,fna,fasta,faa}

prodigal_tf

:file

Training file to use for Prodigal for CDS prediction(optional)

*.{tf,trn}

regions

:file

GFF3 or GenBank file of pre-annotated regions (optional)

*.{gbff,gff3}

hmms

:file

HMM database file for custom annotation (optional)

*.hmm

Output

name:type
description
pattern

embl

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.embl

:file

annotations & sequences in (multi) EMBL format

*.embl

faa

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.faa

:file

CDS/sORF amino acid sequences as FASTA

*.faa

ffn

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.ffn

:file

feature nucleotide sequences as FASTA

*.ffn

fna

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.fna

:file

replicon/contig DNA sequences as FASTA

*.fna

gbff

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.gbff

:file

annotations & sequences in (multi) GenBank format

*.gbff

gff

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.gff3

:file

annotations & sequences in GFF3 format

*.gff3

hypotheticals_tsv

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.hypotheticals.tsv

:file

additional information on hypothetical protein CDS as simple human readable tab separated values

*.hypotheticals.tsv

hypotheticals_faa

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.hypotheticals.faa

:file

hypothetical protein CDS amino acid sequences as FASTA

*.hypotheticals.faa

tsv

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.tsv

:file

annotations as simple human readable tab separated values

*.tsv

txt

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

${prefix}.txt

:file

genome statistics and annotation summary

*.txt

versions

versions.yml

:file

File containing software versions

versions.yml

Tools

bakta
GPL v3

Rapid & standardized annotation of bacterial genomes, MAGs & plasmids.