Description

Locate and tag duplicate reads in a BAM file

Input

name:type
description
pattern

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

reads

:file

Sequence reads file, can be SAM/BAM/CRAM format

*.{bam,cram,sam}

meta2

:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fasta

:file

Reference genome fasta file, required for CRAM input

*.{fasta,fa}

meta3

:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fai

:file

Reference genome fasta index

*.{fai}

Output

name:type
description
pattern

bam

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bam

:file

BAM file with duplicate reads marked/removed

*.{bam}

bai

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bai

:file

An optional BAM index file. If desired, —CREATE_INDEX must be passed as a flag

*.{bai}

cram

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.cram

:file

Output CRAM file

*.{cram}

metrics

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.metrics.txt

:file

Duplicate metrics file generated by picard

*.{metrics.txt}

versions

versions.yml

:file

File containing software versions

versions.yml

Tools

picard
MIT

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.