Description

Locate and tag duplicate reads in a BAM file

Input

name:type
description
pattern

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

reads:file

Sequence reads file, can be SAM/BAM/CRAM format

*.{bam,cram,sam}

meta2:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fasta:file

Reference genome fasta file, required for CRAM input

*.{fasta,fa}

meta3:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fai:file

Reference genome fasta index

*.{fai}

Output

name:type
description
pattern

bam

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

*.bam:file

BAM file with duplicate reads marked/removed

*.{bam}

bai

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

*.bai:file

An optional BAM index file. If desired, —CREATE_INDEX must be passed as a flag

*.{bai}

cram

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

*.cram:file

Output CRAM file

*.{cram}

metrics

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

*.metrics.txt:file

Duplicate metrics file generated by picard

*.{metrics.txt}

versions

versions.yml:file

File containing software versions

versions.yml

Tools

picard
MIT

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.