Coverage statistics for a BED file.
This program is now part of the main jvarkit
tool. See jvarkit for compiling.
Usage: java -jar dist/jvarkit.jar bamstats04 [options] Files
Usage: bamstats04 [options] Files
Options:
* -B, --bed
A source of intervals. The following suffixes are recognized: vcf,
vcf.gz bed, bed.gz, gtf, gff, gff.gz, gtf.gz.Otherwise it could be an
empty string (no interval) or a list of plain interval separated by '[
\t\n;,]'
-cov, --cov
add this min coverage value to ask wether the position is not covered.
Use with care: any depth below this treshold will be trimmed to zero.
Default: []
-f, --filter, --jexl
A JEXL Expression that will be used to filter out some sam-records (see
https://software.broadinstitute.org/gatk/documentation/article.php?id=1255).
An expression should return a boolean value (true=exclude, false=keep
the read). An empty expression keeps everything. The variable 'record'
is the current observed read, an instance of SAMRecord (https://samtools.github.io/htsjdk/javadoc/htsjdk/htsjdk/samtools/SAMRecord.html).
Default: record.getMappingQuality()<1 || record.getDuplicateReadFlag() || record.getReadFailsVendorQualityCheckFlag() || record.isSecondaryOrSupplementary()
-h, --help
print help and exit
--helpFormat
What kind of help. One of [usage,markdown,xml].
-o, --output
Output file. Optional . Default: stdout
-partition, --partition
[20171120]Data partitioning using the SAM Read Group (see
https://gatkforums.broadinstitute.org/gatk/discussion/6472/ ) . It can
be any combination of sample, library....
Default: sample
Possible Values: [readgroup, sample, library, platform, center, sample_by_platform, sample_by_center, sample_by_platform_by_center, any]
-R, --ref
[20180126]If set, a column with the GC% will be added. Also used to read
CRAM. Indexed fasta Reference file. This file must be indexed with
samtools faidx and with picard/gatk CreateSequenceDictionary or samtools
dict
--version
print version and exit
20130513
The project is licensed under the MIT license.
Should you cite bamstats04 ? https://github.com/mr-c/shouldacite/blob/master/should-I-cite-this-software.md
The current reference is:
http://dx.doi.org/10.6084/m9.figshare.1425030
Lindenbaum, Pierre (2015): JVarkit: java-based utilities for Bioinformatics. figshare. http://dx.doi.org/10.6084/m9.figshare.1425030
Input is one or more indexed BAM files.
$ java -jar dist/bamstats04.jar -B src/test/resources/toy.bed.gz src/test/resources/toy.bam 2> /dev/null | column -t
#chrom start end length sample mincov maxcov meancov mediancov nocoveragebp percentcovered
ref 10 13 3 S1 3 3 3.0 3.0 0 100
ref2 1 2 1 S1 2 2 2.0 2.0 0 100
ref2 13 14 1 S1 6 6 6.0 6.0 0 100
ref2 16 17 1 S1 6 6 6.0 6.0 0 100