Number of repeated REF bases around POS.
This program is now part of the main jvarkit
tool. See jvarkit for compiling.
Usage: java -jar dist/jvarkit.jar vcfpolyx [options] Files
Usage: vcfpolyx [options] Files
Options:
--bcf-output
If this program writes a VCF to a file, The format is first guessed from
the file suffix. Otherwise, force BCF output. The current supported BCF
version is : 2.1 which is not compatible with bcftools/htslib (last
checked 2019-11-15)
Default: false
--generate-vcf-md5
Generate MD5 checksum for VCF output.
Default: false
-h, --help
print help and exit
--helpFormat
What kind of help. One of [usage,markdown,xml].
-n, --filter, --max-repeats
if number of repeated bases is greater or equal to 'n' set a FILTER =
(tag)
Default: -1
-o, --out
Output file. Optional . Default: stdout
* -R, --reference
Indexed fasta Reference file. This file must be indexed with samtools
faidx and with picard/gatk CreateSequenceDictionary or samtools dict
--skip-filtered
Don't spend some time to calculate the tag if the variant is FILTERed
Default: false
-t, --tag
Tag used in INFO and FILTER columns.
Default: POLYX
--version
print version and exit
20200930
The project is licensed under the MIT license.
Should you cite vcfpolyx ? https://github.com/mr-c/shouldacite/blob/master/should-I-cite-this-software.md
The current reference is:
http://dx.doi.org/10.6084/m9.figshare.1425030
Lindenbaum, Pierre (2015): JVarkit: java-based utilities for Bioinformatics. figshare. http://dx.doi.org/10.6084/m9.figshare.1425030
$ java -jar dist/vcfpolyx.jar -R reference.fa input.vcf
(...)
2 1133956 . A G 2468.84 . POLYX=23
2 1133956 . A AG 3604.25 . POLYX=23
2 2981671 . T G 47.18 . POLYX=24
(...)