Vcftools

From UFRC
Revision as of 20:55, 12 August 2022 by Israel.herrera (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Description

vcftools website  

VCFtools is a program package designed for working with VCF files, such as those generated by the 1000 Genomes Project. The aim of VCFtools is to provide methods for working with VCF files: validating, merging, comparing and calculate some basic population genetic statistics. Upstream documentation for vcftools.

Environment Modules

Run module spider vcftools to find out what environment modules are available for this application.

System Variables

  • HPC_VCFTOOLS_DIR - installation directory
  • HPC_VCFTOOLS_BIN - executable directory

How To Run

By default vcftools uses /tmp for temporary file storage. As explained in vcftools documentation you should use the --temp argument to point vcftools to a directory in your /blue space instead since HiPerGator2 nodes are diskless and the /tmp directory on them is very small. For example, the following can be used in a job script:

mkdir -p tmp
export TMPDIR="$(pwd)/tmp"
vcftools ... --temp ${TMPDIR}



Citation

If you publish the research that uses this software please cite The Variant Call Format and VCFtools, Petr Danecek, Adam Auton, Goncalo Abecasis, Cornelis A. Albers, Eric Banks, Mark A. DePristo, Robert Handsaker, Gerton Lunter, Gabor Marth, Stephen T. Sherry, Gilean McVean, Richard Durbin and 1000 Genomes Project Analysis Group, Bioinformatics, 2011.

http://dx.doi.org/10.1093/bioinformatics/btr330