Difference between revisions of "GATK"
Revision as of 17:52, 10 August 2012
Description
The GATK is two things. First, it is a structured software library that makes it easy to write efficient analysis tools for next-generation sequencing data. Second, it is a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include depth-of-coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner.
We aim to work well with both samtools and Picard by providing tools that complement those available in the two packages. Our SNP calling pipeline (quality score recalibration -> multiple sequence realignment -> SNP/indel calling) is a particular area of focus, and we have been pushing to make these capabilities as general-purpose and powerful as possible. My group's mandate is to ensure the success of the human medical resequencing projects we've undertaken at the Broad over the next 2-3 years. This involves providing a robust, production-quality development library that underlies tools for common analysis problems (like SNP calling) and enables exploratory research on NGS data. Upstream documentation for gatk.
Required Modules
Serial
- gatk
How To Run
We provide two wrapper scripts, GenomeAnalysisTK and AnalyzeCovariates, that are equivalent, respectively, to running
java -jar $HPC_GATK_DIR/GenomeAnalysisTK.jar
and
java -jar $HPC_GATK_DIR/AnalyzeCovariates.jar
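The relationship between the wrappers and the jar files can be sketched as a dry run that only prints the equivalent java commands. This is an illustration, not the actual wrapper source: the gatk_cmd helper and the /path/to/gatk fallback are hypothetical, it assumes the gatk module is what sets HPC_GATK_DIR, and --help stands in for whatever arguments you would normally pass.

```shell
#!/bin/sh
# Load the gatk module if the module command is available in this shell.
if command -v module >/dev/null 2>&1; then
    module load gatk
fi

# Assumed to be set by the gatk module; the fallback is a placeholder path.
HPC_GATK_DIR="${HPC_GATK_DIR:-/path/to/gatk}"

# Hypothetical helper: print the java command a wrapper call corresponds to.
# $1 is the wrapper name; remaining arguments are forwarded unchanged.
gatk_cmd() {
    printf 'java -jar %s/%s.jar' "$HPC_GATK_DIR" "$1"
    shift
    for arg in "$@"; do
        printf ' %s' "$arg"
    done
    printf '\n'
}

gatk_cmd GenomeAnalysisTK --help
gatk_cmd AnalyzeCovariates --help
```

On a system with the module loaded, running GenomeAnalysisTK --help directly should behave like the first printed command.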