Revision as of 21:19, 6 December 2019

Description

ANNOVAR is an efficient software tool to utilize update-to-date information to functionally annotate genetic variants detected from diverse genomes (including human genome hg18, hg19, as well as mouse, worm, fly, yeast and many others). Given a list of variants with chromosome, start position, end position, reference nucleotide and observed nucleotides, ANNOVAR can perform:

Gene-based annotation: identify whether SNPs or CNVs cause protein coding changes and the amino acids that are affected. Users can flexibly use RefSeq genes, UCSC genes, ENSEMBL genes, GENCODE genes, or many other gene definition systems.
Region-based annotations: identify variants in specific genomic regions, for example, conserved regions among 44 species, predicted transcription factor binding sites, segmental duplication regions, GWAS hits, database of genomic variants, DNAse I hypersensitivity sites, ENCODE H3K4Me1/H3K4Me3/H3K27Ac/CTCF sites, ChIP-Seq peaks, RNA-Seq peaks, or many other annotations on genomic intervals.
Filter-based annotation: identify variants that are reported in dbSNP, or identify the subset of common SNPs (MAF>1%) in the 1000 Genome Project, or identify subset of non-synonymous SNPs with SIFT score>0.05, or many other annotations on specific mutations.
Other functionalities: Retrieve the nucleotide sequence in any user-specific genomic positions in batch, identify a candidate gene list for Mendelian diseases from exome data, identify a list of SNPs from 1000 Genomes that are in strong LD with a GWAS hit, and many other creative utilities.

SUMMARIZE_ANNOVAR is a script within the ANNOVAR package that is very popular among users. Given a list of variants from whole-exome or whole-genome sequencing, it will generate an Excel-compatible file with gene annotation, amino acid change annotation, SIFT scores, PolyPhen scores, LRT scores, MutationTaster scores, PhyloP conservation scores, GERP++ conservation scores, dbSNP identifiers, 1000 Genomes Project allele frequencies, NHLBI-ESP 5400 exome project allele frequencies and other information.

Required Modules

modules documentation

Serial

annovar

System Variables

HPC_ANNOVAR_DIR - installation directory

Citation

If you publish research that uses annovar you have to cite it as follows:

Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from next-generation sequencing data, Nucleic Acids Research, 38:e164, 2010

Validation

Validated 4/5/2018

@@ Line 2: / Line 2: @@
 __NOEDITSECTION__
 [[Category:Software]][[Category:Bioinformatics]][[Category:Genomics]]
-<!-- ########  Template Configuration ######## -->
+{|<!--Main settings - REQUIRED-->
-<!--Edit definitions of the variables used in template calls
-Required variables:
-app - lowercase name of the application e.g. "amber"
-url - url of the software page (project, company product, etc) - e.g. "http://ambermd.org/"
-Optional variables:
-INTEL - Version of the Intel Compiler e.g. "11.1"
-MPI - MPI Implementation and version e.g. "openmpi/1.3.4"
--->
-{|
-<!--Main settings - REQUIRED-->
 |{{#vardefine:app|annovar}}
 |{{#vardefine:url|http://www.openbioinformatics.org/annovar/}}
-<!--Compiler and MPI settings - OPTIONAL -->
-|{{#vardefine:intel|}} <!-- E.g. "11.1" -->
-|{{#vardefine:mpi|}} <!-- E.g. "openmpi/1.3.4" -->
-<!--Choose sections to enable - OPTIONAL-->
-|{{#vardefine:mod|1}} <!--Present instructions for running the software with modules -->
 |{{#vardefine:exe|}} <!--Present manual instructions for running the software -->
 |{{#vardefine:conf|}} <!--Enable config wiki page link - {{#vardefine:conf|1}} = ON/conf|}} = OFF-->
@@ Line 26: / Line 11: @@
 |{{#vardefine:testing|}} <!--Enable performance testing/profiling section -->
 |{{#vardefine:faq|}} <!--Enable FAQ section -->
-|{{#vardefine:citation|}} <!--Enable Reference/Citation section -->
+|{{#vardefine:citation|1}} <!--Enable Reference/Citation section -->
 |}
 <!-- ########  Template Body ######## -->
@@ Line 42: / Line 27: @@
 SUMMARIZE_ANNOVAR is a script within the ANNOVAR package that is very popular among users. Given a list of variants from whole-exome or whole-genome sequencing, it will generate an Excel-compatible file with gene annotation, amino acid change annotation, SIFT scores, PolyPhen scores, LRT scores, MutationTaster scores, PhyloP conservation scores, GERP++ conservation scores, dbSNP identifiers, 1000 Genomes Project allele frequencies, NHLBI-ESP 5400 exome project allele frequencies and other information.
-==Available versions==
+<!--Modules-->
-* 20120308
+==Required Modules==
-<!-- -->
+[[Modules|modules documentation]]
-{{#if: {{#var: mod}}|==Running the application using modules==
+===Serial===
-{{App_Module|app={{#var:app}}|intel={{#var:intel}}|mpi={{#var:mpi}}}}|}}
+*{{#var:app}}
-{{#if: {{#var: exe}}|==How To Run==
+==System Variables==
+* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
+<!--Additional-->
+{{#if: {{#var: exe}}|==Additional Information==
 WRITE INSTRUCTIONS ON RUNNING THE ACTUAL BINARY|}}
 {{#if: {{#var: conf}}|==Configuration==
@@ Line 53: / Line 41: @@
 {{#if: {{#var: pbs}}|==PBS Script Examples==
 See the [[{{PAGENAME}}_PBS]] page for {{#var: app}} PBS script examples.|}}
-{{#if: {{#var: policy}}|==Usage policy==
+{{#if: {{#var: policy}}|==Usage Policy==
 WRITE USAGE POLICY HERE (perhaps templates for a couple of main licensing schemes can be used)|}}
 {{#if: {{#var: testing}}|==Performance==
@@ Line 60: / Line 48: @@
 *'''Q:''' **'''A:'''|}}
 {{#if: {{#var: citation}}|==Citation==
-If you publish research that uses {{{app}}} you have to cite it as follows:
+If you publish research that uses {{#var:app}} you have to cite it as follows:
-WRITE CITATION HERE
+Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from next-generation sequencing data, ''Nucleic Acids Research'',''' 38:e164''', 2010
 |}}
+=Validation=
+* Validated 4/5/2018

Difference between revisions of "ANNOVAR"