Difference between revisions of "Genome STRiP"

From UFRC
Jump to navigation Jump to search
 
(3 intermediate revisions by one other user not shown)
Line 27: Line 27:
  
 
<!--Modules-->
 
<!--Modules-->
==Required Modules==
+
==Environment Modules==
 
+
Run <code>module spider {{#var:app}}</code> to find out what environment modules are available for this application.
===Serial===
 
* {{#var:app}}
 
<!--
 
===Parallel (OpenMP)===
 
* intel
 
* {{#var:app}}
 
===Parallel (MPI)===
 
* intel
 
* openmpi
 
* {{#var:app}}
 
-->
 
 
==System Variables==
 
==System Variables==
* HPC_{{#uppercase:{{#var:app}}}}_DIR - installation directory
+
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
 
<!--Configuration-->
 
<!--Configuration-->
 
{{#if: {{#var: conf}}|==Configuration==
 
{{#if: {{#var: conf}}|==Configuration==
Line 49: Line 38:
 
{{#if: {{#var: exe}}|==Additional Information==
 
{{#if: {{#var: exe}}|==Additional Information==
  
Genome STRiP meta data is located in /scratch/lfs/bio/reference/genomestrip
+
Genome STRiP meta data is located in /data/reference/genomestrip directory tree.
  
 
* HPC_GENOMESTRIP_META - metadata directory
 
* HPC_GENOMESTRIP_META - metadata directory
Line 88: Line 77:
 
<!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
 
<!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
 
__NOTOC____NOEDITSECTION__
 
__NOTOC____NOEDITSECTION__
=Validation=
 
* Validate 4/5/2018
 

Latest revision as of 19:28, 10 June 2022

Description

genomestrip website  

Genome STRiP (Genome STRucture In Populations) is a suite of tools for discovering and genotyping structural variations using sequencing data. The methods are designed to detect shared variation using data from multiple individuals.

Genome STRiP looks both across and within a set of sequenced genomes to detect variation. The methods are adaptive and support heterogeneous data sets, including variations in sequencing depth, read lengths and mixtures of paired and single-end reads. A minimum of 20 to 30 genomes are required to get acceptable results, but the method gains power across genomes and processing more genomes provide better results.

To run discovery or genotyping on a single sequenced genome or a small set of genomes, you need to call your data against a background population, such as a set of genomes from the 1000 Genomes Project. The background population does not need to be matched to the target individuals.

Genome STRiP can be used for discovery of novel structural variations or to genotype known variants in new samples.

Environment Modules

Run module spider genomestrip to find out what environment modules are available for this application.

System Variables

  • HPC_GENOMESTRIP_DIR - installation directory

Additional Information

Genome STRiP meta data is located in /data/reference/genomestrip directory tree.

  • HPC_GENOMESTRIP_META - metadata directory



Citation

If you publish research that uses genomestrip you have to cite it as follows:

Handsaker RE, Van Doren V, Berman JR, Genovese G, Kashin S, Boettger LM, McCarroll SA Large multiallelic copy number variations in humans Nature Genetics 47, 296-303 (2015) PMID: 25621458