Difference between revisions of "Samstat"

From UFRC
(Created page with "Category:SoftwareCategory:BiologyCategory:Bioinformatics {|<!--CONFIGURATION: REQUIRED--> |{{#vardefine:app|samstat}} |{{#vardefine:url|http://samstat.sourceforge....")
 
m (Text replacement - "#uppercase" to "uc")
 
(One intermediate revision by one other user not shown)
Line 44:
  -->
  ==System Variables==
- * HPC_{{#uppercase:{{#var:app}}}}_DIR - installation directory
+ * HPC_{{uc:{{#var:app}}}}_DIR - installation directory
  <!--Configuration-->
  {{#if: {{#var: conf}}|==Configuration==
Line 86:
  <!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
  __NOTOC____NOEDITSECTION__
+ =Validation=
+ * Validated 4/5/2018

Latest revision as of 21:24, 6 December 2019

Description

samstat website  

SAMStat is an efficient C program that quickly displays statistics of large sequence files from next-generation sequencing projects. When applied to SAM/BAM files, all statistics are reported separately for unmapped, poorly mapped, and accurately mapped reads. This helps identify a variety of problems that cause poor mapping, such as remaining linker and adaptor sequences. SAMStat can also be used to verify individual processing steps in large analysis pipelines. It reports nucleotide composition, length distribution, base quality distribution, mapping statistics, mismatch, insertion and deletion error profiles, and di-nucleotide and 10-mer over-representation. The output is a single HTML5 page that can be interpreted by a non-specialist.

Required Modules

Serial

  • samstat
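
As a minimal sketch of how the module listed above might be used, the helper below checks that the samstat command is on the PATH before calling it. The function name run_samstat and the file name sample.bam are hypothetical and used only for illustration; the sketch also assumes the cluster provides the usual `module` command.

```shell
# Hypothetical helper: run samstat on one or more input files, but only
# if the command is on PATH (i.e. the module has been loaded).
run_samstat() {
    if ! command -v samstat >/dev/null 2>&1; then
        echo "samstat not found; run 'module load samstat' first" >&2
        return 1
    fi
    samstat "$@"
}

# Typical session on the cluster (sample.bam is a placeholder file name):
#   module load samstat
#   run_samstat sample.bam
```

The check is there so that a forgotten module load produces a clear message instead of a bare "command not found" error.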

System Variables

  • HPC_SAMSTAT_DIR - installation directory
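
The variable above can be inspected from the shell once the module is loaded. The following is a small sketch, not an official utility; the function name samstat_dir is hypothetical.

```shell
# Hypothetical helper: print the samstat installation directory, or fail
# with a hint if HPC_SAMSTAT_DIR is not set (module not loaded).
samstat_dir() {
    if [ -z "${HPC_SAMSTAT_DIR:-}" ]; then
        echo "HPC_SAMSTAT_DIR is not set; try 'module load samstat'" >&2
        return 1
    fi
    printf '%s\n' "$HPC_SAMSTAT_DIR"
}

# Example (after 'module load samstat'):
#   ls "$(samstat_dir)"
```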




Citation

If you publish research that uses samstat, cite it as follows:

Lassmann et al. (2010) "SAMStat: monitoring biases in next generation sequencing data." Bioinformatics doi:10.1093/bioinformatics/btq614 [PMID: 21088025]


Validation

  • Validated 4/5/2018