Difference between revisions of "PAUDA"

From UFRC
Jump to navigation Jump to search
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[Category:Software]][[Category:Biology]][[Category:Bioinformatics]][[Category:Sequence Alignment]]
+
[[Category:Software]][[Category:Biology]][[Category:NGS]][[Category:Phylogenetics]]
 
{|<!--CONFIGURATION: REQUIRED-->
 
{|<!--CONFIGURATION: REQUIRED-->
 
|{{#vardefine:app|PAUDA}}
 
|{{#vardefine:app|PAUDA}}
Line 21: Line 21:
  
 
<!--Modules-->
 
<!--Modules-->
==Required Modules==
+
==Environment Modules==
 
+
Run <code>module spider {{#var:app}}</code> to find out what environment modules are available for this application.
===Serial===
 
* {{#var:app}}
 
===Parallel (OpenMP)===
 
* intel
 
* {{#var:app}}
 
===Parallel (MPI)===
 
* intel
 
* openmpi
 
* {{#var:app}}
 
 
 
 
==System Variables==
 
==System Variables==
 
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
 
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
Line 76: Line 66:
 
<!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
 
<!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
 
__NOTOC____NOEDITSECTION__
 
__NOTOC____NOEDITSECTION__
=Validation=
 
* Validate 4/5/2018
 

Latest revision as of 17:03, 21 August 2022

Description

PAUDA website  

PAUDA is a new approach toward the problem of comparing DNA reads against a database of protein reference sequences that is applicable to very large datasets consisting of hundreds of millions or billions of reads. PAUDA is an acronym for "Protein Alignment Using a DNA Aligner". The approach allows one to harness the high efficiency of DNA read aligners to compute BLASTX-like alignments between sequencing reads and a protein database in a small fraction of the time required by BLASTX. The PAUDA approach makes it possible to process DNA reads at a rate of millions of reads per CPU hour. PAUDA is 10,000 times faster than BLASTX.

Environment Modules

Run module spider PAUDA to find out what environment modules are available for this application.

System Variables

  • HPC_PAUDA_DIR - installation directory




Citation

If you publish research that uses PAUDA you have to cite it as follows:

Daniel H. Huson and Chao Xie, A poor man’s BLASTX - high-throughput metagenomic protein database search using PAUDA, submitted to HitSeq (2013).