Difference between revisions of "PASTA"

From UFRC
Jump to navigation Jump to search
(Created page with "Category:SoftwareCategory:BiologyCategory:Sequence Alignment {|<!--CONFIGURATION: REQUIRED--> |{{#vardefine:app|pasta}} |{{#vardefine:url|https://github.com/smirar...")
 
m (Changed Cat. name)
Line 1: Line 1:
[[Category:Software]][[Category:Biology]][[Category:Sequence Alignment]]
+
[[Category:Software]][[Category:Biology]][[Category:NGS]]
 
{|<!--CONFIGURATION: REQUIRED-->
 
{|<!--CONFIGURATION: REQUIRED-->
 
|{{#vardefine:app|pasta}}
 
|{{#vardefine:app|pasta}}

Revision as of 18:44, 7 July 2017

Description

pasta website  

PASTA estimates alignments and ML trees from unaligned sequences using an iterative approach. In each iteration, it first estimates a multiple sequence alignment using the current tree as a guide and then estimates a ML tree on (a masked version of) the alignment. By default, PASTA performs 3 iterations, but a host of options enable changing that behavior. In each iteration, a divide-and-conquer strategy is used for estimating the alignment. The set of sequences is divided into smaller subsets, each of which is aligned using an external alignment tool (default is MAFFT). These subset alignments are then pairwise merged (by default using Opal) and finally the pairwise merged alignments are merged into a final alignment using a transitivity merge technique. The division of the dataset into smaller subsets and selecting which alignments should be pairwise merged is guided by the tree from the previous iteration. The first step therefore needs an initial tree.

Required Modules

Serial

  • gcc/5.2.0`
  • pasta

System Variables

  • HPC_{{#uppercase:pasta}}_DIR - installation directory