Difference between revisions of "TransDecoder"
Moskalenko (talk | contribs) m (Text replacement - "#uppercase" to "uc") |
|||
Line 40: | Line 40: | ||
<!--Modules--> | <!--Modules--> | ||
− | == | + | ==Environment Modules== |
− | + | Run <code>module spider {{#var:app}}</code> to find out what environment modules are available for this application. | |
− | |||
− | < | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
==System Variables== | ==System Variables== | ||
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory | * HPC_{{uc:{{#var:app}}}}_DIR - installation directory | ||
Line 95: | Line 85: | ||
<!--Turn the Table of Contents and Edit paragraph links ON/OFF--> | <!--Turn the Table of Contents and Edit paragraph links ON/OFF--> | ||
__NOTOC____NOEDITSECTION__ | __NOTOC____NOEDITSECTION__ | ||
− | |||
− |
Revision as of 13:39, 13 June 2022
Description
TransDecoder identifies candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to the genome using Tophat and Cufflinks.
TransDecoder identifies likely coding sequences based on the following criteria:
A minimum length open reading frame (ORF) is found in a transcript sequence
A log-likelihood score similar to what is computed by the GeneID software is > 0.
The above coding score is greatest when the ORF is scored in the 1st reading frame as compared to scores in the other 5 reading frames.
If a candidate ORF is found fully encapsulated by the coordinates of another candidate ORF, the longer one is reported. However, a single transcript can report multiple ORFs (allowing for operons, chimeras, etc).
optional the putative peptide has a match to a Pfam domain above the noise cutoff score.
Environment Modules
Run module spider transdecoder
to find out what environment modules are available for this application.
System Variables
- HPC_TRANSDECODER_DIR - installation directory