Difference between revisions of "HaMStR"

From UFRC
Jump to navigation Jump to search
Line 20: Line 20:
 
HaMSTR is a Hidden Markov Model based search tool to screen EST sequence data for the presence of putative orthologs to a pre-defined set of genes. A-priori knowledge about orthology-relationships among genes in different sets of reference species is extracted from the [http://inparanoid.sbc.su.se/ Inparanoid-Database]. Ortholog groups of genes are then formed from the pairwise Inparanoid ortholog assignments applying the criterion of [http://www.deep-phylogeny.org/hamstr/images/tc.png transitive closure]. Based on these ortholog groups, hidden markov models are generated using the [[HMMER3|HMMER package]], each representative of a single cluster of orthologous genes on the chosen set of reference taxa (Details). For each HMM, [http://www.clccell.com/ hmmersearch_cell] is used to scan the translated EST-compilation for significant hits. Putative orthologs are then identified by re-blasting the candidate ESTs against a reference proteome. EST-sequences with the reference-protein as best BLAST hit that was used for the HMM-geneneration are returned as operational orthologs. Click [http://www.deep-phylogeny.org/hamstr/images/hamstr.png here] for details on the workflow.
 
HaMSTR is a Hidden Markov Model based search tool to screen EST sequence data for the presence of putative orthologs to a pre-defined set of genes. A-priori knowledge about orthology-relationships among genes in different sets of reference species is extracted from the [http://inparanoid.sbc.su.se/ Inparanoid-Database]. Ortholog groups of genes are then formed from the pairwise Inparanoid ortholog assignments applying the criterion of [http://www.deep-phylogeny.org/hamstr/images/tc.png transitive closure]. Based on these ortholog groups, hidden markov models are generated using the [[HMMER3|HMMER package]], each representative of a single cluster of orthologous genes on the chosen set of reference taxa (Details). For each HMM, [http://www.clccell.com/ hmmersearch_cell] is used to scan the translated EST-compilation for significant hits. Putative orthologs are then identified by re-blasting the candidate ESTs against a reference proteome. EST-sequences with the reference-protein as best BLAST hit that was used for the HMM-geneneration are returned as operational orthologs. Click [http://www.deep-phylogeny.org/hamstr/images/hamstr.png here] for details on the workflow.
 
<!--Modules-->
 
<!--Modules-->
==Required Modules==
+
==Environment Modules==
[[Modules|modules documentation]]
+
Run <code>module spider {{#var:app}}</code> to find out what environment modules are available for this application.
===Serial===
 
* {{#var:app}}
 
<!--
 
===Parallel (OpenMP)===
 
* intel
 
* {{#var:app}}
 
===Parallel (MPI)===
 
* intel
 
* openmpi
 
* {{#var:app}}
 
-->
 
 
==System Variables==
 
==System Variables==
 
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
 
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
Line 70: Line 59:
 
<!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
 
<!--Turn the Table of Contents and Edit paragraph links ON/OFF-->
 
__NOTOC____NOEDITSECTION__
 
__NOTOC____NOEDITSECTION__
=Validation=
 
* Validated 4/5/2018
 

Revision as of 19:11, 10 June 2022

Description

hamstr website  

HaMSTR is a Hidden Markov Model based search tool to screen EST sequence data for the presence of putative orthologs to a pre-defined set of genes. A-priori knowledge about orthology-relationships among genes in different sets of reference species is extracted from the Inparanoid-Database. Ortholog groups of genes are then formed from the pairwise Inparanoid ortholog assignments applying the criterion of transitive closure. Based on these ortholog groups, hidden markov models are generated using the HMMER package, each representative of a single cluster of orthologous genes on the chosen set of reference taxa (Details). For each HMM, hmmersearch_cell is used to scan the translated EST-compilation for significant hits. Putative orthologs are then identified by re-blasting the candidate ESTs against a reference proteome. EST-sequences with the reference-protein as best BLAST hit that was used for the HMM-geneneration are returned as operational orthologs. Click here for details on the workflow.

Environment Modules

Run module spider hamstr to find out what environment modules are available for this application.

System Variables

  • HPC_HAMSTR_DIR - installation directory
  • HPC_HAMSTR_BIN - executable directory
  • HPC_HAMSTR_REF - reference datasets directory