Difference between revisions of "Themisto"
Johnbullard (talk | contribs) (Created page with "Category:Software Category:Biology Category:Genomics Category:Alignment {|<!--CONFIGURATION: REQUIRED--> |{{#vardefine:app|themisto}} |{{#vardefine:url|https:/...") |
|||
(One intermediate revision by one other user not shown) | |||
Line 1: | Line 1: | ||
[[Category:Software]] | [[Category:Software]] | ||
− | |||
[[Category:Genomics]] | [[Category:Genomics]] | ||
[[Category:Alignment]] | [[Category:Alignment]] | ||
Line 13: | Line 12: | ||
|{{#vardefine:testing|}} <!--PROFILING--> | |{{#vardefine:testing|}} <!--PROFILING--> | ||
|{{#vardefine:faq|}} <!--FAQ--> | |{{#vardefine:faq|}} <!--FAQ--> | ||
− | |{{#vardefine:citation|}} <!--CITATION--> | + | |{{#vardefine:citation|1}} <!--CITATION--> |
|{{#vardefine:installation|}} <!--INSTALLATION--> | |{{#vardefine:installation|}} <!--INSTALLATION--> | ||
|} | |} | ||
Line 64: | Line 63: | ||
If you publish research that uses {{#var:app}} you have to cite it as follows: | If you publish research that uses {{#var:app}} you have to cite it as follows: | ||
− | + | Tommi Mäklin, Teemu Kallonen, Jarno Alanko, Veli Mäkinen, Jukka Corander, Antti Honkela. Genomic Epidemiology with Mixed Samples. Supplement: Pseudoalignment in the mGEMS pipeline. | |
|}} | |}} |
Latest revision as of 20:57, 3 June 2022
Description
A metanenomic sample is a set of sequences of reads from microbial life living in a particular environment. Standard analysis involves estimating the species composition of the environment by aligning the reads against a reference database. Since the age of pangenomics, alignment is preferentially done against a variation graph encompassing all variation within a species.
Themisto is a space-efficient tool for indexing such variation graphs. The Themisto index is a compressed colored de-bruijn graph of order k, where each node has a set of colors representing the reference sequences that contain the k-mer corresponding to the node. Reads are pseudoaligned to the index using a method similar to the one used by the tool Kallisto: all k-mers of the read are located in the de-bruijn graph and the intersection of the color sets of the nodes is returned.
Environment Modules
Run module spider themisto
to find out what environment modules are available for this application.
System Variables
- HPC_THEMISTO_DIR - installation directory
- HPC_THEMISTO_BIN - executable directory
Citation
If you publish research that uses themisto you have to cite it as follows:
Tommi Mäklin, Teemu Kallonen, Jarno Alanko, Veli Mäkinen, Jukka Corander, Antti Honkela. Genomic Epidemiology with Mixed Samples. Supplement: Pseudoalignment in the mGEMS pipeline.