Difference between revisions of "MMseqs2"
(Created page with "Category:SoftwareCategory:Phylogenetics {|<!--CONFIGURATION: REQUIRED--> |{{#vardefine:app|mmseqs2}} |{{#vardefine:url|https://github.com/soedinglab/MMseqs2}} <!--CONF...") |
|||
Line 5: | Line 5: | ||
<!--CONFIGURATION: OPTIONAL (|1}} means it's ON)--> | <!--CONFIGURATION: OPTIONAL (|1}} means it's ON)--> | ||
|{{#vardefine:conf|}} <!--CONFIGURATION--> | |{{#vardefine:conf|}} <!--CONFIGURATION--> | ||
− | |{{#vardefine:exe|}} <!--ADDITIONAL INFO--> | + | |{{#vardefine:exe|1}} <!--ADDITIONAL INFO--> |
|{{#vardefine:job|}} <!--JOB SCRIPTS--> | |{{#vardefine:job|}} <!--JOB SCRIPTS--> | ||
|{{#vardefine:policy|}} <!--POLICY--> | |{{#vardefine:policy|}} <!--POLICY--> | ||
Line 37: | Line 37: | ||
{{#if: {{#var: exe}}|==Additional Information== | {{#if: {{#var: exe}}|==Additional Information== | ||
− | + | The '''databases''' module in mmseqs provides download of protein, nucleotide, and profile databases. For a list of curated datasets available, load the module and use the following command: | |
+ | $ mmseqs databases | ||
+ | |||
+ | For your convenience, the SILVA nucleotide database is available at the following path: | ||
+ | /data/reference/mmseqs/14/SILVA | ||
|}} | |}} |
Revision as of 17:59, 27 September 2023
Description
MMseqs2 (Many-against-Many sequence searching) is a software suite to search and cluster huge protein and nucleotide sequence sets. MMseqs2 is open source GPL-licensed software implemented in C++ for Linux, MacOS, and (as beta version, via cygwin) Windows. The software is designed to run on multiple cores and servers and exhibits very good scalability. MMseqs2 can run 10000 times faster than BLAST. At 100 times its speed it achieves almost the same sensitivity. It can perform profile searches with the same sensitivity as PSI-BLAST at over 400 times its speed.
Environment Modules
Run module spider mmseqs2
to find out what environment modules are available for this application.
System Variables
- HPC_MMSEQS2_DIR - installation directory
- HPC_MMSEQS2_BIN - executable directory
- HPC_MMSEQS2_DOC - documentation directory
- HPC_MMSEQS2_EXE - example directory
Additional Information
The databases module in mmseqs provides download of protein, nucleotide, and profile databases. For a list of curated datasets available, load the module and use the following command: $ mmseqs databases
For your convenience, the SILVA nucleotide database is available at the following path: /data/reference/mmseqs/14/SILVA
Citation
If you publish research that uses mmseqs2 you have to cite it as follows: