Meraculous

From UFRC

Description

Meraculous website  

Meraculous is a whole-genome assembler for next-generation sequencing (NGS) data, geared toward large genomes. It is a hybrid k-mer/read-based assembler that capitalizes on the high accuracy of Illumina sequence data by omitting an explicit error-correction step, which its authors argue is redundant with the assembly process. Meraculous achieves high performance on large datasets by using lightweight data structures and multi-threaded parallelization, making it possible to assemble human-sized genomes on commodity clusters in under a day. The pipeline implements a transparent, portable model of job control and monitoring in which assembly stages can be executed and re-executed separately or together on a wide variety of architectures.
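For readers unfamiliar with the term, a k-mer is a substring of length k taken from a read. The sketch below is only an illustration of the general idea of k-mer counting and of how rarely seen k-mers (often sequencing errors) can be set aside without a separate error-correction pass. It is not Meraculous's actual implementation; the function names `count_kmers` and `solid_kmers` and the `min_count` threshold are assumptions made for this example.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every length-k substring (k-mer) across a list of reads."""
    counts = Counter()
    for read in reads:
        # A read of length n contains n - k + 1 overlapping k-mers.
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def solid_kmers(counts, min_count):
    """Keep only k-mers seen at least min_count times (hypothetical cutoff)."""
    return {kmer for kmer, n in counts.items() if n >= min_count}


# Toy input: the third read differs from the others in its last base.
reads = ["ACGTA", "ACGTA", "ACGTT"]
counts = count_kmers(reads, k=4)
kept = solid_kmers(counts, min_count=2)
```

In this toy input the k-mer that occurs only once is dropped by the cutoff, while k-mers supported by more than one read are kept.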

Required Modules

Serial

  • Meraculous

System Variables

  • HPC_MERACULOUS_DIR - installation directory
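As a minimal sketch of how a job script might use this variable, the Python snippet below reads HPC_MERACULOUS_DIR from the environment and fails with a clear message if it is missing. It assumes the Meraculous module has already been loaded in the current shell; the helper name `meraculous_dir` is hypothetical, and nothing is assumed about the layout of the installation directory.

```python
import os
from pathlib import Path


def meraculous_dir(env=None):
    """Return the Meraculous installation directory as a Path.

    Reads HPC_MERACULOUS_DIR from the given mapping (defaults to os.environ).
    Raises RuntimeError if the variable is unset or empty, which usually
    means the module has not been loaded.
    """
    env = os.environ if env is None else env
    value = env.get("HPC_MERACULOUS_DIR")
    if not value:
        raise RuntimeError(
            "HPC_MERACULOUS_DIR is not set; load the Meraculous module first"
        )
    return Path(value)
```

Calling `meraculous_dir()` with no arguments uses the real environment; passing a dictionary makes the helper easy to test without loading the module.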




Citation

If you publish research that uses Meraculous, cite it as follows:

Chapman et al., 2014