Jellyfish

From UFRC
Revision as of 21:22, 6 December 2019 by Moskalenko (talk | contribs) (Text replacement - "#uppercase" to "uc")
Jump to navigation Jump to search

Description

jellyfish website  

k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers quickly by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism.

Jellyfish is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the "jellyfish dump" command.

Required Modules

Serial

  • jellyfish

or

  • gcc/4.7.2 jellyfish

System Variables

  • HPC_JELLYFISH_DIR - installation directory
  • HPC_JELLYFISH_BIN - executable directory
  • HPC_JELLYFISH_BIN - includes directory





Validation

  • Validated 4/5/2018