BDG-SeQuila

From UFRC

Description

sequila website  

SeQuiLa is an ANSI-SQL compliant solution for efficient querying and processing of genomic intervals, built on top of Apache Spark. Range joins and depth-of-coverage computations are the bread and butter of NGS analysis, but the high volume of data makes them run very slowly or even fail to complete.

Environment Modules

Run module spider sequila to find out what environment modules are available for this application.

System Variables

  • HPC_SEQUILA_DIR - installation directory
  • HPC_SEQUILA_BIN - executable directory
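
As a minimal sketch of how these variables might be inspected after loading the module: the module name `sequila` is assumed from the `module spider sequila` command above, and `show_sequila_env` is a hypothetical helper written for this example, not part of the installation. The `module load` step is skipped when the `module` command is not available in the current shell.

```shell
# Load the module if the `module` command exists in this shell.
# Assumption: the module is named `sequila` and the default version is wanted.
if command -v module >/dev/null 2>&1; then
    module load sequila
fi

# Hypothetical helper: print each documented variable,
# or a placeholder when the variable is not set.
show_sequila_env() {
    echo "HPC_SEQUILA_DIR=${HPC_SEQUILA_DIR:-<unset>}"
    echo "HPC_SEQUILA_BIN=${HPC_SEQUILA_BIN:-<unset>}"
}

show_sequila_env
```

If both lines print `<unset>`, the module was most likely not loaded in the current session.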




Citation

If you publish research that uses SeQuiLa, cite it as follows:

Marek Wiewiórka, Anna Leśniewska, Agnieszka Szmurło, Kacper Stępień, Mateusz Borowiak, Michał Okoniewski, and Tomasz Gambin. SeQuiLa: an elastic, fast and scalable SQL-oriented solution for processing and querying genomic intervals. Bioinformatics, 2018.