SeQuiLa is an ANSI-SQL compliant solution for efficient genomic intervals querying and processing built on top of Apache Spark. Range joins and depth of coverage computations are bread and butter for NGS analysis but the high volume of data make them execute very slowly or even failing to compute.
module spider sequila to find out what environment modules are available for this application.
- HPC_SEQUILA_DIR - installation directory
- HPC_SEQUILA_BIN - executable directory
If you publish research that uses sequila you have to cite it as follows:
Marek Wiewiórka, Anna Leśniewska, Agnieszka Szmurło, Kacper Stępień, Mateusz Borowiak, Michał Okoniewski, and Tomasz Gambin. Sequila: an elastic, fast and scalable sql-oriented solution for processing and querying genomic intervals. Bioinformatics, 2018.