Difference between revisions of "SEQuel"

From UFRC
Jump to navigation Jump to search
m (Text replacement - "#uppercase" to "uc")
Line 32: Line 32:
 
<!--Add additional HPC_FOO_BIN and other ENV VARIABLES below-->
 
<!--Add additional HPC_FOO_BIN and other ENV VARIABLES below-->
 
==System Variables==
 
==System Variables==
* HPC_{{#uppercase:{{#var:app}}}}_DIR - installation directory
+
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
 
<!--Run-->
 
<!--Run-->
  

Revision as of 21:24, 6 December 2019

Description

sequel website  


SEQuel is a tool for correcting errors (i.e., insertions, deletions, and substitutions) in contigs output from assembly. While assemblies of next generation sequencing (NGS) data are accurate, they still contain a substantial number of errors that need to be corrected after the assembly process. The algorithm behind SEQuel makes use of a graph structure called the positional de Bruijn graph, which models k-mers within reads while incorporating their approximate positions into the model.

Required Modules

modules documentation

Serial

  • sequel

System Variables

  • HPC_SEQUEL_DIR - installation directory

How To Run

Use the 'sequel' script we provide instead of the full 'java -jar...' command.

If needed increase the available memory by setting 'Xmx' in the Java environment variable in your job script or in the shell. E.g. set

export _JAVA_OPTIONS="-Xmx6g ${_JAVA_OPTIONS}"

before running something like

sequel -c sample.fasta -ap reads_aln.sam -i 404