Difference between revisions of "SEQuel"
Moskalenko (talk | contribs) |
Moskalenko (talk | contribs) m (Text replacement - "#uppercase" to "uc") |
Revision as of 21:24, 6 December 2019
Description
SEQuel is a tool for correcting errors (i.e., insertions, deletions, and substitutions) in contigs output from assembly. While assemblies of next generation sequencing (NGS) data are accurate, they still contain a substantial number of errors that need to be corrected after the assembly process. The algorithm behind SEQuel makes use of a graph structure called the positional de Bruijn graph, which models k-mers within reads while incorporating their approximate positions into the model.
Required Modules
Serial
- sequel
System Variables
- HPC_SEQUEL_DIR - installation directory
How To Run
Use the 'sequel' script we provide instead of the full 'java -jar...' command.
If needed, increase the memory available to Java by setting the '-Xmx' option in the _JAVA_OPTIONS environment variable, either in your job script or in the shell. For example, set
export _JAVA_OPTIONS="-Xmx6g ${_JAVA_OPTIONS}"
before running something like
sequel -c sample.fasta -ap reads_aln.sam -i 404
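The two steps above can be combined in a job script. The following is a minimal sketch, not an official template: the helper name set_java_heap is hypothetical, and the script assumes the sequel module has already been loaded so that the 'sequel' wrapper is on the PATH. If the wrapper is not found, the script only reports that and does not run anything.

```shell
#!/bin/bash
# Hypothetical helper (not part of SEQuel): prepend a maximum-heap setting
# to _JAVA_OPTIONS while keeping any options that are already set.
set_java_heap() {
    local heap="$1"   # heap size, e.g. 6g
    export _JAVA_OPTIONS="-Xmx${heap}${_JAVA_OPTIONS:+ ${_JAVA_OPTIONS}}"
}

set_java_heap 6g
echo "_JAVA_OPTIONS is now: ${_JAVA_OPTIONS}"

# Run SEQuel only if the wrapper script is available (assumes the sequel
# module was loaded earlier in the job script or in the shell).
if command -v sequel >/dev/null 2>&1; then
    sequel -c sample.fasta -ap reads_aln.sam -i 404
else
    echo "sequel not found; load the sequel module first" >&2
fi
```

The ${_JAVA_OPTIONS:+ ...} expansion adds a separating space only when the variable already has a value, so the setting does not end with a stray trailing space when the variable starts out unset.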