Difference between revisions of "FALCON"

From UFRC
(Created page with "Category:SoftwareCategory:BioinformaticsCategory:NGS {|<!--CONFIGURATION: REQUIRED--> |{{#vardefine:app|falcon}} |{{#vardefine:url|https://github.com/PacificBiosci...")
 
 
(7 intermediate revisions by 3 users not shown)
Line 1: Line 1:
[[Category:Software]][[Category:Bioinformatics]][[Category:NGS]]
+
[[Category:Software]][[Category:Biology]][[Category:NGS]][[Category:Sequencing]]
 
{|<!--CONFIGURATION: REQUIRED-->
 
{|<!--CONFIGURATION: REQUIRED-->
 
|{{#vardefine:app|falcon}}
 
|{{#vardefine:app|falcon}}
Line 5: Line 5:
 
<!--CONFIGURATION: OPTIONAL (|1}} means it's ON)-->
 
<!--CONFIGURATION: OPTIONAL (|1}} means it's ON)-->
 
|{{#vardefine:conf|}}          <!--CONFIGURATION-->
 
|{{#vardefine:conf|}}          <!--CONFIGURATION-->
|{{#vardefine:exe|}}            <!--ADDITIONAL INFO-->
+
|{{#vardefine:exe|1}}            <!--ADDITIONAL INFO-->
 
|{{#vardefine:pbs|}}            <!--PBS SCRIPTS-->
 
|{{#vardefine:pbs|}}            <!--PBS SCRIPTS-->
 
|{{#vardefine:policy|}}        <!--POLICY-->
 
|{{#vardefine:policy|}}        <!--POLICY-->
Line 21: Line 21:
  
 
The Falcon tool kit is a set of simple code collection which I use for studying efficient assembly algorithm for haploid and diploid genomes. It has some back-end code implemented in C for speed and some simple front-end written in Python for convenience.
 
The Falcon tool kit is a set of simple code collection which I use for studying efficient assembly algorithm for haploid and diploid genomes. It has some back-end code implemented in C for speed and some simple front-end written in Python for convenience.
 +
 +
We use FALCON-integrate in conjunction with SLURM to run FALCON on HiPerGator. See the [[FALCON_fc_run_cfg|sample SLURM fc_run.cfg]] for an example. For a larger run consider modifying DAligner options to set a memory limit on kmer search with -M. See [[FALCON_fc_large_run_cfg|sample SLURM fc_large_run.cfg]] for an example.
 +
  
 
<!--Modules-->
 
<!--Modules-->
==Required Modules==
+
==Environment Modules==
===Serial/Parallel===
+
Run <code>module spider {{#var:app}}</code> to find out what environment modules are available for this application.
* {{#var:app}}
 
<!--
 
===Parallel (MPI)===
 
* intel
 
* openmpi
 
* {{#var:app}}
 
-->
 
 
==System Variables==
 
==System Variables==
* HPC_{{#uppercase:{{#var:app}}}}_DIR - installation directory
+
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
 
<!--Configuration-->
 
<!--Configuration-->
 
{{#if: {{#var: conf}}|==Configuration==
 
{{#if: {{#var: conf}}|==Configuration==
Line 40: Line 36:
 
<!--Run-->
 
<!--Run-->
 
{{#if: {{#var: exe}}|==Additional Information==
 
{{#if: {{#var: exe}}|==Additional Information==
 +
=Errors and Pitfalls=
 +
* Make sure that the FASTA headers are in a format acceptable to FALCON. If you encounter the Fasta2DB error '<code>Line 1: Pacbio header line format error</code>', see [https://github.com/PacificBiosciences/FALCON/issues/251 FALCON issue #251] for an explanation.
 +
  
WRITE_ADDITIONAL_INSTRUCTIONS_ON_RUNNING_THE_SOFTWARE_IF_NECESSARY
+
* [https://github.com/pb-sliang/TAP TAP] is a part of FALCON module.
  
 
|}}
 
|}}

Latest revision as of 15:19, 15 August 2022

Description

falcon website  

Falcon: a set of tools for fast aligning long reads for consensus and assembly

The Falcon toolkit is a collection of simple code that I use for studying efficient assembly algorithms for haploid and diploid genomes. It has back-end code implemented in C for speed and a simple front end written in Python for convenience.

We use FALCON-integrate in conjunction with SLURM to run FALCON on HiPerGator. See the sample SLURM fc_run.cfg for an example. For a larger run, consider modifying the DAligner options to set a memory limit on the k-mer search with -M. See the sample SLURM fc_large_run.cfg for an example.
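
Before submitting a larger run, it can help to confirm that the DAligner option lines in the configuration file actually carry a -M limit. The following is a minimal sketch, not part of FALCON. The key names pa_HPCdaligner_option and ovlp_HPCdaligner_option are assumptions based on commonly circulated FALCON example configurations and may differ between versions, and the function name check_daligner_mem_limit is hypothetical.

```shell
# Sketch: report whether each DAligner option line in an fc_run.cfg-style file
# carries a -M memory limit. The two key names are assumptions taken from
# commonly circulated FALCON example configs, not from the UFRC sample files.

check_daligner_mem_limit() {
    local cfg="$1"
    local key line
    for key in pa_HPCdaligner_option ovlp_HPCdaligner_option; do
        # Take the first line that assigns this key, if there is one.
        line=$(grep -E "^[[:space:]]*${key}[[:space:]]*=" "$cfg" | head -n 1)
        if [ -z "$line" ]; then
            echo "${key}: not set"
        elif printf '%s\n' "$line" | grep -Eq -- '[[:space:]=]-M[0-9]+'; then
            echo "${key}: memory limit set"
        else
            echo "${key}: no -M limit"
        fi
    done
}
```

Calling check_daligner_mem_limit fc_large_run.cfg prints one status line per key, so a missing limit is visible before the job is queued.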


Environment Modules

Run module spider falcon to find out what environment modules are available for this application.

System Variables

  • HPC_FALCON_DIR - installation directory
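
The module lookup and the installation-directory variable can be combined in a short session like the one below. This is a sketch that assumes the module is named falcon (as in the module spider command above) and that loading it sets HPC_FALCON_DIR. The helper name report_falcon_dir is hypothetical.

```shell
# Sketch of a typical session on a cluster that uses environment modules.
# Assumes the module is named "falcon" and that loading it sets HPC_FALCON_DIR.

report_falcon_dir() {
    # Print the installation directory if the variable is set.
    if [ -n "${HPC_FALCON_DIR:-}" ]; then
        echo "FALCON installation directory: ${HPC_FALCON_DIR}"
    else
        echo "HPC_FALCON_DIR is not set; load the falcon module first"
    fi
}

# Only call "module" where it exists, so the script is harmless elsewhere.
if command -v module >/dev/null 2>&1; then
    module spider falcon   # list the available falcon modules and versions
    module load falcon     # load the default version
fi
report_falcon_dir
```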

Additional Information

Errors and Pitfalls

  • Make sure that the FASTA headers are in a format acceptable to FALCON. If you encounter the Fasta2DB error 'Line 1: Pacbio header line format error', see FALCON issue #251 for an explanation.


  • TAP is part of the FALCON module.
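
The header format error described above can often be caught before running FALCON by scanning the input for headers that do not look like PacBio read names. The following is a minimal sketch. It assumes headers of the form >movie_name/well/start_end, optionally followed by extra fields such as RQ=0.85. That pattern is an assumption and is not taken from this page, so check FALCON issue #251 for the authoritative description. The function name find_bad_headers is hypothetical.

```shell
# Sketch: list FASTA header lines that do not look like PacBio-style headers.
# Assumed pattern (not from this page): >movie_name/well/start_end,
# optionally followed by whitespace and extra fields.

find_bad_headers() {
    # Print "line_number:header" for each header that fails the pattern.
    grep -n '^>' "$1" |
        grep -Ev '^[0-9]+:>[^/[:space:]]+/[0-9]+/[0-9]+_[0-9]+([[:space:]].*)?$'
}
```

Running find_bad_headers reads.fasta prints nothing when every header matches the assumed pattern. Any line it prints points at a header worth fixing before building the database.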