ARC is a pipeline which facilitates iterative, reference guided de novo assemblies with the intent of 1) reducing time in analysis and increasing accuracy of results by only considering those reads which should assemble together and 2) reduce reference bias as compared to mapping based approaches. The software is designed to work in situations where a whole-genome assembly is not the objective, but rather when the researcher wishes to assemble discreet 'targets' contained within next-generation shotgun sequence data. ARC decomplexifies the traditionally difficult problem of assembly by subsetting the reads into small, manageable pieces which can then be assembled quickly and efficiently. Applications include those in which the researcher wishes to de novo assemble specific content and a set of semi-similar reference targets is available to initialize the assembly process.
module spider arc to find out what environment modules are available for this application.
- HPC_ARC_DIR - installation directory
In preparing your ARC_config.txt file, be careful in how things are named. Do not start a non-numerical parameter with a number. ARC checks if the parameter starts with a number, and if it does, assumes the whole parameter is a number and tries to convert to integer.
For example a reference file that is named 5_genes.fasta would throw an error because ARC sees the 5, assumes the parameter is a number and tries to convert "5_genes.fata" into an integer.