Sequence assembler for very short reads. Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom.
Velvet currently takes in short read sequences, removes errors then produces high quality unique contigs. It then uses paired-end read and long read information, when available, to retrieve the repeated areas between contigs.
Velvet has been installed on the cluster in several versions:
|These versions are the defailt compile options (max kmer=31, single threaded)|
|These versions are compiled with the default kmer, but using OpenMP--see OpenMP notes below for details|
|These versions are compiled with a max kmer of 99 and OpenMP--see OpenMP notes below for details|
Note that all versions were compiled with icc.
Please file a bugzilla request for additional compile options like color space, different kmers, LONGSEQUENCES, etc.
If you use one of the OpenMP versions of velvet, you must set the enviromental variable OMP_THREAD_LIMIT to the same value as your processor request in your submission script. For example if you use #PBS -l nodes=1:ppn=8 in your script also include export OMP_THREAD_LIMIT=8 (for a bash script).
Example submission script
#PBS -N velvet
#PBS -o velvet.out
#PBS -e velvet.err
#PBS -M <your e-mail addres>
#PBS -m abe
#PBS -l walltime=12:00:00 # HH:MM:SS
#PBS -l pmem=4gb # 4GB of RAM per thread, or 4X8=32GB in this case
#PBS -l nodes=1:ppn=8
#Set OMP_THREAD_LIMIT--should be the same as ppn above
/apps/velvet/1.1.05/velveth_max99_OMP velvet_out/ 45 -fastq.gz -shortPaired my_paried_data.fastq.gz
/apps/velvet/1.1.05/velvetg_max99_OMP velvet_out/ -min_contig_lgth 250 -exp_cov 10 -ins_length 350