Tgicl

From UFRC
Revision as of 02:36, 10 August 2012 by Moskalenko (talk | contribs) (Text replace - "<!--Versions--> ==Available versions== .* <!--Modules-->" to "<!--Modules-->")
Jump to navigation Jump to search

Description

tgicl website  

TGI Clustering tools (TGICL): a software system for fast clustering of large EST datasets

This package automates clustering and assembly of a large EST/mRNA dataset. The clustering is performed by a slightly modified version of NCBI's megablast , and the resulting clusters are then assembled using CAP3 assembly program. TGICL starts with a large multi-FASTA file (and an optional peer quality values file) and outputs the assembly files as produced by CAP3


Execution Environment and Modules

To use tgicl with the environment modules system at HPC the following commands are available:

Get module information for tgicl:

$module spider tgicl

Load the default application module:

$module load tgicl

The modulefile for this software adds the directory with executable files to the shell execution PATH and sets the following environment variables:

  • HPC_TGICL_DIR - directory where tgicl is located.
  • HPC_TGICL_BIN - executable directory
  • HPC_TGICL_CONF - conf directory

Note:

By default loading the module for tgicl also loads the latest Perl and CAP3 modules. The main version of cap3 is used for analyses. If you wish to use the cap3 version distributed with tgicl, it is located at $HPC_TGICL_BIN/cap3.tgicl

How To Run

  • To copy the default skeleton configuration file to the working directory:
cp $HPC_TGICL_CONF/tgicl.cfg .
  • To get a full list of tgicl commands:
perldoc tgicl
  • Sample run
tgicl -F sample.fasta  -q sample.qual -c 4