Difference between revisions of "SRA"

From UFRC
Jump to navigation Jump to search
m (Text replace - "{{#if: {{#var: mod}}|==Execution Environment and Modules== {{App_Module|app={{#var:app}}|intel={{#var:intel}}|mpi={{#var:mpi}}}}|}}" to "==Required Modules== modules documentation ===Serial=== *{{#var:app}}")
m (Text replace - "<!-- ######## Template Configuration ######## --> <!--Edit definitions of the variables used in template calls Required variables: app - lowercase name of the application e.g. "amber" url - url of the software page (project, company prod)
Line 2: Line 2:
 
__NOEDITSECTION__
 
__NOEDITSECTION__
 
[[Category:Software]]
 
[[Category:Software]]
<!-- ########  Template Configuration ######## -->
+
 
<!--Edit definitions of the variables used in template calls
 
Required variables:
 
app - lowercase name of the application e.g. "amber"
 
url - url of the software page (project, company product, etc) - e.g. "http://ambermd.org/"
 
Optional variables:
 
INTEL - Version of the Intel Compiler e.g. "11.1"
 
MPI - MPI Implementation and version e.g. "openmpi/1.3.4"
 
-->
 
 
{|
 
{|
 
<!--Main settings - REQUIRED-->
 
<!--Main settings - REQUIRED-->

Revision as of 17:52, 10 August 2012


Description

sra website  

This is the NCBI Short Read Archive Toolkit.

Release notes:

SRA Toolkit 2.1.7a includes new features in sam-dump tool and vdb-dump tools.

Sam-dump now supports slicing across multiple sequences, and dumping cSRA files to fasta and fastq formats. In addition, sam-dump has three new parameters:

-=|--hide-identical              Output '=' if base is identical to reference
--gzip                           Compress output using gzip
--bzip2                          Compress output using bzip2

vdb-dump has two new parameters

-o|--column_enum_short           enumerates columns in short form
-b|--boolean                     defines how boolean's are printed (1,T)

We have combined the functionality of two scripts, config-assistant.perl and reference-assistant.perl into a single script, configuration-assistant.perl that helps users download the correct references for a given cSRA file and configure the user environment for the SRA Toolkit.

Required Modules

modules documentation

Serial

  • sra
  • HPC_SRA_BIN - location of the executables directory
  • HPC_SRA_DOC - location of the documentation directory

Aspera Connect

To download SRA data you can use the "ascp" utility from the Aspera Connect browser plugin package. We have a copy installed and provided by the sra module. A wrapper script ascp.sh that automatically uses the ssh key is available. For instance:

ascp.sh -QT anonftp@ftp-private.ncbi.nlm.nih.gov:/genomes/Bacteria/all.faa.tar.gz faa

will download the all.faa.tar.gz archive to the faa directory.

Note: if the download fails to start on the first try with a "Session Stop (Error: Client unable to connect to server (check UDP port and firewall))" error just re-run the command. It's a DNS (host name resolution) problem, which will resolve itself.