Difference between revisions of "BLAT"

From UFRC
Jump to navigation Jump to search
m (Text replace - "{{#if: {{#var: mod}}|==Execution Environment and Modules== {{App_Module|app={{#var:app}}|intel={{#var:intel}}|mpi={{#var:mpi}}}}|}}" to "==Required Modules== modules documentation ===Serial=== *{{#var:app}}")
 
(13 intermediate revisions by 4 users not shown)
Line 1: Line 1:
 
__NOTOC__
 
__NOTOC__
 
__NOEDITSECTION__
 
__NOEDITSECTION__
[[Category:Software]][[Category:Bioinformatics]][[Category:Alignment]]
+
[[Category:Software]][[Category:Biology]][[Category:Genomics]]
<!-- ########  Template Configuration ######## -->
+
{|<!--Main settings - REQUIRED-->
<!--Edit definitions of the variables used in template calls
 
Required variables:
 
app - lowercase name of the application e.g. "amber"
 
url - url of the software page (project, company product, etc) - e.g. "http://ambermd.org/"
 
Optional variables:
 
INTEL - Version of the Intel Compiler e.g. "11.1"
 
MPI - MPI Implementation and version e.g. "openmpi/1.3.4"
 
-->
 
{|
 
<!--Main settings - REQUIRED-->
 
 
|{{#vardefine:app|blat}}
 
|{{#vardefine:app|blat}}
 
|{{#vardefine:url|http://genome.ucsc.edu/goldenPath/help/blatSpec.html}}
 
|{{#vardefine:url|http://genome.ucsc.edu/goldenPath/help/blatSpec.html}}
<!--Compiler and MPI settings - OPTIONAL -->
+
|{{#vardefine:exe|1}} <!--Present manual instructions for running the software -->
|{{#vardefine:intel|}} <!-- E.g. "11.1" -->
 
|{{#vardefine:mpi|}} <!-- E.g. "openmpi/1.3.4" -->
 
<!--Choose sections to enable - OPTIONAL-->
 
|{{#vardefine:mod|1}} <!--Present instructions for running the software with modules -->
 
|{{#vardefine:exe|}} <!--Present manual instructions for running the software -->
 
 
|{{#vardefine:conf|}} <!--Enable config wiki page link - {{#vardefine:conf|1}} = ON/conf|}} = OFF-->
 
|{{#vardefine:conf|}} <!--Enable config wiki page link - {{#vardefine:conf|1}} = ON/conf|}} = OFF-->
 
|{{#vardefine:pbs|}} <!--Enable PBS script wiki page link-->
 
|{{#vardefine:pbs|}} <!--Enable PBS script wiki page link-->
Line 50: Line 35:
  
 
<!--Modules-->
 
<!--Modules-->
==Required Modules==
+
==Environment Modules==
[[Modules|modules documentation]]
+
Run <code>module spider {{#var:app}}</code> to find out what environment modules are available for this application.
===Serial===
+
==System Variables==
*{{#var:app}}
+
* HPC_{{uc:{{#var:app}}}}_DIR - installation directory
 
* HPC_BLAT_BIN - executable directory
 
* HPC_BLAT_BIN - executable directory
 
* HPC_BLAT_DOC - documentation directory
 
* HPC_BLAT_DOC - documentation directory
{{#if: {{#var: exe}}|==How To Run==
+
<!--Additional-->
WRITE INSTRUCTIONS ON RUNNING THE ACTUAL BINARY|}}
+
{{#if: {{#var: exe}}|==Additional Information==
 +
'''Note:''' If you encounter segfaults (crashes) while running blat against a larger reference database like nr or md5nr please use a three-way split version of the respective database under <code>/bio/reference/fasta</code> and run three blat processes instead of one. The upstream author Jim Kent has been notified of the issue. It's unknown when a fix will be provided.
 +
|}}
 
{{#if: {{#var: conf}}|==Configuration==
 
{{#if: {{#var: conf}}|==Configuration==
 
See the [[{{PAGENAME}}_Configuration]] page for {{#var: app}} configuration details.|}}
 
See the [[{{PAGENAME}}_Configuration]] page for {{#var: app}} configuration details.|}}

Latest revision as of 13:03, 15 August 2022

Description

blat website  

Analyzing vertebrate genomes requires rapid mRNA/DNA and cross-species protein alignments. BLAT (the BLAST-Like Alignment Tool) is a software program developed by Jim Kent at UCSC to identify similarities between DNA sequences and protein sequences. It was developed to assist in the annotation of the human genome sequence. BLAT is much faster than older tools such as BLAST for nucleotide and protein alignments, and it can also perform spliced alignments of RNA to DNA. BLAT uses a space-time tradeoff to compare sequences quickly. BLAT precomputes an index of all nonoverlapping k-mers in the genome. This index fits inside the RAM of inexpensive computers, and need only be computed once for each genome assembly. BLAT has several major stages. It uses the index to find regions in the genome likely to be similar to the query sequence. It performs an alignment between homologous regions. It stitches together these aligned regions (often exons) into larger alignments (typically genes). Finally, BLAT revisits small internal exons possibly missed at the first stage and adjusts large gap boundaries that have canonical splice sites where feasible.

Environment Modules

Run module spider blat to find out what environment modules are available for this application.

System Variables

  • HPC_BLAT_DIR - installation directory
  • HPC_BLAT_BIN - executable directory
  • HPC_BLAT_DOC - documentation directory

Additional Information

Note: If you encounter segfaults (crashes) while running blat against a larger reference database like nr or md5nr please use a three-way split version of the respective database under /bio/reference/fasta and run three blat processes instead of one. The upstream author Jim Kent has been notified of the issue. It's unknown when a fix will be provided.