Search results

Category:Alignment
Biological sequence alignment tools and documentation

24 members (0 subcategories, 0 files) - 15:24, 23 August 2022
WebLogo
WebLogo is an application designed to make the generation of sequence logos as easy and painless as possible. ...precise description of, for example,a binding site, than would a consensus sequence.

3 KB (417 words) - 17:40, 22 August 2022
Category:Assembly
Short Sequence Assembly software.

20 members (0 subcategories, 0 files) - 15:24, 23 August 2022
Infernal
Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities. It is an implementation of a

3 KB (305 words) - 19:31, 24 August 2022
EvoLSTM
...robabilities. EvoLSTM brings modern machine-learning approaches to bear on sequence evolution. It will serve as a useful tool to study and simulate complex mut

3 KB (309 words) - 18:52, 12 August 2022
Sim4
...verlaps the end of the other). If seqfile2 is a database of sequences, the sequence in seqfile1 will be aligned with each of the sequences in seqfile2.

2 KB (302 words) - 20:41, 12 August 2022
RNAsalsa
...e, that RNAsalsa uses structure information for adjusting and refining the sequence alignment and vice versa.

3 KB (368 words) - 21:24, 6 December 2019
Spruceup
...ts (alignment rows), which is different from the problem of poorly aligned sequence blocks (alignment columns) commonly addressed by alignment trimming softwar ...identification, visualization, and removal of outliers from large multiple sequence alignments. Journal of Open Source Software, 4(42), 1635, https://doi.org/1

3 KB (317 words) - 21:24, 6 December 2019
CD-HIT
...-hit produces a set of closely related protein families from a given fasta sequence database.

3 KB (354 words) - 18:24, 12 August 2022
SuperCRUNCH
...orking with phylogenetic datasets. SuperCRUNCH can be run using any set of sequence data, as long as sequences are in fasta format with standard naming convent ...s (adjust sequence directions, adjust reading frames), several options for sequence alignment (Clustal-O, MAFFT, Muscle, MACSE), and multiple options for align

4 KB (499 words) - 14:47, 11 May 2020
RepeatMasker
sequence as well as a modified version of the query sequence in which On average, almost 50% of a human genomic DNA sequence currently will

2 KB (325 words) - 20:32, 12 August 2022
Minigraph
...graph constructor. It finds approximate locations of a query sequence in a sequence graph and incrementally augments an existing graph with long query subseque

2 KB (279 words) - 18:08, 31 March 2021
Sputnik
...the resulting hits are written to stdout along with their position in the sequence, length, and a score determined by the length of the repeat and the number

2 KB (311 words) - 15:50, 22 August 2022
FAST
...BioPerl FastA format). FAST tools expose the power of Perl and BioPerl for sequence analysis to non-programmers in an easy-to-learn command-line paradigm.

3 KB (358 words) - 21:21, 6 December 2019
Artemis
...next generation data and the results of analyses within the context of the sequence, and also its six-frame translation. ...: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data.

3 KB (378 words) - 14:53, 12 August 2022
Mlstcheck
...for each locus, and the ST (or nearest ST), the other contains the genomic sequence for each allele. ...us, the contaminated flag is set. Optionally you can output a concatenated sequence in FASTA format, which you can then use with tree building programs. New, u

3 KB (395 words) - 18:06, 27 May 2022
TNTBLAST
...ideally) match experimental PCR results. To enable searching of very large sequence databases (i.e. all of Genbank), ThermonucleotideBLAST can use run-time dat

3 KB (385 words) - 20:08, 2 June 2022
Seq crumbs
seq_crumbs aims to be a collection of small sequence processing utilities. ...and most of them take a sequence file as input and create a new processed sequence file as output. This design encourages the assembly of the seq_crumbs utili

3 KB (328 words) - 15:12, 27 May 2022
HMMER
HMMER is used for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods

3 KB (397 words) - 18:18, 15 August 2022
Impg
impg (Implicit Pangenome Graph) projects sequence ranges through many-way (e.g. all-vs-all) pairwise alignments built by tool ...htforward to use to extract FASTA sequences for downstream use in multiple sequence alignment (like mafft) or pangenome graph building (e.g., pggb or minigraph

3 KB (392 words) - 21:22, 10 April 2024
Reference Indexes
...rence indexes built for the respective tools using the appropriate genomic sequence data. This page provides a short overview of our reference index building p ...most reference indexes to be built. In that case we'll use the non-masked sequence.

2 KB (361 words) - 20:30, 12 August 2022
Consed
calling, sequence comparisons, and sequence assembly. Phred, Cross_match, Consed/Autofinish is a tool for viewing, editing, and finishing sequence assemblies created with phrap. Finishing capabilities include allowing the

3 KB (348 words) - 18:39, 12 August 2022
UCSC
...ion tasks such as reverse complementation, codon and amino acid lookup and sequence translation, as well as functions specifically designed for extracting, loa

3 KB (349 words) - 20:53, 12 August 2022
WGS Assembler
...n sequence of a multi-cellular organism (Myers 2000) and the first diploid sequence of an individual human (Levy 2007). Celera Assembler was developed at Celer

3 KB (359 words) - 20:56, 12 August 2022
SRAssembler
...cal assembly of genomic regions matching a homologous query protein or DNA sequence. ...ssembler first collects the reads that can be locally aligned to the query sequence and assembles them into contigs. Additional reads are then found by alignin

3 KB (381 words) - 15:33, 27 May 2022
Kalign
Kalign is a fast multiple sequence alignment program for biological sequences. "Kalign 3: multiple sequence alignment of large data sets."

2 KB (265 words) - 19:28, 12 August 2022
USEARCH
...tions, including E-values, identity, coverage (fraction of query or target sequence covered by the alignment) and maximum gap length, and a range of output fil ...RCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude

4 KB (559 words) - 17:19, 22 August 2022
PRINSEQ
...n in Perl and can be helpful if you want to filter, reformat, or trim your sequence data. It also generates basic statistics for your sequences.

2 KB (271 words) - 15:52, 10 June 2022
Wgs
...n sequence of a multi-cellular organism (Myers 2000) and the first diploid sequence of an individual human (Levy 2007). Celera Assembler was developed at Celer

3 KB (375 words) - 20:55, 12 August 2022
Nt
This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks. It powers several papers and work

2 KB (262 words) - 14:38, 15 July 2022
MHAP
...rithm. Designed to efficiently detect all overlaps between noisy long-read sequence data. It efficiently estimates Jaccard similarity by compressing sequences

2 KB (270 words) - 16:15, 10 June 2022
SEQPower
SEQPower provides statistical power analysis and sample size estimation for sequence-based association studies. ...tez, B. Peng and S. M. Leal, Power analysis and sample size estimation for sequence-based association studies. Bioinformatics. (2014)]

2 KB (274 words) - 22:38, 21 August 2022
GHOST-MP
...ches for similar sequences among nucleotide query sequences and amino acid sequence database like BLASTX. GHOST-MP runs on a distributed memory system and proc

2 KB (285 words) - 20:11, 7 May 2020
Shasta
...of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells. Using a run-length representation of the read sequence. This makes the assembly process more resilient to errors in homopolymer re

3 KB (402 words) - 21:24, 6 December 2019
Ont tutorial cas9
...t enrichment strategy. This workflow is suitable for Oxford Nanopore fastq sequence collections and requires a reference genome and a BED file of target coordi

2 KB (289 words) - 18:38, 2 June 2022
Tcoffee
T-Coffee is a multiple sequence alignment package. You can use T-Coffee to align sequences or to combine th T-Coffee: A novel method for multiple sequence alignments.

2 KB (280 words) - 20:46, 12 August 2022
NCBI GenBank Tools
..., 2013 Jan;41(D1):D36-42). GenBank is part of the International Nucleotide Sequence Database Collaboration, which comprises the DNA DataBank of Japan (DDBJ), t

2 KB (291 words) - 14:33, 19 August 2022
ProDy
...s for comparative analysis and modeling of protein structural dynamics and sequence co-evolution. Fast and flexible ProDy API is for interactive usage as well

2 KB (294 words) - 19:38, 21 August 2022
Newbler
...de novo DNA sequence assembly. It is designed specifically for assembling sequence data generated by the 454 GS-series of pyrosequencing platforms sold by 454

2 KB (305 words) - 16:17, 19 August 2022
Rvtests
...tests, is a flexible software package for genetic association analysis for sequence datasets. ...ficient and Comprehensive Tool for Rare Variant Association Analysis Using Sequence Data. Bioinformatics 2016 32: 1423-1426.]

3 KB (293 words) - 21:59, 21 August 2022
GeneMarkS
....hmm (P and E) programs identify the maximum likely parse of the whole DNA sequence into protein coding genes (with possible introns) and intergenic regions.

2 KB (296 words) - 16:45, 27 May 2022
BloomTree
The Sequence Bloom Tree (SBT) is a method that will allow you to index a set of sequence. The code base provided here is an implementation of SBT written in

3 KB (302 words) - 16:49, 10 June 2022
MetaGeneMark
....hmm (P and E) programs identify the maximum likely parse of the whole DNA sequence into protein coding genes (with possible introns) and intergenic regions.

2 KB (298 words) - 21:22, 6 December 2019
GeneMarkS-T
....hmm (P and E) programs identify the maximum likely parse of the whole DNA sequence into protein coding genes (with possible introns) and intergenic regions.

2 KB (296 words) - 17:25, 2 June 2022
DEXTRACTOR
RS II sequencer. Generally speaking, this information is the sequence of all the reads to produce a highly accurate consensus sequence as the last step in the assembly

3 KB (441 words) - 19:20, 10 June 2022
HAPCUT
...ts, and outputs the phased haplotype blocks that can be assembled from the sequence reads.

2 KB (298 words) - 18:13, 15 August 2022
Ncbi cli
Use it to find and download sequence, annotation, and metadata for genes and genomes Use '''datasets''' to download biological sequence data across all domains of life from NCBI.

3 KB (306 words) - 15:47, 9 June 2023
Fairseq
Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom mod A Fast, Extensible Toolkit for Sequence Modeling. Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and

3 KB (296 words) - 15:18, 15 August 2022
YASRA
...improve the performance, both in runtime and quality for 454 and Illumina sequence reads.

3 KB (304 words) - 20:56, 12 August 2022
GeneMark-ES
....hmm (P and E) programs identify the maximum likely parse of the whole DNA sequence into protein coding genes (with possible introns) and intergenic regions. F

3 KB (303 words) - 16:44, 27 May 2022
MMseqs2
...ing) is a software suite to search and cluster huge protein and nucleotide sequence sets. MMseqs2 is open source GPL-licensed software implemented in C++ for L ...Soeding J. MMseqs2 desktop and local web server app for fast, interactive sequence searches. Bioinformatics, doi: 10.1093/bioinformatics/bty1057 (2019).]

3 KB (413 words) - 20:22, 2 October 2023
Wise
...e focused on comparisons of biopolymers, commonly DNA sequence and protein sequence. There are many other packages which do this, probably the best known being

3 KB (321 words) - 21:29, 6 December 2019
SNP-Pipeline
SNP Pipeline is a pipeline for the production of SNP matrices from sequence data used in the phylogenetic analysis of pathogenic organisms sequenced fr ...ne: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Computer Science 1:e20 https://doi.org/10.7717/peerj-cs.20]

3 KB (306 words) - 17:00, 10 June 2022
Catch
...ackage for designing probe sets to use for nucleic acid capture of diverse sequence. of species. It allows blacklisting sequence from the design (e.g., background in microbial enrichment),

3 KB (422 words) - 18:23, 12 August 2022
DeconSeq
The DeconSeq tool can be used to automatically detect and efficiently remove sequence contaminations from genomic and metagenomic datasets. It is easily configur ...med/21278185 Schmieder R and Edwards R: Fast identification and removal of sequence contamination from genomic and metagenomic datasets. PLoS ONE 2011, 6:e1728

3 KB (299 words) - 17:00, 10 June 2022
Recycler
...from sequence data of isolate microbial genomes, plasmidome and metagenome sequence data.

3 KB (297 words) - 14:21, 11 October 2022
FSA
...using only pairwise estimations of homology. This is made possible by the sequence annealing technique for constructing a multiple alignment from pairwise com

3 KB (307 words) - 20:38, 23 May 2022
Ancestryhmm
...neously estimating local ancestry and admixture time using next generation sequence data in samples of arbitrary ploidy. ...neously estimating local ancestry and admixture time using next generation sequence data in samples of arbitrary ploidy. PLoS genetics, 13(1), p.e1006529.

2 KB (291 words) - 21:25, 9 May 2023
AliStat
Multiple sequence alignments may contain a variety of completely-specified characters. Here a for multiple sequence alignments. NAR Genomics and Bioinformatics 2 (2), lqaa024.

3 KB (315 words) - 20:18, 1 October 2020
REPdenovo
REPdenovo is designed for constructing repeats directly from sequence reads. It based on the idea of frequent k-mer assembly. REPdenovo provides ...elsen and Yufeng Wu, REPdenovo: Inferring de novo repeat motifs from short sequence reads, PLoS One 11.3 (2016): e0150719.]

3 KB (327 words) - 21:43, 21 August 2022
SeSiMCMC
The Sequence Similarities by Markov Chain Monte Carlo (SeSiMCMC) algorithm careful Bayesian analysis to consider site absence in a sequence.

3 KB (299 words) - 22:41, 21 August 2022
HLAminer
...and align the resulting contigs to reference HLA alleles from the IMGT/HLA sequence repository using commodity hardware with standard specifications (<2GB RAM, ...novo assembly of all recruited reads, a set of contigs is generated. Only sequence contigs equal or larger than 200nt in length are considered for further ana

4 KB (541 words) - 14:55, 14 December 2022
Magus
MAGUS is a tool for piecewise large-scale multiple sequence alignment. Original MAGUS paper: Smirnov, V. and Warnow, T., 2020. MAGUS: Multiple Sequence Alignment using Graph Clustering. Bioinformatics. https://doi.org/10.1093/b

3 KB (327 words) - 17:51, 16 March 2022
CRISP
...d using the Illumina sequencing platform. In principle, it should work for sequence data from other sequencing platforms. The method requires each pool to be s

3 KB (342 words) - 21:01, 6 December 2019
MetaCluster
...nomic sequences. Existing binning methods based on sequence similarity and sequence composition markers rely heavily on the reference genomes of known microorg

3 KB (321 words) - 19:26, 18 August 2022
RSeQC
...put sequence data especially RNA-seq data. “Basic modules” quickly inspect sequence quality, nucleotide composition bias, PCR bias and GC bias, while “RNA-se

3 KB (328 words) - 17:48, 10 June 2022
Barrnap
...more CPUs. Running time is approximately 1 second per 1 megabase of input sequence.

3 KB (328 words) - 17:55, 10 June 2022
ProtExcluder
...ude the portion of DNA sequence (as well as certain length of the flanking sequence – given by the user, default = 50 bp) matching subjects in a protein data

3 KB (329 words) - 21:22, 6 December 2019
Bridger
...r to search an optimal set of paths (transcripts) that can be supported by sequence data and could explain all observed splicing events of each locus.

3 KB (341 words) - 13:14, 15 August 2022
Last
* Handle big sequence data, e.g: * Use sequence quality data properly.

3 KB (342 words) - 19:29, 12 August 2022
Probalign
...terior probability estimates to compute maximum expected accuracy multiple sequence alignments. It performs statistically significantly better than the leading generation and manipulation of multiple sequence alignments using

3 KB (349 words) - 19:36, 21 August 2022
CGView
...w sequence features, gene and protein names, COG category assignments, and sequence composition characteristics. CCT can generate maps in a variety of sizes, i

3 KB (351 words) - 19:39, 23 May 2022
PhyloPhlAn2
...nction that selects how many position to consider for each of the multiple-sequence alignment.

3 KB (375 words) - 17:57, 9 June 2022
Jasper
...assemblies. JASPER is substantially faster than polishing methods based on sequence alignment, and more accurate than currently available k-mer based methods.

3 KB (354 words) - 16:42, 14 December 2023
EMBOSS
...BOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. EMBOSS breaks the historical trend towards

3 KB (372 words) - 15:01, 15 August 2022
SeqPrep
...most cases) so they are forcefully merged. When reads do not have adapter sequence they must be treated with care when doing the merging, so a much more sensi

3 KB (371 words) - 22:38, 21 August 2022
HybPiper
HybPiper was designed for targeted sequence capture, in which DNA sequencing libraries are enriched for gene regions of ...., Zerega, N. J. C, and Wickett, N. J. (2016). HybPiper: Extracting Coding Sequence and Introns for Phylogenetics from High-Throughput Sequencing Reads Using T

3 KB (349 words) - 18:36, 10 June 2022
Maq
Follow these steps to run Maq. All you need is a reference sequence file in the FASTA format. Prepare a reference sequence (ref.fasta), better a bacterial genome to make the test run faster.

3 KB (377 words) - 19:47, 12 August 2022
Viralmsa
ViralMSA is a tool to perform reference-guided multiple sequence alignment of viral genomes. ViralMSA wraps around existing read mapping too ...Moshiri N (2020). "ViralMSA: Massively scalable reference-guided multiple sequence alignment of viral genomes." Bioinformatics. btaa743. doi:10.1093/bioinform

3 KB (357 words) - 19:56, 27 May 2022
MODELLER
...alignment of protein sequences and/or structures, clustering, searching of sequence databases, comparison of protein structures, etc. MODELLER is available for

3 KB (364 words) - 19:53, 12 August 2022
Trim Galore
...h some added functionality to remove biased methylation positions for RRBS sequence files (for directional, non-directional (or paired-end) sequencing). It's m ...suitable for both ends of paired-end libraries), but accepts other adapter sequence, too

4 KB (512 words) - 20:52, 12 August 2022
RDnaTools
...is a python package of tools and pipelines for working with ribosomal DNA sequence data generated with the PacBio(R) SMRT sequencing. rDnaTools works by wrapp ...al Ecology community, and their existing tools for analyzing ribosomal DNA sequence data. Since the core of the analyses wrapped by rDnaTools come from the Mot

3 KB (361 words) - 21:24, 6 December 2019
Ssr pipeline
...quality standards, (2) align paired-end reads into a single composite DNA sequence, and (3) identify sequences that possess microsatellites conforming to user ...n of microsatellite sequences from paired-end Illumina High-Throughput DNA sequence data (ver. 1.1, February 2014): U.S. Geological Survey Data Series 778.

4 KB (501 words) - 18:55, 6 June 2022
Quake
..., which create false sequence unsimilar to anything in the original genome sequence from which the read was taken.

3 KB (367 words) - 18:50, 10 June 2022
Mvftools
...e data is encoded based on the information content at a particular aligned sequence site. This contextual encoding allows for rapid computation of phylogenetic

3 KB (370 words) - 13:58, 30 April 2020
Erpin
...write complex descriptors before starting a search. Instead ERPIN reads a sequence alignement and secondary structure, and automatically infers a statistical ...ert A. (2001) Direct RNA Motif Definition and Identification from Multiple Sequence Alignments using Secondary Structure Profiles. J Mol Biol. 313:1003-11]

3 KB (377 words) - 21:21, 6 December 2019
Samtools
...nce Alignment/Map) format is a generic format for storing large nucleotide sequence alignments. SAM Tools provide various utilities for manipulating alignments

3 KB (357 words) - 22:04, 21 August 2022
BiG-SCAPE
...ed on a comparison of their protein domain content, order, copy number and sequence identity.

3 KB (377 words) - 12:53, 15 August 2022
PROVEAN
PROVEAN is useful for filtering sequence variants to identify nonsynonymous or indel variants that are predicted to A fast computation approach to obtain pairwise sequence alignment scores enabled the generation of precomputed PROVEAN predictions

3 KB (350 words) - 20:27, 12 August 2022
MACH
available genotype and shotgun sequence data to estimate unobserved Li Y, Willer CJ, Ding J, Scheet P and Abecasis GR (2010) MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet Ep

3 KB (363 words) - 20:23, 15 August 2022
Anchorwave
...ragments, i.e., anchor and inter-anchor intervals. By performing sensitive sequence alignment for each shorter interval via a 2-piece affine gap cost strategy ...Michelle C. Stitzer. AnchorWave: Sensitive alignment of genomes with high sequence diversity, extensive structural polymorphism, and whole-genome duplication.

3 KB (367 words) - 21:10, 20 September 2023
FastANI
...avoids expensive sequence alignments and uses Mashmap as its MinHash based sequence mapping engine to compute the orthologous mappings and alignment identity e

3 KB (380 words) - 21:21, 6 December 2019
ShapeMapper
...grates careful handling of all classes of adduct-induced sequence changes, sequence variant correction, basecall quality filters, and quality-control warnings

3 KB (387 words) - 21:24, 6 December 2019
INDELible
INDELible is a new, portable, and flexible application for biological sequence simulation that combines many features in the same place for the first time ...tcher, W. and Yang, Z. 2009. INDELible: a flexible simulator of biological sequence evolution. Mol. Biol. and Evol. 2009 26(8):1879-1888]

3 KB (384 words) - 13:34, 28 June 2021
MOSAIK
...ikSort resolves paired-end reads and sorts the alignments by the reference sequence coordinates. Finally, MosaikText converts alignments to different text-base

3 KB (414 words) - 11:59, 19 August 2022
Ncbi-vdb
...sion by Reference, which only stores the differences in base pairs between sequence data and the segment it aligns to. The process to restore original data, fo

3 KB (428 words) - 14:33, 19 August 2022
Scrappie
* scrappie squiggle Create approximate squiggle for sequence * scrappie seqmappy Map signal to sequence via basecall posteriors

3 KB (416 words) - 13:43, 7 April 2021
TWINSCAN
with the length of the target sequence. A rough guideline is 1 GB of memory for 1 Mb of input sequence.

3 KB (386 words) - 20:20, 27 May 2022
TRF
...Repeats with pattern size in the range from 1 to 2000 bases are detected. Sequence information sent to the server is confidential and deleted after program ex

3 KB (442 words) - 20:52, 12 August 2022
SCRATCH-1D
* EVALpro release 1.0 (2019) : Evaluation of sequence-based & profile-based predictors * PROFILpro release 2.0 (2021) : Protein evolutionary information / sequence profiles

3 KB (381 words) - 20:35, 12 August 2022
UFCG
* '''ufcg train''' generates sequence model of your own fungal marker gene, even from a small set of seed sequenc * '''ufcg align''' conducts multiple sequence alignment of the genes from the set of marker gene profiles.

3 KB (431 words) - 17:41, 20 September 2023
Uproc
...) toolbox implements a novel algorithm ("Mosaic Matching") for large-scale sequence analysis and is now available in terms of an open source C library. UProC i

3 KB (418 words) - 20:53, 12 August 2022
SAMStat
...a reference genome in the alignment / map format (SAM/BAM). To monitor the sequence quality over time and to identify problems it is necessary to report variou SAMStat is an efficient C program to quickly display statistics of large sequence files from next generation sequencing projects. When applied to SAM/BAM fil

3 KB (429 words) - 20:34, 12 August 2022
Wengan
alignments can be inferred by building paths on a sequence graph. To achieve this, Wengan builds a new sequence graph called the Synthetic Scaffolding Graph (SSG). The SSG is built from

4 KB (472 words) - 20:55, 12 August 2022
BEDTools
#Calculating the depth and breadth of sequence coverage across defined "windows" in a genome. ...ntrol over how output is reported. Most recently, I have added support for sequence alignments in BAM ([http://samtools.sourceforge.net/]) format, as well as f

3 KB (485 words) - 12:48, 15 August 2022
TREEPUZZLE
...t computing an overall tree and to visualize the phylogenetic content of a sequence alignment. TREE-PUZZLE also conducts a number of statistical tests on the d

4 KB (450 words) - 19:31, 10 June 2022
Sate
SATé is a software package for inferring a sequence alignment and phylogenetic tree. The iterative algorithm involves repeated .... Linder, T. Warnow, 2009. "Rapid and accurate large scale coestimation of sequence alignments and phylogenetic trees." Science, 324(5934), pp. 1561-1564, 19 J

7 KB (910 words) - 20:43, 21 December 2022
BLAT
sequence. BLAT is much faster than older tools such as BLAST for nucleotide and the genome likely to be similar to the query sequence. It performs an alignment

3 KB (484 words) - 13:03, 15 August 2022
Fast-Plast
...roximately 30 minutes with no user mediation. In addition to a chloroplast sequence, Fast-Plast identifies chloroplast genes present in the final assembly. .... Lohse, and B. Usadel. 2014. Trimmomatic: A flexible trimmer for Illumina Sequence Data. Bioinformatics, btu170.

4 KB (487 words) - 21:20, 6 December 2019
Gffread
GFF/GTF parsing utility providing format conversions, region filtering, FASTA sequence extraction and more.

2 KB (242 words) - 19:02, 12 August 2022
SeCaPr
A computational pipeline for processing Illumina sequence capture data

2 KB (246 words) - 21:24, 6 December 2019
InStruct
...subpopulations and classfies each individual into subpopulations given the sequence data.

2 KB (254 words) - 18:00, 11 March 2020
BppSuite
BppSuite is a suite of ready-to-use programs for phylogenetic and sequence analysis.

2 KB (248 words) - 15:37, 9 December 2022
Pyfasta
Fast, memory-efficient, pythonic (and command-line) access to fasta sequence files.

2 KB (238 words) - 19:58, 21 August 2022
Randfold
Randfold compute the probability that, for a given RNA sequence, the Minimum Free Energy (MFE) of the secondary structure is different from

2 KB (262 words) - 21:29, 21 August 2022
Staden
A fully developed set of DNA sequence assembly (Gap4 and Gap5), editing and analysis tools (Spin) for Unix, Linux

2 KB (254 words) - 15:59, 22 August 2022
Talesf
...sites for transcription activator-like (TAL) effectors in a given genomic sequence.

2 KB (256 words) - 21:29, 6 December 2019
Hgtector
...-wide detection of putative horizontal gene transfer (HGT) events based on sequence homology search hit distribution statistics.

2 KB (253 words) - 17:04, 10 October 2022
GeP
Genome Profiler (GeP) is a program to perform whole-genome multilocus sequence typing (wgMLST) analysis for bacterial isolates.

2 KB (255 words) - 15:19, 10 June 2022
LIGR
LIGR Assembler is a sequence assembly program. It was derived from TIGR Assembler and addresses some of

2 KB (249 words) - 19:55, 15 August 2022
Metabin
...for faster and more accurate taxonomic assignment of single and paired-end sequence reads of varying lengths (≥45 bp) obtained from both Sanger and next-gene ...g Blastx. The MetaBin web server allows users to upload their own data, as sequence reads or Blastx output, to carry out taxonomic analysis. It provides severa

4 KB (588 words) - 19:25, 18 August 2022
Pysamstats
...utility for extracting simple statistics against genome positions based on sequence alignments from a SAM or BAM file.

2 KB (259 words) - 19:52, 29 September 2020
PSMC
This software package infers population size history from a diploid sequence

2 KB (263 words) - 14:01, 28 February 2020
CAMP
CAMP is a sequence-based deep learning framework for multifaceted prediction of peptide-protei

2 KB (255 words) - 18:00, 23 May 2022
ParGenes
ParGenes is a parallel tool that takes as input a set of multiple sequence alignments (typically from different genes) and infers their corresponding

2 KB (265 words) - 13:31, 28 April 2021
Odgi
odgi provides an efficient and succinct dynamic DNA sequence graph model, as well as a host of algorithms that allow the use of such gra

2 KB (264 words) - 14:11, 15 July 2021
Minced
...s) in full genomes or environmental datasets such as metagenomes, in which sequence size can be anywhere from 100 to 800 bp.

2 KB (263 words) - 15:38, 10 June 2022
Scythe
Scythe uses a Naive Bayesian approach to classify contaminant substrings in sequence reads. It considers quality information, which can make it robust in pickin

2 KB (262 words) - 20:36, 12 August 2022
SMC++
...a program for estimating the size history of populations from whole genome sequence data.

2 KB (268 words) - 23:52, 21 August 2022
Automlsa2
...ips between genes and organisms. The general idea is to allow the input of sequence data along with marker genes and output a robust phylogenetic tree.

2 KB (266 words) - 15:25, 12 August 2022
ProSplign
Splign is a utility for computing cDNA-to-Genomic, or spliced sequence alignments. At the heart of the program is a compartmentization algorithm w

2 KB (267 words) - 17:08, 1 June 2022
Squigglenet
...ord Nanopore raw electrical signals as target or non-target for Read-Until sequence enrichment or depletion.

2 KB (262 words) - 20:12, 9 May 2022
Seqmagick
We often have to convert sequence files between formats and do little manipulations on them, and it's not wor

2 KB (269 words) - 22:35, 21 August 2022
TrimAl
...ed removal of spurious sequences or poorly aligned regions from a multiple sequence alignment.

2 KB (264 words) - 19:32, 24 August 2022
MetaCherchant
MetaCherchant is a tool for analysing genomic environment of a nucleotide sequence within a metagenome. The implementation is based on MetaFast source code.

2 KB (269 words) - 21:22, 6 December 2019
HmmCleaner
...to remove alignment and sequencing error containing segments from multiple sequence alignments (MSA) using profile hidden Markov models (pHMM). This tool is ba

2 KB (275 words) - 22:25, 27 August 2020
Graphaligner
Rautiainen, M., Marschall, T. GraphAligner: rapid and versatile sequence-to-graph alignment. Genome Biol 21, 253 (2020). https://doi.org/10.1186/s13

2 KB (262 words) - 21:48, 12 January 2023
EPACTS
...orm various statistical tests for identifying genome-wide association from sequence data through a user-friendly interface, both to scientific analysts and to

2 KB (269 words) - 18:52, 12 August 2022
FastQ-Screen
...lows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with wha

2 KB (274 words) - 16:05, 16 March 2020
PyCogent
PyCogent is a toolkit for making sense from sequence

2 KB (275 words) - 17:20, 1 June 2022
Trgt
...PacBio HiFi data. In addition to the basic size genotyping, TRGT profiles sequence composition, mosaicism, and CpG methylation of each analyzed repeat. TRGT c

2 KB (273 words) - 14:45, 28 July 2023
Megalodon
...is a research command line tool to extract high accuracy modified base and sequence variant calls from raw nanopore reads by anchoring the information rich bas

2 KB (271 words) - 22:24, 8 September 2020
Trvz
...PacBio HiFi data. In addition to the basic size genotyping, TRGT profiles sequence composition, mosaicism, and CpG methylation of each analyzed repeat.

2 KB (273 words) - 14:46, 28 July 2023
SInC
...tor for SNPs, Indels and CNVs coupled with a read generator for short-read sequence data.

2 KB (278 words) - 21:24, 6 December 2019
DRAP
...shed method to study transcription of organisms lacking a reference genome sequence. Available packages such as Trinity and Oases have proven to be able to hig

2 KB (280 words) - 21:20, 6 December 2019
Exonerate
* Exonerate is a generic tool for pairwise sequence comparison.

2 KB (303 words) - 18:53, 12 August 2022
OrthoFinder
OrthoFinder is a program for identifying orthologous protein sequence families. It is written in python and runs as a single command that takes a

2 KB (276 words) - 17:58, 19 August 2022
M5nr
The M5NR is an integration of many sequence databases into one single, searchable database (plus an index). A single si

2 KB (286 words) - 19:46, 12 August 2022
Pyani
...ticore systems, and can integrate with SGE/OGE-type job schedulers for the sequence comparisons.

2 KB (283 words) - 19:44, 21 August 2022
GROCSVs
...tware pipeline for identifying large-scale structural variants, performing sequence assembly at the breakpoints, and reconstructing the complex structural vari

2 KB (274 words) - 21:21, 6 December 2019
Pindel
...ons and other structural variants at single-based resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of thes

2 KB (279 words) - 21:23, 6 December 2019
ARKS
Scaffolding genome sequence assemblies using 10X Genomics GemCode/Chromium data. This project is a new

2 KB (285 words) - 21:15, 6 December 2019
IGV
...ts a wide variety of data types, including array-based and next-generation sequence data, and genomic annotations.

2 KB (276 words) - 18:42, 15 August 2022
Srst2
This program is designed to take Illumina sequence data, a MLST database and/or a database of gene sequences (e.g. resistance

2 KB (278 words) - 17:42, 13 December 2022
SequelQC
...ram specifically for the PacBio Sequel platform that quickly processes raw sequence data from multiple SMRTcells producing multiple statistics and publication

2 KB (280 words) - 21:24, 6 December 2019
Clipkit
ClipKIT: a multiple sequence alignment trimming software for accurate phylogenomic inference. Steenwyk e

2 KB (276 words) - 13:29, 15 August 2022
PhaME
...SNPs from complete genomes, draft genomes and/or reads. Uses SNP multiple sequence alignment to construct a phylogenetic tree. Provides evolutionary analyses

2 KB (287 words) - 14:18, 9 March 2020
RepeatExplorer
graph-based clustering analysis of sequence read similarities to identify

2 KB (278 words) - 21:44, 21 August 2022
Mashtree
...and Carleton, H. A., (2019). Mashtree: a rapid comparison of whole genome sequence files. Journal of Open Source Software, 4(44), 1762]

2 KB (280 words) - 13:30, 15 August 2022
Gimmemotifs
...identify differential motifs, calculate motif enrichment statistics, plot sequence logos and more. In addition, all this functionality is available from a Pyt

2 KB (293 words) - 15:41, 30 June 2023
Cpc
...Calculator distinguishes protein-coding from non-coding RNAs based on the sequence features of the input transcripts. Last version of CPC1 is widely used by w

2 KB (287 words) - 17:22, 9 June 2022
RNAhybrid
...NA. The hybridisation is performed in a kind of domain mode, ie. the short sequence is hybridised to the best fitting part of the long one. The tool is primari

2 KB (285 words) - 20:56, 12 August 2022
CRISPResso2
...n of genome editing experiments. It aligns sequencing reads to a reference sequence, quantifies insertions, mutations and deletions to determine whether a read

2 KB (282 words) - 13:25, 11 October 2022
MPBoot
...program can reconstruct maximum parsimony trees for large DNA and protein sequence alignments. More importantly, it implements the method MPBoot for approxima

3 KB (292 words) - 21:21, 6 December 2019
AmPEPpy
...de sequences using the Distribution descriptor set from the Global Protein Sequence Descriptors. amPEPpy has improved portability, increased accuracy relative

3 KB (291 words) - 14:13, 22 April 2021
MEME
* Search sequence databases using motifs.

2 KB (292 words) - 19:50, 12 August 2022
TraitRate
rate of sequence evolution.

2 KB (292 words) - 16:54, 22 August 2022
InterProScan
This interproscan software allows you to query your sequence against the InterPro databases.

3 KB (334 words) - 19:23, 12 August 2022
FastPlast
...roximately 30 minutes with no user mediation. In addition to a chloroplast sequence, Fast-Plast identifies chloroplast genes present in the final assembly.

2 KB (288 words) - 20:23, 23 May 2022
Haploflow
Haploflow is a strain-aware viral genome assembler for short read sequence data. It uses a flow algorithm on a deBruijn graph data structure to resolv

2 KB (287 words) - 20:31, 3 January 2022
CViT
...Perl script used to quickly generate images that show features on genomic sequence. It is intentionally simplistic and low-tech; there are limitless possibili

2 KB (294 words) - 12:09, 26 August 2022
ZUMIS
...files. In the most common cases, you will have a read containing the cDNA sequence and other read(s) containing UMI and Cell Barcode information. Furthermore,

2 KB (302 words) - 17:55, 22 August 2022
Medaka
medaka is a tool to create a consensus sequence from nanopore sequencing data. This task is performed using neural networks

2 KB (290 words) - 14:37, 21 August 2023
SIFT
...rediction is based on the degree of conservation of amino acid residues in sequence alignments derived from closely related sequences, collected through PSI-BL

2 KB (303 words) - 20:40, 12 August 2022
Kvarq
...rial genome sequences in FastQ format. Mapping to a whole-genome reference sequence or de novo assembly or the short reads is not necessary.

2 KB (287 words) - 15:26, 21 June 2023
BLASTDB
...10.1 of the Rfam collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs). ...lies) of the Rfam collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs).

7 KB (975 words) - 18:52, 10 April 2024
Vamb
Vamb is a metagenomic binner which feeds sequence composition information from a contig catalogue and co-abundance informatio

3 KB (302 words) - 18:59, 1 February 2021
Porechop
...hop performs thorough alignments to effectively find adapters, even at low sequence identity.

2 KB (294 words) - 21:23, 6 December 2019
GramCluster
...d accurate progressive clustering algorithm that relies on a grammar-based sequence distance and is particularly useful in clustering large datasets.

2 KB (280 words) - 16:57, 10 June 2022
PRINSEQ++
...rogram. It can be used to filter, reformat or trim genomic and metagenomic sequence data. It is 5X faster than prinseq-lite.pl and uses less RAM thanks to the

2 KB (298 words) - 18:11, 15 April 2020
BPP
...ecies divergence times (tau), and for delimiting species using multi-locus sequence data from several closely related species.

2 KB (294 words) - 18:21, 12 August 2022
QTLtools
...set for molecular QTL discovery and analysis. It allows to go from the raw sequence data to collection of molecular Quantitative Trait Loci (QTLs) in few easy-

2 KB (297 words) - 20:58, 21 August 2022
Radorgminer
The RADOrgMiner pipeline is designed to genotype sequence tags found in reduced genomic complexity libraries that originate from the

3 KB (297 words) - 21:55, 6 July 2022
Chimera
maps, supramolecular assemblies, sequence alignments, docking results,

3 KB (296 words) - 18:27, 12 August 2022
MAFFT
MAFFT is a multiple sequence alignment program for unix-like operating systems. It offers a range of mu

2 KB (295 words) - 20:23, 15 August 2022
StringMLST
Fast k-mer based tool for multi locus sequence typing (MLST) stringMLST is a tool for detecting the MLST of an isolate dir

2 KB (292 words) - 20:45, 12 August 2022
FastGEAR
FastGEAR is a software for analyzing sequence alignments.

3 KB (313 words) - 20:13, 18 October 2022
FastUniq
...as an ultrafast de novo tool for removal of duplicates in paired short DNA sequence reads in FASTQ format. FastUniq identifies duplicates by comparing sequence

3 KB (301 words) - 16:37, 15 August 2022
Vg
* paths, describe genomes, sequence alignments, and annotations (such as gene models and transcripts) as walks

2 KB (299 words) - 14:42, 4 September 2020
RepeatModeler
...ds for identifying repeat element boundaries and family relationships from sequence data. RepeatModeler assists in automating the runs of RECON and RepeatScout

3 KB (297 words) - 20:32, 12 August 2022
FAR
...s heuristic algorithm such as BLAST. Therefore the alignment of an adapter sequence to a read is very accurate.

2 KB (298 words) - 18:54, 12 August 2022
FastML
...together with posterior probabilities for each character and indel at each sequence position for each internal node of the tree. FastML is generic and is appli

3 KB (312 words) - 18:54, 12 August 2022
Wtdbg
Wtdbg2 is a de novo sequence assembler for long noisy reads produced by PacBio or Oxford Nanopore Techno

3 KB (310 words) - 17:44, 22 August 2022
GPhoCS
...om individual genome sequences. G-PhoCS accepts as input a set of multiple sequence alignments from separate neutrally evolving loci along the genome. Paramete

3 KB (300 words) - 21:20, 6 December 2019
SortMeRNA
SortMeRNA is a biological sequence analysis tool for filtering, mapping and OTU-picking NGS reads. The core al

3 KB (299 words) - 17:00, 10 June 2022
ExpansionHunter
...ber of regions in the human genome consisting of repetitions of short unit sequence (commonly a trimer). Such repeat regions can expand to a size much larger t

3 KB (335 words) - 20:36, 8 March 2022
Physher
...ls to estimate evolutionary rates and divergence times from heterochronous sequence data. BMC Evolutionary Biology 14:163, 2014.]

3 KB (312 words) - 15:28, 16 November 2021
MIGRATE-N
* Sequence data using Felsenstein's 84 model with or without site rate variation, * Single nucleotide polymorphism data (sequence-like data input, HAPMAP-like data input)

5 KB (636 words) - 16:05, 22 December 2022
Pbmm2
...en used as input to pbmm2. Benchmarks show that pbmm2 outperforms BLASR in sequence identity, number of mapped bases, and especially runtime. pbmm2 is the offi

3 KB (303 words) - 20:08, 31 May 2022
MIRA
MIRA - Sequence assembler for whole genome shotgun and EST sequencing data. Can use Sanger,

2 KB (313 words) - 20:42, 18 August 2022
LDhat
*convert: Simple manipulation of DNA sequence data

2 KB (296 words) - 17:09, 10 June 2022
CAP3
CAP3 is a sequence assembly program.

2 KB (302 words) - 13:19, 15 August 2022
Alfred
...ure annotation and haplotype-resolved consensus computation using multiple sequence alignments.

3 KB (294 words) - 19:29, 14 November 2022
EVidenceModeler
Inputs to EVM include the genome sequence, gene predictions and alignment data in GFF3 format, and a list of numeric

3 KB (311 words) - 14:59, 27 July 2020
Bsmap
Xi Y, Li W: BSMAP: whole genome Bisulfite Sequence MAPping program. BMC Bioinformatics (2009) 10:232.

3 KB (304 words) - 18:22, 12 August 2022
CLIFinder
...) are transcribed from LINE 1 antisense promoter and include the L1 5’UTR sequence in antisense orientation followed by the adjacent genomic region.

3 KB (308 words) - 21:11, 6 December 2019
Flye
...tion and analysis step to improve the structural accuracy of the resulting sequence. The package also includes a polisher module, which produces the final asse

3 KB (317 words) - 21:20, 6 December 2019
IMA3
IMa3 is the newest in the IM sequence of programs. It can be used to solve a fundamental problem in evolutionary

3 KB (317 words) - 17:02, 6 June 2022
GDC-Client
...nism in support of high performance data downloads and submission. The raw sequence files, typically stored as BAM or FASTQ, make up the bulk of data. The size

3 KB (311 words) - 18:57, 12 August 2022
PHAST
...a dozen major programs, plus more than a dozen utilities for manipulating sequence alignments, phylogenetic trees, and genomic annotations (see left panel). F

3 KB (320 words) - 17:17, 10 June 2022
BWA
...that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome. It implements two algorithms, bwa-short and BWA-S

2 KB (317 words) - 18:22, 12 August 2022
Bakta
...rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microbial Genomics, 7(11).]

3 KB (308 words) - 15:28, 12 August 2022
Lrgapcloser
...ind the bridging that cross the gap, and then fills the long read original sequence into the genomic gaps.

3 KB (326 words) - 19:30, 12 August 2022
GeneRax
It accounts for sequence substitutions, gene duplication, gene loss and horizontal gene transfer.

3 KB (323 words) - 14:25, 12 October 2020
Stampy
...nd Chip-seq. Stampy excels in the mapping of reads containing that contain sequence variation relative to the reference, in particular for those containing ins

3 KB (327 words) - 20:45, 12 August 2022
DynamicDBG
...ctures for the representation of de Bruijn graphs are important for genome sequence assembly. A fully dynamic data structure supporting efficient and exact mem

3 KB (312 words) - 21:20, 6 December 2019
ABySS
ABySS is a de novo, parallel, paired-end sequence assembler that is designed for short reads. The single-processor version is

2 KB (315 words) - 16:17, 19 August 2022
Subread
...(2014). featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics, 30(7):923-30]

3 KB (315 words) - 12:57, 15 August 2022
MarginPolish
...nt paths for run-length-encoded reads and uses these to generate a refined sequence. It takes as input a FASTA assembly and an indexed BAM (ONT reads aligned t

3 KB (320 words) - 21:22, 6 December 2019
GenePainter
...origin of introns. GenePainter maps gene structures onto protein multiple sequence alignments. Gene structures, as provided by e.g. WebScipio, are aligned wit

3 KB (334 words) - 21:20, 6 December 2019
Fastq-tools
*fastq-match : (smith-waterman) local sequence alignment

3 KB (306 words) - 17:31, 10 June 2022
Gerbil
...em is to build a histogram of all substrings of length k in a given genome sequence. Gerbil is a k-mer counter that is specialized for high effiency when count

3 KB (320 words) - 14:34, 3 January 2020
EDirect
databases (publication, sequence, structure, gene, variation, expression, etc.)

3 KB (357 words) - 14:55, 15 August 2022
SHAPE-MaP
...cing. Chemical modifications are recorded as noncomplementary nucleotides (sequence changes) during cDNA synthesis and are thus not significantly affected by w

3 KB (325 words) - 17:25, 10 June 2022
HapCUT2
DNA sequence reads, designed to just work with excellent speed and

3 KB (317 words) - 20:03, 8 March 2022
FASTA
by identifying local duplications within a sequence. Other programs provide

2 KB (320 words) - 18:54, 12 August 2022
Genblastg
...s local alignments, or high-scoring segment pairs (HSPs) produced by local sequence alignment programs such as BLAST and WU-BLAST and identify groups of HSPs.

3 KB (306 words) - 18:19, 5 February 2024
IgBLAST
...ore, IgBLAST allows searches against the germline gene databases and other sequence databases simultaneously to minimize the chance of missing possibly the bes

3 KB (338 words) - 19:06, 12 August 2022
Redundans
...am takes as input assembled contigs, sequencing libraries and/or reference sequence and returns scaffolded homozygous genome assembly. Final assembly should be

3 KB (332 words) - 21:24, 6 December 2019
MHCflurry
...models. Pan-allele predictors supporting virtually any MHC allele of known sequence are available for testing (see below). MHCflurry runs on Python 2.7 and 3.4

3 KB (322 words) - 16:07, 21 February 2020
RNAmmer
...-level approach it is avoided to run a full model through an entire genome sequence allowing faster predictions.

3 KB (328 words) - 17:35, 10 June 2022
ClonalFrame
ClonalFrame can be applied to any kind of sequence data, from a

3 KB (349 words) - 18:33, 12 August 2022
PlantTribes
...ed orthologous plant gene family clusters, and builds gene family multiple sequence alignments and their corresponding phylogenies.

3 KB (331 words) - 13:29, 4 October 2021
Ansible
...utomation. It can also do IT orchestration, where you have to run tasks in sequence and create a chain of events which must happen on several different servers

3 KB (348 words) - 14:17, 12 August 2022
NCBI BLAST+
The Basic Local Alignment Search Tool (BLAST) is the most widely used sequence similarity tool. There are versions of BLAST that compare protein queries t

3 KB (344 words) - 19:55, 12 August 2022
Platanus
Platanus is a novel de novo sequence assembler that can reconstruct genomic sequences of

3 KB (318 words) - 21:23, 6 December 2019
Satsuma
...., & Lindblad-Toh, K. (2010). Genome-wide synteny through highly sensitive sequence alignment: Satsuma. Bioinformatics, 26(9), 1145-51.]

3 KB (332 words) - 21:24, 6 December 2019
Wgsim
Wgsim is a small tool for simulating sequence reads from a reference genome.

3 KB (343 words) - 21:29, 6 December 2019
AMRPlusPlus
...want to know what is contained within your sample or how abundant a given sequence is relative to another.

3 KB (365 words) - 21:19, 6 December 2019
Syri
...., Sun, H., Jiao, W. et al. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol 20, 277 (2019) doi:10

3 KB (324 words) - 13:31, 15 August 2022
TGSGapCloser
...ftware tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill

3 KB (321 words) - 15:35, 20 April 2021
FastQC
...a comprehensive QC report to tell if there is anything unusual about your sequence. Each test is flagged as a pass, warning or fail depending on how far it d

3 KB (382 words) - 15:25, 15 August 2022
ElPrep
...D, Fostier J, Verachtert W (2019) elPrep 4: A multithreaded framework for sequence analysis. PLoS ONE 14(2): e0209523. https://doi.org/10.1371/journal.pone.02

3 KB (345 words) - 21:20, 6 December 2019
Jellyfish
substrings is a central step in many analyses of DNA sequence. JELLYFISH can

3 KB (333 words) - 19:27, 12 August 2022
CASAVA
Illumina's Consensus Assessment of Sequence and Variation (CASAVA) software captures summary information for resequenci

3 KB (340 words) - 18:23, 12 August 2022
PALADIN
PALADIN is a protein sequence alignment tool designed for the accurate functional characterization of met

3 KB (361 words) - 18:31, 10 June 2022
Seq-Gen
...t of the commonly used (and computationally tractable) models of molecular sequence evolution.

3 KB (351 words) - 17:58, 10 June 2022
Sabre
...ences for multiple lines or multiple species is a cost-efficient method to sequence and analyze a broad range of data.

3 KB (356 words) - 16:32, 2 November 2020
ClonalFrameML
ClonalFrameML can be applied to any type of aligned sequence data, but is

3 KB (348 words) - 13:47, 15 August 2022
Seqwish
...rithm uses a series of disk-backed sorts and passes over the alignment and sequence inputs to allow the graph to be constructed from very large inputs that are

3 KB (347 words) - 21:24, 6 December 2019
MEGA
Software suite for analyzing DNA and protein sequence data from species and populations.

3 KB (337 words) - 20:30, 15 December 2020
PolyGembler
...population, as well as high coverage (i.e. greater than 30X) whole genome sequence data on a reference sample, or alternatively the availability of a set of r

3 KB (344 words) - 13:45, 3 November 2020
P53MH
Computes a score between 0 and 100 for each site in the DNA sequence of a gene. A high score indicates an increased probability that n nucleotid

3 KB (344 words) - 18:35, 10 June 2022
RNAstructure
than single sequence secondary structure prediction. Finally, RNAstructure

3 KB (359 words) - 21:58, 21 August 2022
Ensembl Tools
Ensembl Tools provide a number of ready-made tools for processing sequence data.

3 KB (333 words) - 18:52, 12 August 2022
Meraculous
...mer/read-based assembler that capitalizes on the high accuracy of Illumina sequence by eschewing an explicit error correction step which we argue to be redunda

3 KB (354 words) - 18:36, 10 June 2022
Trinotate
...enced methods for functional annotation including homology search to known sequence data (BLAST+/SwissProt/Uniref90), protein domain identification (HMMER/PFAM

3 KB (344 words) - 20:29, 7 December 2023
TreeTime
reeTime provides routines for ancestral sequence reconstruction and inference of molecular-clock phylogenies, i.e., a tree w

3 KB (370 words) - 17:44, 2 November 2020
ABruijn
...draft assembly by concatenating different parts of raw reads. This coarse sequence is then polished into a high quality assembly.

3 KB (354 words) - 20:09, 24 August 2022
MasHmap
...nation of Minimizers and MinHash. This is then converted to an estimate of sequence identity using the Mash distance. An appropriate k-mer sampling rate is aut

3 KB (359 words) - 17:28, 27 May 2022
Mobsuite
Robertson, James et al. “Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidem

3 KB (340 words) - 20:03, 10 October 2022
Nanocaller
...to generate indel calling, and then creates consensus sequences for indel sequence prediction.

3 KB (351 words) - 15:41, 16 March 2023
Picard-tools
...nce Alignment/Map) format is a generic format for storing large nucleotide sequence alignments. SAM is described in the SAMtools project page.

6 KB (882 words) - 15:25, 27 January 2023
MentaLiST
MLST (multi-locus sequence typing) is a classic technique for genotyping bacteria, widely applied for

3 KB (368 words) - 18:51, 10 June 2022
AdapterRemoval
AdapterRemoval may be used to recover a consensus adapter sequence for

3 KB (357 words) - 20:47, 11 August 2022
BinPacker
...sta format. Briefly, it works in two step: first, BinPacker partitions the sequence data into many individual splicing graphs, each capturing the full transcri

3 KB (365 words) - 12:53, 15 August 2022
Blast Job Scripts
faSplit sequence ../large.fasta 120 blast_query_

2 KB (358 words) - 20:27, 3 June 2022
Musket
...: " Musket: a multistage k-mer spectrum based error corrector for Illumina sequence data". Bioinformatics, 2013, 29(3): 308-315]

3 KB (351 words) - 13:44, 19 August 2022
Chewbbaca
...for the creation and validation of whole genome and core genome MultiLocus Sequence Typing (wg/cgMLST) schemas, providing an allele calling algorithm based on

3 KB (346 words) - 15:58, 20 October 2023
PHYling
analyses and also projected into coding sequence alignments for codon-based

3 KB (375 words) - 18:17, 3 June 2022
PASTA
...ng an iterative approach. In each iteration, it first estimates a multiple sequence alignment using the current tree as a guide and then estimates a ML tree on

3 KB (381 words) - 19:31, 24 August 2022
NewTargets
Bioinformatic sequence recovery for universal target-capture bait kits can be substantially improv

3 KB (379 words) - 20:06, 12 August 2022
VirusDetect
...tigs are provided as guidance to discovery novel viruses which do not show sequence similarity with known viruses.

3 KB (377 words) - 23:22, 30 July 2020
TransDecoder
A minimum length open reading frame (ORF) is found in a transcript sequence

3 KB (371 words) - 16:54, 22 August 2022
Blast2GO
the XML belonging to each sequence (provided in FASTA format). The program query sequence. Additionally you can add domain/motif based GO term obtained

6 KB (920 words) - 14:08, 16 March 2023
ATRAM
* aTRAM.pl: this script runs aTRAM with a target sequence and the formatted short-read archive.

3 KB (350 words) - 15:23, 12 August 2022
SNAP
[https://arxiv.org/abs/1111.5572 Faster and More Accurate Sequence Alignment with SNAP. Matei Zaharia, William J. Bolosky, Kristal Curtis, Arm

3 KB (372 words) - 20:42, 12 August 2022
Annocript
...putative long non-coding RNAs by using an heuristic based on homology and sequence features.

3 KB (387 words) - 21:18, 6 December 2019
Vcontact
...ool to perform guilt-by-contig-association classification of viral genomic sequence data. It's designed to cluster and provide taxonomic context of viral metag

3 KB (358 words) - 20:04, 27 May 2022
Bracken
...the Kraken database itself to derive probabilities that describe how much sequence from each genome is identical to other genomes in the database, and combine

3 KB (402 words) - 19:01, 10 June 2022
DAWGPAWS
...a Distributed Annotation Working Group (DAWG) in the annotation of genomic sequence contigs. These programs generate the multiple tracks of annotation evidence

3 KB (381 words) - 21:20, 6 December 2019
Metacrast
...ttern search algorithm to rapidly select reads that contain an expected DR sequence. It then proceeds through reads identified in the previous step to find DR

3 KB (392 words) - 17:58, 27 May 2022
PerM
The reference sequence(s) can be whole genomes with multiple chromosomes, the transcriptome or eve

3 KB (396 words) - 17:40, 21 August 2022
Glimmer
...which we term an Interpolated Context Model or ICM). The models for coding sequence are 3-periodic nonhomogenous Markov models. Improvements made in version 3

3 KB (391 words) - 17:09, 15 August 2022
Giraf
...ational tool for identification of reassortments in influenza viruses from sequence databases of isolates. Reassortments in influenza - a process where strains

3 KB (382 words) - 21:12, 7 December 2022
MitoPhAST
3) Reports various mitochondrial genes and sequence information in a simple table format.

3 KB (382 words) - 14:27, 23 April 2020
Saffrontree
...om raw reads or from assembled sequences, without the need for a reference sequence or de novo assemblies. SaffronTree takes FASTQ/FASTA files as input and use

3 KB (400 words) - 17:17, 31 May 2022
Vcflib
vcflib provides methods to manipulate and interpret sequence variation as it can be described by VCF. It is both:

3 KB (390 words) - 20:54, 12 August 2022
Liftoff
...es. For each gene, Liftoff finds the alignments of the exons that maximize sequence identity while preserving the transcript and gene structure. If two genes i

3 KB (399 words) - 13:14, 23 September 2020
HaMStR
HaMSTR is a Hidden Markov Model based search tool to screen EST sequence data for the presence of putative orthologs to a pre-defined set of genes.

3 KB (420 words) - 20:09, 12 August 2022
AntiSMASH
...es. antiSMASH not only detects the gene clusters, but also offers detailed sequence analysis.

3 KB (409 words) - 21:11, 6 December 2019
CNCI
...tao Yu, Changhai Zhang, Yuanning Liu, RunSheng Chen and Yi Zhao* Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transc

3 KB (374 words) - 14:09, 23 April 2021
MACS
...mic Poisson distribution to effectively capture local biases in the genome sequence, allowing for more sensitive and robust prediction. MACS compares favorably

3 KB (384 words) - 19:47, 12 August 2022
GARLI
...rms phylogenetic inference using the maximum-likelihood criterion. Several sequence types are supported, including nucleotide, amino acid and codon. Version 2.

3 KB (485 words) - 16:53, 15 August 2022
Blast
...between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can

3 KB (402 words) - 18:17, 12 August 2022
V-Phaser
...to call variants in genetically heterogeneous populations from ultra-deep sequence data. It combines information regarding the covariation (i.e. phasing) betw

3 KB (390 words) - 20:07, 27 May 2022
RepeatAnalysisTools
sequence data generated with the PacBio No-Amp Targeted Sequencing

4 KB (430 words) - 16:55, 15 December 2022
Graphtyper
...n acyclic graph structure (a "pangenome reference"), which high-throughput sequence reads are re-aligned to for the purpose of discovering and genotyping SNPs,

3 KB (373 words) - 15:43, 17 November 2021
Coevol
simultaneously on a sequence alignment, a matrix of quantitative characters

3 KB (399 words) - 18:37, 12 August 2022
Csubst
* Simulated sequence evolution under specified scenarios of convergent evolution

3 KB (380 words) - 22:12, 26 July 2023
BMGE
...w-software-for-selection-of-phylogenetic-informative-regions-from-multiple-sequence-alignments/}} ...Gathering with Entropy), that is designed to select regions in a multiple sequence alignment that are suited for phylogenetic inference. For each character, B

3 KB (407 words) - 13:04, 15 August 2022
Rascaf
...lter the connections by searching the merged gene sequences against public sequence databases. Finally, Rascaf uses these connections to select and/or re-arran

3 KB (415 words) - 21:30, 21 August 2022
MUMmer
...ogram included with the system. If the species are too divergent for a DNA sequence alignment to detect similarity, then the PROmer program can generate alignm

3 KB (393 words) - 13:39, 19 August 2022
AI Education and Training
* Sequence Models

3 KB (463 words) - 21:43, 7 November 2022
TRNAscan-Se
...eed of about 30 kb/second. It was implemented for large-scale human genome sequence analysis, but is applicable to other DNAs as well. It applies our COVE soft

3 KB (389 words) - 20:53, 12 August 2022
VirusFinder
...ly developed algorithm, Virus intEgration site detection through Reference SEquence customization (VERSE) (manuscript in consideration) as well as other new fe

3 KB (410 words) - 21:29, 6 December 2019
Illumiprocessor
trimmer for Illumina Sequence Data. Bioinformatics.

4 KB (452 words) - 18:44, 15 August 2022
Mauve
...ources. It employs algorithmic techniques that scale well in the amount of sequence being aligned. For example, a pair of Y. pestis genomes can be aligned in u

3 KB (448 words) - 19:49, 12 August 2022
ARC
...es to assemble discreet 'targets' contained within next-generation shotgun sequence data. ARC decomplexifies the traditionally difficult problem of assembly by

3 KB (430 words) - 14:32, 12 August 2022
SNPGenie
...ity of a virus population within a single host, it might be appropriate to sequence the pooled nucleic acid content of the virus in a blood sample from that ho

3 KB (423 words) - 18:41, 1 June 2022
BALi-Phy
BAli-Phy can estimate phylogenetic trees from sequence data when the alignment is uncertain. Instead of conditioning on a single a

3 KB (402 words) - 17:19, 14 December 2022
BamBam
...ordinates in a BAM file as if it had been built from a different reference sequence

3 KB (456 words) - 18:00, 12 August 2022
Platon
...acterial plasmid contigs in short-read draft assemblies exploiting protein sequence-based replicon distribution scores. Microbial Genomics, 95, 295. https://do

4 KB (437 words) - 17:14, 14 December 2022
GSNP
...e of each base. These quality scores are used as the standard of consensus sequence calling. Combined with the prior probability of dbSNP allele, it gets a low

3 KB (459 words) - 17:27, 15 August 2022
Velvet
Sequence assembler for very short reads.

4 KB (523 words) - 17:06, 14 December 2022
Ea-utils
:: Scans a sequence file for adapters, and, based on a log-scaled threshold, determines a set o

4 KB (438 words) - 14:54, 15 August 2022
Amrfinderplus
...her classes of genes using protein annotations and/or assembled nucleotide sequence. AMRFinderPlus is used in the Pathogen Detection pipeline, and these data a

4 KB (447 words) - 15:59, 27 May 2022
Kmerfreq
kmerfreq count K-mer (with size K) frequency from the input sequence data,typically sequencing reads data, and reference genome data is also app

3 KB (449 words) - 19:28, 12 August 2022
Sickle
...strings there for the 3'-end cut. However, if the length of the remaining sequence is less than the minimum length threshold, then the read is discarded entir

4 KB (551 words) - 22:56, 21 August 2022
Kraken
...s by other bioinformatics software to accomplish this task have often used sequence alignment or machine learning techniques that were quite slow, leading to t

4 KB (483 words) - 19:28, 12 August 2022
Stacks
...s developed for the purpose of building genetic maps from RAD-Tag Illumina sequence data, but can also be readily applied to population studies, and phylogeogr

4 KB (658 words) - 21:20, 2 January 2023
Trinity
...entially to process large volumes of RNA-seq reads. Trinity partitions the sequence data into many individual de Bruijn graphs, each representing the transcrip

4 KB (556 words) - 17:10, 22 August 2022
GATK
...two packages. Our SNP calling pipeline (Q score recalibration -> multiple sequence realignment -> snp/index calling) is a particular area of focus, and have b

4 KB (540 words) - 16:53, 15 August 2022
Khmer
: Crusoe et al., The khmer software package: enabling efficient sequence analysis. 2014. doi: 10.6084/m9.figshare.979190

4 KB (504 words) - 17:01, 14 December 2022
ANNOVAR
* Other functionalities: Retrieve the nucleotide sequence in any user-specific genomic positions in batch, identify a candidate gene

4 KB (545 words) - 14:16, 12 August 2022
DDT
...u were going to submit it to the queue. Be sure that you add the following sequence before the <code>mpiexec</code> command in your script:

5 KB (754 words) - 19:24, 30 October 2023
Transfer Data with Globus
To transfer from a remote site to UFRC reverse the sequence in scenarios 2 and 3.

10 KB (1,541 words) - 20:03, 17 January 2023

Search results

Navigation menu

Search