PALADIN

From UFRC
Jump to navigation Jump to search

Description

paladin website  

Protein ALignment And Detection INterface

PALADIN is a protein sequence alignment tool designed for the accurate functional characterization of metagenomes.

PALADIN is based on BWA, and aligns sequences via read-mapping using BWT. PALADIN, however, offers the novel approach of aligning in the protein space. During the index phase, it processes the reference genome's nucleotide sequences and GTF/GFF annotation containing CDS entries, first converting these transcripts into the corresponding protein sequences, then creating the BWT and suffix array from these proteins. The process of translatation is skiped when providing a protein reference file (e.g., UniProt) for mapping. During the alignment phase, it attempts to find ORFs in the read sequences, then converts these to protein sequences, and aligns to the reference protein sequences.

Required Modules

Serial

  • gcc/5.2.0
  • paladin

System Variables

  • HPC_PALADIN_DIR - installation directory




Citation

If you publish research that uses paladin you have to cite it as follows:

https://academic.oup.com/bioinformatics/article/doi/10.1093/bioinformatics/btx021/2964789/PALADIN-protein-alignment-for-functional-profiling