PALADIN
Description
Protein ALignment And Detection INterface
PALADIN is a protein sequence alignment tool designed for the accurate functional characterization of metagenomes.
PALADIN is based on BWA, and aligns sequences via read-mapping using BWT. PALADIN, however, offers the novel approach of aligning in the protein space. During the index phase, it processes the reference genome's nucleotide sequences and GTF/GFF annotation containing CDS entries, first converting these transcripts into the corresponding protein sequences, then creating the BWT and suffix array from these proteins. The process of translatation is skiped when providing a protein reference file (e.g., UniProt) for mapping. During the alignment phase, it attempts to find ORFs in the read sequences, then converts these to protein sequences, and aligns to the reference protein sequences.
Required Modules
Serial
- gcc/5.2.0
- paladin
System Variables
- HPC_PALADIN_DIR - installation directory
Citation
If you publish research that uses paladin you have to cite it as follows: