The Multisample Variant Format (MVF) is designed for compact storage and efficient analysis of multi-genome and multi-transcriptome datasets. The programs in MVFtools support this format with conversion utilities, filtering and transformation programs, and data analysis and visualization modules. MVF is designed specifically for biological data analysis: sequence data is encoded according to the information content at each aligned sequence site. This contextual encoding allows rapid computation of phylogenetic and population genetic analyses, and it produces small files that are easy to share and distribute.

Environment Modules

Run module spider mvftools to find out what environment modules are available for this application.

System Variables

  • HPC_MVFTOOLS_DIR - installation directory
  • HPC_MVFTOOLS_BIN - executable directory
  • HPC_MVFTOOLS_EXE - example directory
  • HPC_MVFTOOLS_TST - test directory
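
As a quick check that the variables listed above are defined in the current session, the following minimal Python sketch reports each one. It assumes an mvftools module has already been loaded in the shell that starts Python; the helper name mvftools_paths is hypothetical and is not part of MVFtools.

```python
import os

# Variable names as listed above. They are expected to be set only after
# an mvftools environment module has been loaded (assumption).
MVFTOOLS_VARS = (
    "HPC_MVFTOOLS_DIR",
    "HPC_MVFTOOLS_BIN",
    "HPC_MVFTOOLS_EXE",
    "HPC_MVFTOOLS_TST",
)


def mvftools_paths(environ=None):
    """Map each variable name to its value, or None if it is unset.

    Hypothetical helper for illustration; pass a dict to test without
    touching the real environment.
    """
    env = os.environ if environ is None else environ
    return {name: env.get(name) for name in MVFTOOLS_VARS}


if __name__ == "__main__":
    for name, path in mvftools_paths().items():
        print(f"{name} = {path if path else '(not set; load the module first)'}")
```

If any variable is reported as not set, the module is most likely not loaded in that session.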


If you publish research that uses MVFtools, cite it as follows:

Pease JB and BK Rosenzweig. 2018. "Encoding Data Using Biological Principles: the Multisample Variant Format for Phylogenomics and Population Genomics" IEEE/ACM Transactions on Computational Biology and Bioinformatics. 15(4):1231-1238.