Difference between revisions of "Reference Data"

From UFRC
Jump to navigation Jump to search
Line 15: Line 15:
 
* [https://www.internationalgenome.org/data/ 1000 Genomes Data Releases] - /data/reference/1000genomes
 
* [https://www.internationalgenome.org/data/ 1000 Genomes Data Releases] - /data/reference/1000genomes
 
* [https://www.girinst.org/repbase/ RepBase] - /data/reference/repbase
 
* [https://www.girinst.org/repbase/ RepBase] - /data/reference/repbase
 +
* [https://bfd.mmseqs.com/ BFD] - /data/reference/bfd
 +
* [https://www.uniprot.org/help/uniref Uniref30] - /data/reference/uniref30
 
* Various other references - /data/reference/fasta
 
* Various other references - /data/reference/fasta
  

Revision as of 18:17, 22 July 2021

UFRC maintains a repository of reference data that can be accessed by all HiPerGator users. The primary purposes of this repository are researcher convenience, efficient use of filesystem space, and cost savings. We are happy to download and build reference datasets and configure applications installed on HiPerGator to automatically make use of the available reference data. Having UFRC host common reference data means that a research group does not have to use their Blue or Orange quota to host redundant copies of common reference data.

Use https://support.rc.ufl.edu to request either addition of reference data or to ask for an addition of a directory that you can put reference data into for shared use.

The following is not an exhaustive list of the hosted reference data. If an existing reference is missing from the list below please let us know and we will update the list.

Application-Specific Data

Many other application specific references are located in /data/reference

Raw Genomic Data

AI Reference Datasets and Models

A variety of reference machine learning and AI datasets are located in /data/ai. Browse the catalog of all available AI reference datasets to learn more.

Pre-compiled models can be found in /data/reference/ai/models.

Data