Difference between revisions of "Khmer"
Jump to navigation
Jump to search
Moskalenko (talk | contribs) m (Text replace - "<!--Location--> {{App_Location|app={{#var:app}}|{{#var:ver}}}}" to "") |
Moskalenko (talk | contribs) m (Text replacement - "#uppercase" to "uc") |
||
(16 intermediate revisions by 3 users not shown) | |||
Line 2: | Line 2: | ||
__NOEDITSECTION__ | __NOEDITSECTION__ | ||
[[Category:Software]][[Category:Bioinformatics]][[Category:NGS]] | [[Category:Software]][[Category:Bioinformatics]][[Category:NGS]] | ||
− | + | {|<!--Main settings - REQUIRED--> | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | {| | ||
− | <!--Main settings - REQUIRED--> | ||
|{{#vardefine:app|khmer}} | |{{#vardefine:app|khmer}} | ||
− | |{{#vardefine:url|https://github.com/ | + | |{{#vardefine:url|https://github.com/ged-lab/khmer}} |
− | |||
− | |||
− | |||
− | |||
− | |||
|{{#vardefine:exe|}} <!--Present manual instructions for running the software --> | |{{#vardefine:exe|}} <!--Present manual instructions for running the software --> | ||
|{{#vardefine:conf|}} <!--Enable config wiki page link - {{#vardefine:conf|1}} = ON/conf|}} = OFF--> | |{{#vardefine:conf|}} <!--Enable config wiki page link - {{#vardefine:conf|1}} = ON/conf|}} = OFF--> | ||
Line 26: | Line 11: | ||
|{{#vardefine:testing|}} <!--Enable performance testing/profiling section --> | |{{#vardefine:testing|}} <!--Enable performance testing/profiling section --> | ||
|{{#vardefine:faq|}} <!--Enable FAQ section --> | |{{#vardefine:faq|}} <!--Enable FAQ section --> | ||
− | |{{#vardefine:citation|}} <!--Enable Reference/Citation section --> | + | |{{#vardefine:citation|1}} <!--Enable Reference/Citation section --> |
|} | |} | ||
<!-- ######## Template Body ######## --> | <!-- ######## Template Body ######## --> | ||
Line 32: | Line 17: | ||
{{#if: {{#var: url}}| | {{#if: {{#var: url}}| | ||
{{App_Description|app={{#var:app}}|url={{#var:url}}|name={{#var:app}}}}|}} | {{App_Description|app={{#var:app}}|url={{#var:url}}|name={{#var:app}}}}|}} | ||
− | |||
− | + | Khmer - python scripts for k-mer counting, filtering and graph traversal. | |
− | |||
− | |||
− | |||
− | |||
Available scripts: abundance-dist.py, count-median.py, do-partition.sh, filter-abund.py, find-knots.py, load-into-counting.py, merge-partitions.py, normalize-by-median.py, partition-graph.py, annotate-partitions.py, count-overlap.py, extract-partitions.py, filter-stoptags.py, load-graph.py, make-initial-stoptags.py, normalize-by-kadian.py, normalize-by-min.py | Available scripts: abundance-dist.py, count-median.py, do-partition.sh, filter-abund.py, find-knots.py, load-into-counting.py, merge-partitions.py, normalize-by-median.py, partition-graph.py, annotate-partitions.py, count-overlap.py, extract-partitions.py, filter-stoptags.py, load-graph.py, make-initial-stoptags.py, normalize-by-kadian.py, normalize-by-min.py | ||
− | + | Use "import khmer" in your script or in an interactive python session. | |
− | + | <!--Modules--> | |
− | + | ==Required Modules== | |
− | + | [[Modules|modules documentation]] | |
− | <!-- --> | + | ===Serial=== |
− | + | *{{#var:app}} | |
− | + | ==System Variables== | |
− | * HPC_KHMER_BIN | + | * HPC_{{uc:{{#var:app}}}}_DIR |
− | * | + | * HPC_KHMER_BIN |
− | + | * HPC_KHMER_LIB | |
{{#if: {{#var: exe}}|==How To Run== | {{#if: {{#var: exe}}|==How To Run== | ||
WRITE INSTRUCTIONS ON RUNNING THE ACTUAL BINARY|}} | WRITE INSTRUCTIONS ON RUNNING THE ACTUAL BINARY|}} | ||
Line 58: | Line 38: | ||
{{#if: {{#var: pbs}}|==PBS Script Examples== | {{#if: {{#var: pbs}}|==PBS Script Examples== | ||
See the [[{{PAGENAME}}_PBS]] page for {{#var: app}} PBS script examples.|}} | See the [[{{PAGENAME}}_PBS]] page for {{#var: app}} PBS script examples.|}} | ||
− | {{#if: {{#var: policy}}|==Usage | + | {{#if: {{#var: policy}}|==Usage Policy== |
WRITE USAGE POLICY HERE (perhaps templates for a couple of main licensing schemes can be used)|}} | WRITE USAGE POLICY HERE (perhaps templates for a couple of main licensing schemes can be used)|}} | ||
{{#if: {{#var: testing}}|==Performance== | {{#if: {{#var: testing}}|==Performance== | ||
Line 65: | Line 45: | ||
*'''Q:''' **'''A:'''|}} | *'''Q:''' **'''A:'''|}} | ||
{{#if: {{#var: citation}}|==Citation== | {{#if: {{#var: citation}}|==Citation== | ||
− | If you | + | If you use the khmer software, you must cite: |
− | + | ||
+ | : Crusoe et al., The khmer software package: enabling efficient sequence analysis. 2014. doi: 10.6084/m9.figshare.979190 | ||
+ | |||
+ | If you use any of khmer's published scientific methods, you should *also* cite the relevant paper(s), as directed below. | ||
+ | |||
+ | * Graph partitioning and/or compressible graph representation | ||
+ | : The load-graph.py, partition-graph.py, find-knots.py, load-graph.py, and partition-graph.py scripts are part of the compressible graph representation and partitioning algorithms described in: | ||
+ | :: Pell J, Hintze A, Canino-Koning R, Howe A, Tiedje JM, Brown CT | ||
+ | :: Proc Natl Acad Sci U S A. 2012 Aug 14;109(33):13272-7 | ||
+ | :: doi: 10.1073/pnas.1121464109 | ||
+ | :: PMID: 22847406 | ||
+ | * Digital normalization | ||
+ | : The normalize-by-median.py and count-median.py scripts are part of the digital normalization algorithm, described in: | ||
+ | :: A Reference-Free Algorithm for Computational Normalization of Shotgun Sequencing Data | ||
+ | :: Brown CT, Howe AC, Zhang Q, Pyrkosz AB, Brom TH | ||
+ | :: arXiv:1203.4802 [q-bio.GN] | ||
+ | :: http://arxiv.org/abs/1203.4802 | ||
+ | * K-mer counting | ||
+ | : The abundance-dist.py, filter-abund.py, and load-into-counting.py scripts implement the probabilistic k-mer counting described in: | ||
+ | :: These are not the k-mers you are looking for: efficient online k-mer counting using a probabilistic data structure | ||
+ | :: Zhang Q, Pell J, Canino-Koning R, Howe AC, Brown CT. | ||
+ | :: arXiv:1309.2975 [q-bio.GN] | ||
+ | :: http://arxiv.org/abs/1309.2975 | ||
+ | |||
|}} | |}} | ||
+ | =Validation= | ||
+ | * Validated 4/5/2018 |
Revision as of 21:21, 6 December 2019
Description
Khmer - python scripts for k-mer counting, filtering and graph traversal.
Available scripts: abundance-dist.py, count-median.py, do-partition.sh, filter-abund.py, find-knots.py, load-into-counting.py, merge-partitions.py, normalize-by-median.py, partition-graph.py, annotate-partitions.py, count-overlap.py, extract-partitions.py, filter-stoptags.py, load-graph.py, make-initial-stoptags.py, normalize-by-kadian.py, normalize-by-min.py
Use "import khmer" in your script or in an interactive python session.
Required Modules
Serial
- khmer
System Variables
- HPC_KHMER_DIR
- HPC_KHMER_BIN
- HPC_KHMER_LIB
Citation
If you use the khmer software, you must cite:
- Crusoe et al., The khmer software package: enabling efficient sequence analysis. 2014. doi: 10.6084/m9.figshare.979190
If you use any of khmer's published scientific methods, you should *also* cite the relevant paper(s), as directed below.
- Graph partitioning and/or compressible graph representation
- The load-graph.py, partition-graph.py, find-knots.py, load-graph.py, and partition-graph.py scripts are part of the compressible graph representation and partitioning algorithms described in:
- Pell J, Hintze A, Canino-Koning R, Howe A, Tiedje JM, Brown CT
- Proc Natl Acad Sci U S A. 2012 Aug 14;109(33):13272-7
- doi: 10.1073/pnas.1121464109
- PMID: 22847406
- Digital normalization
- The normalize-by-median.py and count-median.py scripts are part of the digital normalization algorithm, described in:
- A Reference-Free Algorithm for Computational Normalization of Shotgun Sequencing Data
- Brown CT, Howe AC, Zhang Q, Pyrkosz AB, Brom TH
- arXiv:1203.4802 [q-bio.GN]
- http://arxiv.org/abs/1203.4802
- K-mer counting
- The abundance-dist.py, filter-abund.py, and load-into-counting.py scripts implement the probabilistic k-mer counting described in:
- These are not the k-mers you are looking for: efficient online k-mer counting using a probabilistic data structure
- Zhang Q, Pell J, Canino-Koning R, Howe AC, Brown CT.
- arXiv:1309.2975 [q-bio.GN]
- http://arxiv.org/abs/1309.2975
Validation
- Validated 4/5/2018