Raw data from sequencing machines is not usually completely prepared for analysis since sequences often contain remnants of nucleotides that have been used during DNA library preparation. Such nucleotides can potentially case hurdles during data analysis and hereby need to be removed. In additional, the noise, which is presented as low-quality bases, has serious impact on genome assembly and mapping. We propose SeqyClean, a specialized cleaning pipeline that alleviates these issues.
- Validated 4/5/2018