View article

Authors

Shifu Chen, Yanqing Zhou, Yaru Chen, Tanxiao Huang, Wenting Liao, Yun Xu, Zhicheng Li, Jia Gu

Publication date

2019/12/27

Journal

Bmc Bioinformatics

Volume

Issue

Suppl 23

Pages

606

Publisher

BioMed Central

Description

Background

Removing duplicates might be considered as a well-resolved problem in next-generation sequencing (NGS) data processing domain. However, as NGS technology gains more recognition in clinical application, researchers start to pay more attention to its sequencing errors, and prefer to remove these errors while performing deduplication operations. Recently, a new technology called unique molecular identifier (UMI) has been developed to better identify sequencing reads derived from different DNA fragments. Most existing duplicate removing tools cannot handle the UMI-integrated data. Some modern tools can work with UMIs, but are usually slow and use too much memory. Furthermore, existing tools rarely report rich statistical results, which are very important for quality control and downstream analysis. These unmet requirements drove us to develop an ultra-fast, simple, little-weighted but …

Total citations

Cited by 58

2019202020212022202320241 6 9 14 16 10

Scholar articles

Gencore: an efficient tool to generate consensus reads for error suppressing and duplicate removing of NGS data

S Chen, Y Zhou, Y Chen, T Huang, W Liao, Y Xu, Z Li… - Bmc Bioinformatics, 2019