View article

[HTML] from plos.org

QuorUM: an error corrector for Illumina reads

Authors

Guillaume Marçais, James A Yorke, Aleksey Zimin

Publication date

2015/6/17

Journal

PloS one

Volume

Issue

Pages

e0130821

Publisher

Public Library of Science

Description

Motivation

Illumina Sequencing data can provide high coverage of a genome by relatively short (most often 100 bp to 150 bp) reads at a low cost. Even with low (advertised 1%) error rate, 100 × coverage Illumina data on average has an error in some read at every base in the genome. These errors make handling the data more complicated because they result in a large number of low-count erroneous k-mers in the reads. However, there is enough information in the reads to correct most of the sequencing errors, thus making subsequent use of the data (e.g. for mapping or assembly) easier. Here we use the term “error correction” to denote the reduction in errors due to both changes in individual bases and trimming of unusable sequence. We developed an error correction software called QuorUM. QuorUM is mainly aimed at error correcting Illumina reads for subsequent assembly. It is designed around the novel idea of minimizing the number of distinct erroneous k-mers in the output reads and preserving the most true k-mers, and we introduce a composite statistic π that measures how successful we are at achieving this dual goal. We evaluate the performance of QuorUM by correcting actual Illumina reads from genomes for which a reference assembly is available.

Results

We produce trimmed and error-corrected reads that result in assemblies with longer contigs and fewer errors. We compared QuorUM against several published error correctors and found that it is the best performer in most metrics we use. QuorUM is efficiently implemented making use of current multi-core computing architectures and it is suitable for large data sets (1 billion …

Total citations

Cited by 100

201420152016201720182019202020212022202320242 3 8 23 6 15 16 4 15 5 2

Scholar articles

QuorUM: an error corrector for Illumina reads

G Marçais, JA Yorke, A Zimin - PLoS One, 2015