Authors
John A Hawkins, Stephen K Jones, Ilya J Finkelstein, William H Press
Publication date
2018/7/3
Journal
Proceedings of the National Academy of Sciences
Volume
115
Issue
27
Pages
E6217-E6226
Publisher
National Academy of Sciences
Description
Many large-scale, high-throughput experiments use DNA barcodes, short DNA sequences prepended to DNA libraries, for identification of individuals in pooled biomolecule populations. However, DNA synthesis and sequencing errors confound the correct interpretation of observed barcodes and can lead to significant data loss or spurious results. Widely used error-correcting codes borrowed from computer science (e.g., Hamming, Levenshtein codes) do not properly account for insertions and deletions (indels) in DNA barcodes, even though deletions are the most common type of synthesis error. Here, we present and experimentally validate filled/truncated right end edit (FREE) barcodes, which correct substitution, insertion, and deletion errors, even when these errors alter the barcode length. FREE barcodes are designed with experimental considerations in mind, including balanced guanine-cytosine (GC …
Total citations
201820192020202120222023202414101316106
Scholar articles
JA Hawkins, SK Jones Jr, IJ Finkelstein, WH Press - Proceedings of the National Academy of Sciences, 2018