Authors
Steven Lewis, Attila Csordas, Sarah Killcoyne, Henning Hermjakob, Michael R Hoopmann, Robert L Moritz, Eric W Deutsch, John Boyle
Publication date
2012/12
Journal
BMC bioinformatics
Volume
13
Pages
1-6
Publisher
BioMed Central
Description
Background
For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed.
Results
We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of …
Total citations
2013201420152016201720182019202020212022202327111211943251
Scholar articles