Authors
Magnus Ekeberg, Tuomo Hartonen, Erik Aurell
Publication date
2014/11/1
Journal
Journal of Computational Physics
Volume
276
Pages
341-356
Publisher
Academic Press
Description
Direct-coupling analysis is a group of methods to harvest information about coevolving residues in a protein family by learning a generative model in an exponential family from data. In protein families of realistic size, this learning can only be done approximately, and there is a trade-off between inference precision and computational speed. We here show that an earlier introduced l 2-regularized pseudolikelihood maximization method called plmDCA can be modified as to be easily parallelizable, as well as inherently faster on a single processor, at negligible difference in accuracy. We test the new incarnation of the method on 143 protein family/structure-pairs from the Protein Families database (PFAM), one of the larger tests of this class of algorithms to date.
Total citations
201320142015201620172018201920202021202220232024122201924281726231410