View article

[PDF] from mlr.press

Distributed stochastic gradient MCMC

Authors

Sungjin Ahn, Babak Shahbaba, Max Welling

Publication date

2014/6/18

Conference

International conference on machine learning

Pages

1044-1052

Publisher

PMLR

Description

Probabilistic inference on a big data scale is becoming increasingly relevant to both the machine learning and statistics communities. Here we introduce the first fully distributed MCMC algorithm based on stochastic gradients. We argue that stochastic gradient MCMC algorithms are particularly suited for distributed inference because individual chains can draw minibatches from their local pool of data for a flexible amount of time before jumping to or syncing with other chains. This greatly reduces communication overhead and allows adaptive load balancing. Our experiments for LDA on Wikipedia and Pubmed show that relative to the state of the art in distributed MCMC we reduce compute time from 27 hours to half an hour in order to reach the same perplexity level.

Total citations

Cited by 126

201420152016201720182019202020212022202320246 20 16 11 7 8 12 18 15 8 5

Scholar articles

Distributed stochastic gradient MCMC

S Ahn, B Shahbaba, M Welling - International conference on machine learning, 2014