Authors
Kumar Avinava Dubey, Sashank J Reddi, Sinead A Williamson, Barnabas Poczos, Alexander J Smola, Eric P Xing
Publication date
2016
Journal
Advances in neural information processing systems
Volume
29
Description
Stochastic gradient-based Monte Carlo methods such as stochastic gradient Langevin dynamics are useful tools for posterior inference on large scale datasets in many machine learning applications. These methods scale to large datasets by using noisy gradients calculated using a mini-batch or subset of the dataset. However, the high variance inherent in these noisy gradients degrades performance and leads to slower mixing. In this paper, we present techniques for reducing variance in stochastic gradient Langevin dynamics, yielding novel stochastic Monte Carlo methods that improve performance by reducing the variance in the stochastic gradient. We show that our proposed method has better theoretical guarantees on convergence rate than stochastic gradient Langevin dynamics. This is complemented by impressive empirical results obtained on a variety of real world datasets, and on four different machine learning tasks (regression, classification, independent component analysis and mixture modeling). These theoretical and empirical contributions combine to make a compelling case for using variance reduction in stochastic Monte Carlo methods.
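The variance-reduction idea summarized above can be illustrated with a short sketch: plain SGLD uses a noisy mini-batch gradient plus injected Gaussian noise, while an SVRG-style variant anchors the gradient estimate to a periodically recomputed full gradient at a snapshot point, which reduces the mini-batch variance. The Python sketch below applies this to a toy Gaussian-mean model; the function names (svrg_ld, sgld_step), the model, and all hyperparameters are illustrative assumptions, not the authors' reference implementation.

```python
# Minimal sketch of SVRG-style variance reduction for stochastic gradient
# Langevin dynamics (SVRG-LD) on a toy Gaussian-mean model with a flat prior.
# Names and settings are illustrative, not the paper's reference code.
import numpy as np

rng = np.random.default_rng(0)

# Toy data: x_i ~ N(theta_true, 1).
N, theta_true = 10_000, 2.0
x = rng.normal(theta_true, 1.0, size=N)

def grad_log_lik(theta, idx):
    """Per-example gradient of log p(x_i | theta) for a unit-variance Gaussian."""
    return x[idx] - theta

def sgld_step(theta, step, batch):
    """Plain SGLD baseline: rescaled mini-batch gradient plus Gaussian noise."""
    g = (N / len(batch)) * grad_log_lik(theta, batch).sum()
    return theta + 0.5 * step * g + rng.normal(0.0, np.sqrt(step))

def svrg_ld(theta0, step=1e-5, batch_size=32, epochs=20, snapshot_every=200):
    """SVRG-LD sketch: control-variate corrected gradient lowers estimator variance."""
    theta, samples = theta0, []
    for t in range(epochs * snapshot_every):
        if t % snapshot_every == 0:
            # Periodically fix a snapshot and compute its exact full-data gradient.
            theta_snap = theta
            full_grad = grad_log_lik(theta_snap, np.arange(N)).sum()
        batch = rng.integers(0, N, size=batch_size)
        # Variance-reduced estimate: snapshot full gradient + mini-batch correction.
        corr = (N / batch_size) * (grad_log_lik(theta, batch)
                                   - grad_log_lik(theta_snap, batch)).sum()
        g = full_grad + corr
        theta = theta + 0.5 * step * g + rng.normal(0.0, np.sqrt(step))
        samples.append(theta)
    return np.array(samples)

samples = svrg_ld(theta0=0.0)
print("posterior mean estimate:", samples[len(samples) // 2:].mean())
```

Discarding the first half of the chain as burn-in, the posterior mean estimate should land near the sample mean of the data (about 2.0 in this toy setup), with the variance-reduced gradient allowing larger mini-batch noise savings than the plain SGLD step shown for contrast.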
Total citations
(yearly citation counts, 2017-2024; chart data not legible)