View article

[PDF] from arxiv.org

Population-Contrastive-Divergence: Does consistency help with RBM training?

Authors

Oswin Krause, Asja Fischer, Christian Igel

Publication date

2018/1/15

Journal

Pattern Recognition Letters

Volume

102

Pages

1-7

Publisher

North-Holland

Description

Estimating the log-likelihood gradient with respect to the parameters of a Restricted Boltzmann Machine (RBM) typically requires sampling using Markov Chain Monte Carlo (MCMC) techniques. To save computation time, the Markov chains are only run for a small number of steps, which leads to a biased estimate. This bias can cause RBM training algorithms such as Contrastive Divergence (CD) learning to deteriorate. We adopt the idea behind Population Monte Carlo (PMC) methods to devise a new RBM training algorithm termed Population-Contrastive-Divergence (pop-CD). Compared to CD, it leads to a consistent estimate and may have a significantly lower bias. Its computational overhead is negligible compared to CD, but the variance of the gradient estimate increases. We experimentally show that pop-CD can significantly outperform CD. In many cases, we observed a smaller bias and achieved higher log …

Total citations

Cited by 17

201720182019202020212022202320241 4 1 3 4 3 1

Scholar articles

Population-Contrastive-Divergence: Does consistency help with RBM training?

O Krause, A Fischer, C Igel - Pattern Recognition Letters, 2018