View article

[PDF] from mlr.press

Efficient learning of deep boltzmann machines

Authors

Ruslan Salakhutdinov, Hugo Larochelle

Publication date

2010

Journal

International Conference on Artificial Intelligence and Statistics

Description

We present a new approximate inference algorithm for Deep Boltzmann Machines (DBM’s), a generative model with many layers of hidden variables. The algorithm learns a separate “recognition” model that is used to quickly initialize, in a single bottom-up pass, the values of the latent variables in all hidden layers. We show that using such a recognition model, followed by a combined top-down and bottom-up pass, it is possible to efficiently learn a good generative model of high-dimensional highly-structured sensory input. We show that the additional computations required by incorporating a top-down feedback plays a critical role in the performance of a DBM, both as a generative and discriminative model. Moreover, inference is only at most three times slower compared to the approximate inference in a Deep Belief Network (DBN), making large-scale learning of DBM’s practical. Finally, we demonstrate that the DBM’s trained using the proposed approximate inference algorithm perform well compared to DBN’s and SVM’s on the MNIST handwritten digit, OCR English letters, and NORB visual object recognition tasks.

Total citations

Cited by 1500

2010201120122013201420152016201720182019202020212022202320245 11 15 48 49 82 95 86 189 215 185 163 140 128 76

Scholar articles

Efficient learning of deep Boltzmann machines

R Salakhutdinov, H Larochelle - Proceedings of the thirteenth international conference …, 2010

Cited by 1151 Related articles All 29 versions

Proceedings of the thirteenth international conference on artificial intelligence and statistics*

X Glorot, Y Bengio - Understanding the difficulty of training deep …, 2010

Cited by 354 Related articles