Unifying Count-Based Exploration and Intrinsic Motivation

Authors

Marc G Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos

Publication date

2016/6/6

Conference

Neural Information Processing Systems (NeurIPS)

Description

We consider an agent's uncertainty about its environment and the problem of generalizing this uncertainty across states. Specifically, we focus on the problem of exploration in non-tabular reinforcement learning. Drawing inspiration from the intrinsic motivation literature, we use density models to measure uncertainty, and propose a novel algorithm for deriving a pseudo-count from an arbitrary density model. This technique enables us to generalize count-based exploration algorithms to the non-tabular case. We apply our ideas to Atari 2600 games, providing sensible pseudo-counts from raw pixels. We transform these pseudo-counts into exploration bonuses and obtain significantly improved exploration in a number of hard games, including the infamously difficult Montezuma's Revenge.
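The pseudo-count idea in the description can be illustrated with a minimal sketch. It assumes the paper's definition of the pseudo-count, N̂(x) = ρ(x)(1 − ρ′(x)) / (ρ′(x) − ρ(x)), where ρ(x) is the density model's probability of x before observing it and ρ′(x) is its probability after one more observation of x. The `EmpiricalDensity` class, the function names, and the bonus constants (`beta=0.05`, offset `0.01`) are illustrative assumptions, not the authors' implementation, which uses a pixel-level density model on Atari frames.

```python
import math
from collections import Counter


def pseudo_count(rho, rho_prime):
    """Pseudo-count from the density before (rho) and after (rho_prime)
    one additional observation of the same state."""
    if not 0.0 < rho < rho_prime < 1.0:
        raise ValueError("requires 0 < rho < rho_prime < 1")
    return rho * (1.0 - rho_prime) / (rho_prime - rho)


def exploration_bonus(n_hat, beta=0.05, offset=0.01):
    """Count-based bonus that shrinks as the pseudo-count grows.
    beta and offset are assumed values chosen for illustration."""
    return beta / math.sqrt(n_hat + offset)


class EmpiricalDensity:
    """Hypothetical toy density model: relative visit frequencies."""

    def __init__(self):
        self.counts = Counter()
        self.total = 0

    def prob(self, x):
        # Probability assigned to x given the data seen so far.
        return self.counts[x] / self.total if self.total else 0.0

    def recoding_prob(self, x):
        # Probability the model would assign to x after seeing x once more.
        return (self.counts[x] + 1) / (self.total + 1)

    def update(self, x):
        self.counts[x] += 1
        self.total += 1


model = EmpiricalDensity()
for state in "abaca":
    model.update(state)

n_hat = pseudo_count(model.prob("a"), model.recoding_prob("a"))
bonus = exploration_bonus(n_hat)
```

For this toy frequency model the pseudo-count of a state matches its true visit count up to floating-point error, which is the sanity check that motivates the definition. A learned density model replaces the exact counts with a quantity that generalizes across similar states.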

Total citations

Cited by 1692

Citations per year: 2016: 17, 2017: 94, 2018: 157, 2019: 192, 2020: 217, 2021: 277, 2022: 295, 2023: 291, 2024: 147

Scholar articles

Unifying count-based exploration and intrinsic motivation

M Bellemare, S Srinivasan, G Ostrovski, T Schaul… - Advances in neural information processing systems, 2016

Cited by 1673

Unifying Count-Based Exploration and Intrinsic Motivation, 2016*

MG Bellemare, S Srinivasan, G Ostrovski, T Schaul… - arxiv.org, open archive of scientific articles. URL …

Cited by 7

Unifying count-based exploration and intrinsic motivation. arXiv 2016*

MG Bellemare, S Srinivasan, G Ostrovski, T Schaul… - arXiv preprint arXiv:1606.01868

Cited by 7

Advances in Neural Information Processing Systems*

M Bellemare, S Srinivasan, G Ostrovski, T Schaul… - 2016

Cited by 5

Advances in neural information processing systems 29 (NIPS 2016)*

M Bellemare, S Srinivasan, G Ostrovski, T Schaul… - 2016

Cited by 5

Unifying count-based exploration and intrinsic motivation. CoRR abs/1606.01868 (2016)*

MG Bellemare, S Srinivasan, G Ostrovski, T Schaul… - arXiv preprint arXiv:1606.01868, 2016

Cited by 4

Google Deepmind, and Rémi Munos*

MG Bellemare, S Srinivasan, G Ostrovski, T Schaul… - Unifying Count-Based Exploration and Intrinsic …, 2016

Cited by 3

Unifying count-based exploration and intrinsic motivation. CoRR, abs/1606.01868*

MG Bellemare, S Srinivasan, G Ostrovski, T Schaul… - arXiv preprint arXiv:1606.01868, 2016

Cited by 3

Unifying Count-Based Exploration and Intrinsic Motivation. arXiv e-prints, art*

MG Bellemare, S Srinivasan, G Ostrovski, T Schaul… - arXiv preprint arXiv:1606.01868, 2016

Cited by 2