Reinforcement learning with unsupervised auxiliary tasks

Authors

Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, Koray Kavukcuoglu

Publication date

2017

Journal

International Conference on Learning Representations (ICLR)

Description

Deep reinforcement learning agents have achieved state-of-the-art results by directly maximising cumulative reward. However, environments contain a much wider variety of possible training signals. In this paper, we introduce an agent that also maximises many other pseudo-reward functions simultaneously by reinforcement learning. All of these tasks share a common representation that, like unsupervised learning, continues to develop in the absence of extrinsic rewards. We also introduce a novel mechanism for focusing this representation upon extrinsic rewards, so that learning can rapidly adapt to the most relevant aspects of the actual task. Our agent significantly outperforms the previous state-of-the-art on Atari, averaging 880% expert human performance, and on a challenging suite of first-person, three-dimensional Labyrinth tasks, leading to a mean speedup in learning of 10× and averaging 87% expert human performance on Labyrinth.

Total citations

Cited by 1409

Citations per year: 2016: 7, 2017: 98, 2018: 178, 2019: 194, 2020: 213, 2021: 250, 2022: 196, 2023: 185, 2024: 83

Scholar articles

Reinforcement learning with unsupervised auxiliary tasks

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv:1611.05397, 2016

Cited by 1395

Reinforcement learning with unsupervised auxiliary tasks. arXiv 2016*

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv:1611.05397

Cited by 13

Reinforcement learning with unsupervised auxiliary tasks. CoRR abs/1611.05397 (2016)*

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv:1611.05397, 2016

Cited by 6

Reinforcement learning with unsupervised auxiliary tasks (2016)*

M Jaderberg, V Mnih, WM Czarnecki, T Schaul, J Leibo…

Cited by 4

Leibo Joel Z, Silver David, and Kavukcuoglu Koray*

J Max, M Volodymyr, CW Marian, S Tom - Reinforcement learning with unsupervised auxiliary …, 2017

Cited by 3

Reinforcement learning with unsupervised auxiliary tasks. arXiv [Preprint](2016)*

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv:1611.05397, 2016

Cited by 3

Reinforcement Learning with Unsupervised Auxiliary Tasks, CoRR abs/1611.05397*

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - arXiv preprint arXiv:1611.05397, 2016

Cited by 2

Reinforcement learning with unsupervised auxiliary tasks. arXiv preprint arXiv: 161105397*

M Jaderberg, V Mnih, WM Czarnecki, T Schaul… - 2016

Cited by 2

Reinforcement learning with unsupervised auxiliary tasks (pp. 1–11)*

M Jaderberg, V Mnih, W Czarnecki, T Schaul… - 2016

Cited by 2