View article

[HTML] from nih.gov

The ubiquity of model-based reinforcement learning

Authors

Bradley B Doll, Dylan A Simon, Nathaniel D Daw

Publication date

2012/12/1

Source

Current opinion in neurobiology

Volume

Issue

Pages

1075-1081

Publisher

Elsevier Current Trends

Description

The reward prediction error (RPE) theory of dopamine (DA) function has enjoyed great success in the neuroscience of learning and decision-making. This theory is derived from model-free reinforcement learning (RL), in which choices are made simply on the basis of previously realized rewards. Recently, attention has turned to correlates of more flexible, albeit computationally complex, model-based methods in the brain. These methods are distinguished from model-free learning by their evaluation of candidate actions using expected future outcomes according to a world model. Puzzlingly, signatures from these computations seem to be pervasive in the very same regions previously thought to support model-free learning. Here, we review recent behavioral and neural evidence about these two systems, in attempt to reconcile their enigmatic cohabitation in the brain.

Total citations

Cited by 468

20132014201520162017201820192020202120222023202414 23 42 49 31 54 35 54 44 58 37 23

Scholar articles

The ubiquity of model-based reinforcement learning

BB Doll, DA Simon, ND Daw - Current opinion in neurobiology, 2012