Authors
Bradley B Doll, Dylan A Simon, Nathaniel D Daw
Publication date
2012/12/1
Source
Current opinion in neurobiology
Volume
22
Issue
6
Pages
1075-1081
Publisher
Elsevier Current Trends
Description
The reward prediction error (RPE) theory of dopamine (DA) function has enjoyed great success in the neuroscience of learning and decision-making. This theory is derived from model-free reinforcement learning (RL), in which choices are made simply on the basis of previously realized rewards. Recently, attention has turned to correlates of more flexible, albeit computationally complex, model-based methods in the brain. These methods are distinguished from model-free learning by their evaluation of candidate actions using expected future outcomes according to a world model. Puzzlingly, signatures from these computations seem to be pervasive in the very same regions previously thought to support model-free learning. Here, we review recent behavioral and neural evidence about these two systems, in attempt to reconcile their enigmatic cohabitation in the brain.
Scholar articles
BB Doll, DA Simon, ND Daw - Current opinion in neurobiology, 2012