View article

[PDF] from ucl.ac.uk

On the sample complexity of reinforcement learning

Authors

Sham Machandranath Kakade

Publication date

2003

Source

PQDT-Global

Institution

University of London, University College London (United Kingdom)

Description

This thesis is a detailed investigation into the following question: how much data must an agent collect in order to perform" reinforcement learning" successfully? This question is analogous to the classical issue of the sample complexity in supervised learning, but is harder because of the increased realism of the reinforcement learning setting. This thesis summarizes recent sample complexity results in the reinforcement learning literature and builds on these results to provide novel algorithms with strong performance guarantees. We focus on a variety of reasonable performance criteria and sampling models by which agents may access the environment. For instance, in a policy search setting, we consider the problem of how much simulated experience is required to reliably choose a" good" policy among a restricted class of policies II (as in Kearns, Mansour, and Ng [2000]). In a more online setting, we consider the …

Total citations

Cited by 828

2004200520062007200820092010201120122013201420152016201720182019202020212022202320245 8 11 15 24 24 18 14 31 27 22 26 19 32 41 58 88 115 94 89 58

Scholar articles

On the sample complexity of reinforcement learning

SM Kakade - 2003