View article

[PDF] from mlr.press

Online multi-task learning for policy gradient methods

Authors

Haitham Bou Ammar, Eric Eaton, Paul Ruvolo, Matthew E. Taylor

Publication date

2014/1/27

Conference

International conference on machine learning

Pages

1206-1214

Description

Policy gradient algorithms have shown considerable recent success in solving high-dimensional sequential decision making tasks, particularly in robotics. However, these methods often require extensive experience in a domain to achieve high performance. To make agents more sample-efficient, we developed a multi-task policy gradient method to learn decision making tasks consecutively, transferring knowledge between tasks to accelerate learning. Our approach provides robust theoretical guarantees, and we show empirically that it dramatically accelerates learning on a variety of dynamical systems, including an application to quadrotor control.

Total citations

Cited by 188

201420152016201720182019202020212022202320242 7 13 29 25 19 24 22 20 21 6

Scholar articles

Online multi-task learning for policy gradient methods

HB Ammar, E Eaton, P Ruvolo, M Taylor - International conference on machine learning, 2014