Authors
Haitham Bou Ammar, Eric Eaton, Paul Ruvolo, Matthew E. Taylor
Publication date
2014/1/27
Conference
International conference on machine learning
Pages
1206-1214
Description
Policy gradient algorithms have shown considerable recent success in solving high-dimensional sequential decision making tasks, particularly in robotics. However, these methods often require extensive experience in a domain to achieve high performance. To make agents more sample-efficient, we developed a multi-task policy gradient method to learn decision making tasks consecutively, transferring knowledge between tasks to accelerate learning. Our approach provides robust theoretical guarantees, and we show empirically that it dramatically accelerates learning on a variety of dynamical systems, including an application to quadrotor control.
Total citations
201420152016201720182019202020212022202320242713292519242220216
Scholar articles
HB Ammar, E Eaton, P Ruvolo, M Taylor - International conference on machine learning, 2014