View article

[PDF] from researchgate.net

GA3C: GPU-based A3C for deep reinforcement learning

Authors

Mohammad Babaeizadeh, Iuri Frosio, Stephen Tyree, Jason Clemons, Jan Kautz

Publication date

2016/11

Journal

CoRR abs/1611.06256

Description

We introduce and analyze the computational aspects of a hybrid CPU/GPU implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. Our analysis concentrates on the critical aspects to leverage the GPU’s computational power, including the introduction of a system of queues and a dynamic scheduling strategy, potentially helpful for other asynchronous algorithms as well. We also show the potential for the use of larger DNN models on a GPU. Our Tensor-Flow implementation achieves a significant speed up compared to our CPU-only implementation, and it will be made publicly available to other researchers.

Total citations

Cited by 89

201720182019202020212022202320243 11 14 18 16 10 10 7

Scholar articles

GA3C: GPU-based A3C for deep reinforcement learning

M Babaeizadeh, I Frosio, S Tyree, J Clemons, J Kautz - CoRR abs/1611.06256, 2016