Articles with public access mandates - Shixiang Shane GuLearn more
Available somewhere: 6
Sequence tutor: Conservative fine-tuning of sequence generation models with kl-control
N Jaques, S Gu, D Bahdanau, JM Hernández-Lobato, RE Turner, D Eck
International Conference on Machine Learning, 1645-1654, 2017
Mandates: Natural Sciences and Engineering Research Council of Canada
Interpolated policy gradient: Merging on-policy and off-policy gradient estimation for deep reinforcement learning
SS Gu, T Lillicrap, RE Turner, Z Ghahramani, B Schölkopf, S Levine
Advances in neural information processing systems 30, 2017
Mandates: Natural Sciences and Engineering Research Council of Canada
Neural adaptive sequential monte carlo
SS Gu, Z Ghahramani, RE Turner
Advances in neural information processing systems 28, 2015
Mandates: UK Engineering and Physical Sciences Research Council
The mirage of action-dependent baselines in reinforcement learning
G Tucker, S Bhupatiraju, S Gu, R Turner, Z Ghahramani, S Levine
International conference on machine learning, 5015-5024, 2018
Mandates: UK Engineering and Physical Sciences Research Council
Tuning recurrent neural networks with reinforcement learning
N Jaques, S Gu, RE Turner, D Eck
Mandates: Natural Sciences and Engineering Research Council of Canada
Weakly-supervised reinforcement learning for controllable behavior
L Lee, B Eysenbach, RR Salakhutdinov, SS Gu, C Finn
Advances in Neural Information Processing Systems 33, 2661-2673, 2020
Mandates: US National Science Foundation, US Department of Defense
Publication and funding information is determined automatically by a computer program