Public access

Articles with public access mandates - Shixiang Shane GuLearn more

Available somewhere: 6

Sequence tutor: Conservative fine-tuning of sequence generation models with kl-control

N Jaques, S Gu, D Bahdanau, JM Hernández-Lobato, RE Turner, D Eck

International Conference on Machine Learning, 1645-1654, 2017

Mandates: Natural Sciences and Engineering Research Council of Canada

Interpolated policy gradient: Merging on-policy and off-policy gradient estimation for deep reinforcement learning

SS Gu, T Lillicrap, RE Turner, Z Ghahramani, B Schölkopf, S Levine

Advances in neural information processing systems 30, 2017

Mandates: Natural Sciences and Engineering Research Council of Canada

Neural adaptive sequential monte carlo

SS Gu, Z Ghahramani, RE Turner

Advances in neural information processing systems 28, 2015

Mandates: UK Engineering and Physical Sciences Research Council

The mirage of action-dependent baselines in reinforcement learning

G Tucker, S Bhupatiraju, S Gu, R Turner, Z Ghahramani, S Levine

International conference on machine learning, 5015-5024, 2018

Mandates: UK Engineering and Physical Sciences Research Council

Tuning recurrent neural networks with reinforcement learning

N Jaques, S Gu, RE Turner, D Eck

Mandates: Natural Sciences and Engineering Research Council of Canada

Weakly-supervised reinforcement learning for controllable behavior

L Lee, B Eysenbach, RR Salakhutdinov, SS Gu, C Finn

Advances in Neural Information Processing Systems 33, 2661-2673, 2020

Mandates: US National Science Foundation, US Department of Defense

Publication and funding information is determined automatically by a computer program