Follow
Shixiang Shane Gu
Shixiang Shane Gu
Other namesShane Gu, Shixiang Gu
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Year
Data-efficient hierarchical reinforcement learning
LEE Honglak, S Gu, S Levine
US Patent 11,992,944, 2024
12024
Deep reinforcement learning for robotic manipulation
S Levine, E Holly, S Gu, T Lillicrap
US Patent App. 18/526,443, 2024
2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
2272024
Dreamsparse: Escaping from plato’s cave with 2d diffusion model given sparse views
P Yoo, J Guo, Y Matsuo, SS Gu
Advances in Neural Information Processing Systems 36, 2024
42024
For sale: State-action representation learning for deep reinforcement learning
S Fujimoto, WD Chang, E Smith, SS Gu, D Precup, D Meger
Advances in Neural Information Processing Systems 36, 2024
252024
Deep reinforcement learning for robotic manipulation
S Levine, E Holly, S Gu, T Lillicrap
US Patent 11,897,133, 2024
22024
Scaling instruction-finetuned language models
HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ...
Journal of Machine Learning Research 25 (70), 1-53, 2024
22662024
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
11592023
Domain prompt learning for efficiently adapting clip to unseen domains
X Zhang, SS Gu, Y Matsuo, Y Iwasawa
Transactions of the Japanese Society for Artificial Intelligence 38 (6), B …, 2023
172023
DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views
P Yoo, J Guo, Y Matsuo, SS Gu
arXiv preprint arXiv:2306.03414, 2023
122023
Multimodal web navigation with instruction-finetuned foundation models
H Furuta, KH Lee, O Nachum, Y Matsuo, A Faust, SS Gu, I Gur
arXiv preprint arXiv:2305.11854, 2023
502023
Learning a universal human prior for dexterous manipulation from human preference
Z Ding, Y Chen, AZ Ren, SS Gu, Q Wang, H Dong, C Jin
arXiv preprint arXiv:2304.04602, 2023
82023
Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning
S Kataoka, Y Chung, SKS Ghasemipour, P Sanketi, SS Gu, I Mordatch
arXiv preprint arXiv:2303.14870, 2023
22023
Collective intelligence for 2d push manipulations with mobile robots
S Kuroki, T Matsushima, J Arima, H Furuta, Y Matsuo, SS Gu, Y Tang
IEEE Robotics and Automation Letters 8 (5), 2820-2827, 2023
42023
Gpt-4 technical report
J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
arXiv preprint arXiv:2303.08774, 2023
28362023
Aligning text-to-image models using human feedback
K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ...
arXiv preprint arXiv:2302.12192, 2023
1462023
Instruction-finetuned foundation models for multimodal web navigation
H Furuta, O Nachum, KH Lee, Y Matsuo, SS Gu, I Gur
Workshop on Reincarnating Reinforcement Learning at ICLR 2023, 2023
52023
Large language models are zero-shot reasoners
T Kojima, SS Gu, M Reid, Y Matsuo, Y Iwasawa
Advances in neural information processing systems 35, 22199-22213, 2022
25522022
Why so pessimistic? estimating uncertainties for offline rl through ensembles, and why their independence matters
K Ghasemipour, SS Gu, O Nachum
Advances in Neural Information Processing Systems 35, 18267-18281, 2022
482022
A system for morphology-task generalization via unified representation and behavior distillation
H Furuta, Y Iwasawa, Y Matsuo, SS Gu
arXiv preprint arXiv:2211.14296, 2022
62022
The system can't perform the operation now. Try again later.
Articles 1–20