Data-efficient hierarchical reinforcement learning LEE Honglak, S Gu, S Levine US Patent 11,992,944, 2024 | 1 | 2024 |
Deep reinforcement learning for robotic manipulation S Levine, E Holly, S Gu, T Lillicrap US Patent App. 18/526,443, 2024 | | 2024 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 227 | 2024 |
Dreamsparse: Escaping from plato’s cave with 2d diffusion model given sparse views P Yoo, J Guo, Y Matsuo, SS Gu Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
For sale: State-action representation learning for deep reinforcement learning S Fujimoto, WD Chang, E Smith, SS Gu, D Precup, D Meger Advances in Neural Information Processing Systems 36, 2024 | 25 | 2024 |
Deep reinforcement learning for robotic manipulation S Levine, E Holly, S Gu, T Lillicrap US Patent 11,897,133, 2024 | 2 | 2024 |
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024 | 2266 | 2024 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 1159 | 2023 |
Domain prompt learning for efficiently adapting clip to unseen domains X Zhang, SS Gu, Y Matsuo, Y Iwasawa Transactions of the Japanese Society for Artificial Intelligence 38 (6), B …, 2023 | 17 | 2023 |
DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views P Yoo, J Guo, Y Matsuo, SS Gu arXiv preprint arXiv:2306.03414, 2023 | 12 | 2023 |
Multimodal web navigation with instruction-finetuned foundation models H Furuta, KH Lee, O Nachum, Y Matsuo, A Faust, SS Gu, I Gur arXiv preprint arXiv:2305.11854, 2023 | 50 | 2023 |
Learning a universal human prior for dexterous manipulation from human preference Z Ding, Y Chen, AZ Ren, SS Gu, Q Wang, H Dong, C Jin arXiv preprint arXiv:2304.04602, 2023 | 8 | 2023 |
Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning S Kataoka, Y Chung, SKS Ghasemipour, P Sanketi, SS Gu, I Mordatch arXiv preprint arXiv:2303.14870, 2023 | 2 | 2023 |
Collective intelligence for 2d push manipulations with mobile robots S Kuroki, T Matsushima, J Arima, H Furuta, Y Matsuo, SS Gu, Y Tang IEEE Robotics and Automation Letters 8 (5), 2820-2827, 2023 | 4 | 2023 |
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 2836 | 2023 |
Aligning text-to-image models using human feedback K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ... arXiv preprint arXiv:2302.12192, 2023 | 146 | 2023 |
Instruction-finetuned foundation models for multimodal web navigation H Furuta, O Nachum, KH Lee, Y Matsuo, SS Gu, I Gur Workshop on Reincarnating Reinforcement Learning at ICLR 2023, 2023 | 5 | 2023 |
Large language models are zero-shot reasoners T Kojima, SS Gu, M Reid, Y Matsuo, Y Iwasawa Advances in neural information processing systems 35, 22199-22213, 2022 | 2552 | 2022 |
Why so pessimistic? estimating uncertainties for offline rl through ensembles, and why their independence matters K Ghasemipour, SS Gu, O Nachum Advances in Neural Information Processing Systems 35, 18267-18281, 2022 | 48 | 2022 |
A system for morphology-task generalization via unified representation and behavior distillation H Furuta, Y Iwasawa, Y Matsuo, SS Gu arXiv preprint arXiv:2211.14296, 2022 | 6 | 2022 |