Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-world Multi-task Agents Z Wang, S Cai, G Chen, A Liu, X Ma, Y Liang NeurIPS 2023, 2023 | 262* | 2023 |
Rethinking Graph Neural Architecture Search from Message-passing S Cai, L Li, J Deng, B Zhang, ZJ Zha, L Su, Q Huang CVPR 2021, 2021 | 66* | 2021 |
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models Z Wang, S Cai, A Liu, Y Jin, J Hou, B Zhang, H Lin, Z He, Z Zheng, Y Yang, ... Workshop on Agent Learning in Open-Endedness (ALOE) at NeurIPS 2023, 2023 | 41 | 2023 |
Open-world Multi-task Control Through Goal-aware Representation Learning and Adaptive Horizon Prediction S Cai, Z Wang, X Ma, A Liu, Y Liang CVPR 2023, 2023 | 27 | 2023 |
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning Z Liu, J Deng, L Li, S Cai, Q Xu, S Wang, Q Huang ACM MM 2020, Oral Presentation, 2020 | 19 | 2020 |
GROOT: Learning to Follow Instructions by Watching Gameplay Videos S Cai, B Zhang, Z Wang, X Ma, A Liu, Y Liang ICLR 2024, Spotlight Presentation, 2023 | 16 | 2023 |
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings B Li, S Cai, W Liu, P Zhang, Q He, M Hua, Z Yi WACV 2023, 2023 | 11 | 2023 |
Automatic Relation-aware Graph Network Proliferation S Cai, L Li, X Han, J Luo, ZJ Zha, Q Huang CVPR 2022, Oral Presentation, 2022 | 11 | 2022 |
Edge-featured Graph Neural Architecture Search S Cai, L Li, X Han, Z Zha, Q Huang arXiv preprint arXiv:2109.01356, 2021 | 7 | 2021 |
Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition S Cai, L Li, X Han, S Huang, Q Tian, Q Huang TNNLS 2023, 2023 | 5 | 2023 |
Inductive State-Relabeling Adversarial Active Learning with Heuristic Clique Rescaling B Zhang, L Li, S Wang, S Cai, ZJ Zha, Q Tian, Q Huang TPAMI 2024, 2024 | | 2024 |
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Z Wang, S Cai, Z Mu, H Lin, C Zhang, X Liu, Q Li, A Liu, X Ma, Y Liang arXiv preprint arXiv:2407.00114, 2024 | | 2024 |
GROOT-1.5: Learning to Follow Multi-Modal Instructions from Weak Supervision S Cai, B Zhang, Z Wang, X Ma, A Liu, Y Liang Multi-modal Foundation Model meets Embodied AI Workshop@ ICML2024, 0 | | |