Shiyu Huang

Cited by

	All	Since 2019
Citations	245	229
h-index	9	9
i10-index	9	9

201720182019202020212022202320241 15 23 13 17 20 65 90

Public access

View all

5 articles

0 articles*

available

not available

Based on funding mandates

Co-authors

Jun ZhuProfessor of Computer Science, Tsinghua UniversityVerified email at mail.tsinghua.edu.cn
Ting ChenProfessor of Computer Science, Tsinghua UniversityVerified email at tsinghua.edu.cn
Wei-Wei TuNanjing University, ChaLearnVerified email at lamda.nju.edu.cn
Hang SuAssociated Professor, Tsinghua UniversityVerified email at mail.tsinghua.edu.cn
Wentse ChenCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Chongxuan LiAssociate Professor (tenure track), Renmin University of ChinaVerified email at ruc.edu.cn
Yejin ChoiUniversity of Washington / Allen Institute for Artificial IntelligenceVerified email at cs.washington.edu
Chandra BhagavatulaAllen Institute for Artificial IntelligenceVerified email at allenai.org
Prithviraj AmmanabroluAssistant Professor, University of California, San DiegoVerified email at ucsd.edu
Sean (Xiang) RenViterbi Early Career Chair & Associate Professor, University of Southern CaliforniaVerified email at usc.edu
Bill Yuchen LinAllen Institute for AI (AI2)Verified email at allenai.org
Yicheng FuStanford UniversityVerified email at stanford.edu
Faeze BrahmanPostdoctoral Researcher at AI2Verified email at ucsc.edu
Deva RamananProfessor, Robotics Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Deheng YeDirector of AI Applications, TencentVerified email at e.ntu.edu.sg
Dong YanBaichuan Inc. Head of Reinforcement Learning Team.Verified email at baichuan-inc.com
Jiayi WengOpenAIVerified email at openai.com
Chao Yu（于超）Tsinghua UniversityVerified email at mail.tsinghua.edu.cn
Tim PearceUniversity of LeicesterVerified email at le.ac.uk
Bin WangHuawei Noah's Ark LabVerified email at huawei.com

Shiyu Huang

Other names黄世宇

Researcher at Zhipu AI; Tsinghua University

Verified email at zhipuai.cn - Homepage

Deep RL Multi-agent RL CV AIGC LLM


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters S Huang, D Ramanan	62	2017
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks B Yuchen Lin, Y Fu, K Yang, F Brahman, S Huang, C Bhagavatula, ... arXiv e-prints, arXiv: 2305.17390, 2023	38*	2023
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations S Huang, W Chen, L Zhang, Z Li, F Zhu, D Ye, T Chen, J Zhu arXiv preprint arXiv:2110.04507, 2021	24	2021
Deep reinforcement learning with credit assignment for combinatorial optimization D Yan, J Weng, S Huang, C Li, Y Zhou, H Su, J Zhu Pattern Recognition 124, 108466, 2022	23	2022
Combo-action: Training agent for fps game with auxiliary tasks S Huang, H Su, J Zhu, T Chen Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 954-961, 2019	21	2019
Svqn: Sequential variational soft q-learning networks S Huang, H Su, J Zhu, T Chen International Conference on Learning Representations, 2019	14	2019
Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning Y Lin, Q Zhang, B Gao, J Tang, P Yao, C Li, S Huang, Z Liu, Y Zhou, Y Liu, ... Nature Machine Intelligence 5 (7), 714-723, 2023	13	2023
Robustness and generalizability of deepfake detection: A study with diffusion models H Song, S Huang, Y Dong, WW Tu arXiv preprint arXiv:2309.02218, 2023	10	2023
Tizero: Mastering multi-agent football with curriculum learning and self-play F Lin, S Huang, T Pearce, W Chen, WW Tu arXiv preprint arXiv:2302.07515, 2023	10	2023
Learning graph-enhanced commander-executor for multi-agent navigation X Yang, S Huang, Y Sun, Y Yang, C Yu, WW Tu, H Yang, Y Wang arXiv preprint arXiv:2302.04094, 2023	6	2023
DGPO: discovering multiple strategies with diversity-guided policy optimization W Chen, S Huang, Y Chiang, T Pearce, WW Tu, T Chen, J Zhu Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 11390 …, 2024	5	2024
Llmarena: Assessing capabilities of large language models in dynamic multi-agent environments J Chen, X Hu, S Liu, S Huang, WW Tu, Z He, L Wen arXiv preprint arXiv:2402.16499, 2024	5	2024
Cogvideox: Text-to-video diffusion models with an expert transformer Z Yang, J Teng, W Zheng, M Ding, S Huang, J Xu, Y Yang, W Hong, ... arXiv preprint arXiv:2408.06072, 2024	4	2024
Vmapd: Generate diverse solutions for multi-agent games with recurrent trajectory discriminators S Huang, C Yu, B Wang, D Li, Y Wang, T Chen, J Zhu 2022 IEEE Conference on Games (CoG), 9-16, 2022	2	2022
Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization S Huang, B Wang, D Li, J Hao, T Chen, J Zhu arXiv preprint arXiv:2110.03939, 2021	2	2021
Learning to assign credit in reinforcement learning by incorporating abstract relations D Yan, S Huang, H Su, J Zhu AAAI Workshop on Reinforcement Learning in Games, 2019	2	2019
CogVLM2: Visual Language Models for Image and Video Understanding W Hong, W Wang, M Ding, W Yu, Q Lv, Y Wang, Y Cheng, S Huang, J Ji, ... arXiv preprint arXiv:2408.16500, 2024	1	2024
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization W Chen, S Huang, J Schneider arXiv preprint arXiv:2406.13930, 2024	1	2024
LVBench: An Extreme Long Video Understanding Benchmark W Wang, Z He, W Hong, Y Cheng, X Zhang, J Qi, S Huang, B Xu, Y Dong, ... arXiv preprint arXiv:2406.08035, 2024	1	2024
OpenRL: A Unified Reinforcement Learning Framework S Huang, W Chen, Y Sun, F Bie, WW Tu arXiv preprint arXiv:2312.16189, 2023	1	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors