Follow
Shiyu Huang
Shiyu Huang
Other names黄 世宇
Researcher at Zhipu AI; Tsinghua University
Verified email at zhipuai.cn - Homepage
Title
Cited by
Cited by
Year
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters
S Huang, D Ramanan
622017
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
B Yuchen Lin, Y Fu, K Yang, F Brahman, S Huang, C Bhagavatula, ...
arXiv e-prints, arXiv: 2305.17390, 2023
38*2023
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
S Huang, W Chen, L Zhang, Z Li, F Zhu, D Ye, T Chen, J Zhu
arXiv preprint arXiv:2110.04507, 2021
242021
Deep reinforcement learning with credit assignment for combinatorial optimization
D Yan, J Weng, S Huang, C Li, Y Zhou, H Su, J Zhu
Pattern Recognition 124, 108466, 2022
232022
Combo-action: Training agent for fps game with auxiliary tasks
S Huang, H Su, J Zhu, T Chen
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 954-961, 2019
212019
Svqn: Sequential variational soft q-learning networks
S Huang, H Su, J Zhu, T Chen
International Conference on Learning Representations, 2019
142019
Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning
Y Lin, Q Zhang, B Gao, J Tang, P Yao, C Li, S Huang, Z Liu, Y Zhou, Y Liu, ...
Nature Machine Intelligence 5 (7), 714-723, 2023
132023
Robustness and generalizability of deepfake detection: A study with diffusion models
H Song, S Huang, Y Dong, WW Tu
arXiv preprint arXiv:2309.02218, 2023
102023
Tizero: Mastering multi-agent football with curriculum learning and self-play
F Lin, S Huang, T Pearce, W Chen, WW Tu
arXiv preprint arXiv:2302.07515, 2023
102023
Learning graph-enhanced commander-executor for multi-agent navigation
X Yang, S Huang, Y Sun, Y Yang, C Yu, WW Tu, H Yang, Y Wang
arXiv preprint arXiv:2302.04094, 2023
62023
DGPO: discovering multiple strategies with diversity-guided policy optimization
W Chen, S Huang, Y Chiang, T Pearce, WW Tu, T Chen, J Zhu
Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 11390 …, 2024
52024
Llmarena: Assessing capabilities of large language models in dynamic multi-agent environments
J Chen, X Hu, S Liu, S Huang, WW Tu, Z He, L Wen
arXiv preprint arXiv:2402.16499, 2024
52024
Cogvideox: Text-to-video diffusion models with an expert transformer
Z Yang, J Teng, W Zheng, M Ding, S Huang, J Xu, Y Yang, W Hong, ...
arXiv preprint arXiv:2408.06072, 2024
42024
Vmapd: Generate diverse solutions for multi-agent games with recurrent trajectory discriminators
S Huang, C Yu, B Wang, D Li, Y Wang, T Chen, J Zhu
2022 IEEE Conference on Games (CoG), 9-16, 2022
22022
Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization
S Huang, B Wang, D Li, J Hao, T Chen, J Zhu
arXiv preprint arXiv:2110.03939, 2021
22021
Learning to assign credit in reinforcement learning by incorporating abstract relations
D Yan, S Huang, H Su, J Zhu
AAAI Workshop on Reinforcement Learning in Games, 2019
22019
CogVLM2: Visual Language Models for Image and Video Understanding
W Hong, W Wang, M Ding, W Yu, Q Lv, Y Wang, Y Cheng, S Huang, J Ji, ...
arXiv preprint arXiv:2408.16500, 2024
12024
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
W Chen, S Huang, J Schneider
arXiv preprint arXiv:2406.13930, 2024
12024
LVBench: An Extreme Long Video Understanding Benchmark
W Wang, Z He, W Hong, Y Cheng, X Zhang, J Qi, S Huang, B Xu, Y Dong, ...
arXiv preprint arXiv:2406.08035, 2024
12024
OpenRL: A Unified Reinforcement Learning Framework
S Huang, W Chen, Y Sun, F Bie, WW Tu
arXiv preprint arXiv:2312.16189, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20