Yuhang Zang

Cited by

	All	Since 2019
Citations	2052	2052
h-index	15	15
i10-index	17	17

960

480

240

720

20192020202120222023202420 76 194 340 474 946

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Dahua LinThe Chinese University of Hong KongVerified email at ie.cuhk.edu.hk
Pan ZhangShanghai AI LaboratoryVerified email at mail.ustc.edu.cn
Jiaqi WangShanghai AI LaboratoryVerified email at pjlab.org.cn
Xiaoyi DongShanghai AI LaboratoryVerified email at mail.ustc.edu.cn
Chen Change LoyMMLab@NTU, S-Lab, Nanyang Technological UniversityVerified email at ntu.edu.sg
Wei Li (李威)Nanyang Technological University, SingaporeVerified email at ntu.edu.sg
Chen HuangResearch Scientist, Apple IncVerified email at apple.com
Enze XieNVIDIA, HKUVerified email at connect.hku.hk
Kaiyang ZhouAssistant Professor, Hong Kong Baptist UniversityVerified email at hkbu.edu.hk
Gang YUStepFunVerified email at stepfun.com
Yu QiaoProfessor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CASVerified email at siat.ac.cn
Haodong Duan 段浩东Shanghai AI LaboratoryVerified email at pjlab.org.cn
Ziwei LiuAssistant Professor, Nanyang Technological UniversityVerified email at ntu.edu.sg
Wenwei ZhangShanghai AI LaboratoryVerified email at ntu.edu.sg
Yuhang CaoMMLab The Chinese University of Hong KongVerified email at ie.cuhk.edu.hk
Kai ChenShanghai AI LaboratoryVerified email at pjlab.org.cn
Wenhai Wang (王文海)CUHK | Shanghai AI Laboratory | NJUVerified email at cuhk.edu.hk
Xilin WeiPhd, Fudan UniversityVerified email at m.fudan.edu.cn
Songyang ZhangShanghai AI LaboratoryVerified email at pjlab.org.cn
Yining LiShanghai AI LabVerified email at pjlab.org.cn

Yuhang Zang

Shanghai AI Laboratory

Verified email at pjlab.org.cn - Homepage

Vision Language Model Large Language Model Computer Vision Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network W Wang, E Xie, X Song, Y Zang, W Wang, T Lu, G Yu, C Shen IEEE International Conference on Computer Vision (ICCV), 2019	556	2019
Seesaw Loss for Long-Tailed Instance Segmentation J Wang, W Zhang, Y Zang, Y Cao, J Pang, T Gong, K Chen, Z Liu, CC Loy, ... IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021	271	2021
Scene Text Detection with Supervised Pyramid Context Network E Xie, Y Zang, S Shao, G Yu, C Yao, G Li AAAI Conference on Artificial Intelligence (AAAI), 2019	253	2019
Open-Vocabulary DETR with Conditional Matching Y Zang, W Li, K Zhou, C Huang, CC Loy European Conference on Computer Vision (ECCV), 2022	162	2022
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models H Duan, J Yang, Y Qiao, X Fang, L Chen, Y Liu, X Dong, Y Zang, P Zhang, ... ACM Multimedia (ACM MM) Open Source Software Competition, 2024	128*	2024
Unified Vision and Language Prompt Learning Y Zang, W Li, K Zhou, C Huang, CC Loy arXiv preprint arXiv:2210.07225, 2022	124	2022
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation Y Zang, C Huang, CC Loy IEEE International Conference on Computer Vision (ICCV), 2021	113	2021
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024	100	2024
InternLM2 Technical Report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024	76	2024
Are We on the Right Way for Evaluating Large Vision-Language Models? L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ... arXiv preprint arXiv:2403.20330, 2024	45	2024
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... arXiv preprint arXiv:2404.06512, 2024	42	2024
Contextual Object Detection with Multimodal Large Language Models Y Zang, W Li, J Han, K Zhou, CC Loy International Journal of Computer Vision (IJCV), 2023	42	2023
Long-CLIP: Unlocking the Long-Text Capability of CLIP B Zhang, P Zhang, X Dong, Y Zang, J Wang European Conference on Computer Vision (ECCV), 2024	28	2024
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want Z Sun, Y Fang, T Wu, P Zhang, Y Zang, S Kong, Y Xiong, D Lin, J Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024	26	2024
1st Place Solutions for OpenImage2019--Object Detection and Instance Segmentation Y Liu, G Song, Y Zang, Y Gao, E Xie, J Yan, CC Loy, X Wang arXiv preprint arXiv:2003.07557, 2020	21	2020
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch Y Zang, K Zhou, C Huang, CC Loy International Journal of Computer Vision (IJCV), 2023	12	2023
KPNet: Towards Minimal Face Detector G Song, Y Liu, Y Zang, X Wang, B Leng, Q Yuan AAAI Conference on Artificial Intelligence (AAAI), 2020	10	2020
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions L Chen, X Wei, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, B Lin, ... arXiv preprint arXiv:2406.04325, 2024	9	2024
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ... arXiv preprint arXiv:2407.03320, 2024	7	2024
On-Device Domain Generalization K Zhou, Y Zhang, Y Zang, J Yang, CC Loy, Z Liu arXiv preprint arXiv:2209.07521, 2022	6	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors