Wenhai Wang (王文海)

Cited by

	All	Since 2019
Citations	20425	20394
h-index	37	37
i10-index	54	53

Citations per year
Year	2019	2020	2021	2022	2023	2024
Citations	73	383	1415	3669	7392	7441

Public access

	Available	Not available
Articles	33	0

Based on funding mandates

Co-authors

Enze Xie - NVIDIA, HKU - Verified email at connect.hku.hk
Ping Luo (羅平) - Associate Professor, The University of Hong Kong - Verified email at hku.hk
Yu Qiao - Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS - Verified email at siat.ac.cn
Jifeng Dai - Associate Professor of EE, Tsinghua University; Adjunct Researcher of Shanghai AI Laboratory - Verified email at tsinghua.edu.cn
Zhe Chen (陈喆) - PhD candidate, Nanjing University - Verified email at smail.nju.edu.cn
Xiang Li (李翔) - Associate Professor, Nankai University - Verified email at nankai.edu.cn
Xizhou Zhu - Tsinghua University - Verified email at tsinghua.edu.cn
Ding Liang - VAST, Tsinghua University - Verified email at vastai3d.com
Jian Yang - Prof. of Computer Science, Nanjing University of Science and Technology - Verified email at njust.edu.cn
Lewei Lu - Research Director (We're Hiring, [email protected]) @ SenseTime Research - Verified email at sensetime.com
Zhiding Yu - Principal Research Scientist & Research Lead, NVIDIA Research - Verified email at nvidia.com
Zhiqi Li - PhD candidate, Nanjing University - Verified email at smail.nju.edu.cn
Deng-Ping Fan (范登平) - Professor, Nankai University - Verified email at nankai.edu.cn
Chunhua Shen - Zhejiang University - Verified email at zju.edu.cn
Kaitao Song - Senior Researcher, Microsoft Research - Verified email at microsoft.com
Hong-Yang Li - Assistant Professor, University of Hong Kong; Research Scientist, Shanghai AI Lab - Verified email at hku.hk
Limin Wang - Nanjing University - Verified email at nju.edu.cn
Yirui Wu - Hohai University - Verified email at hhu.edu.cn
Dahua Lin - The Chinese University of Hong Kong - Verified email at ie.cuhk.edu.hk
Weiyun Wang - Shanghai AI Laboratory; Fudan University - Verified email at pjlab.org.cn

Wenhai Wang (王文海)

CUHK | Shanghai AI Laboratory | NJU

Verified email at cuhk.edu.hk

Computer Vision | Foundation Models | OCR


Title	Authors	Venue	Cited by	Year
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity	Y Liu, Y Cao, Z Gao, W Wang, Z Chen, W Wang, H Tian, L Lu, X Zhu, T Lu, ...	arXiv preprint arXiv:2407.15838, 2024		2024
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output	P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ...	arXiv preprint arXiv:2407.03320, 2024		2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text	Q Li, Z Chen, W Wang, W Wang, S Ye, Z Jin, G Chen, Y He, Z Gao, E Cui, ...	arXiv preprint arXiv:2406.08418, 2024		2024
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks	J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang, Z Chen, X Zhu, L Lu, T Lu, ...	arXiv preprint arXiv:2406.08394, 2024	1	2024
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning	C Yang, X Zhu, J Zhu, W Su, J Wang, X Dong, W Wang, L Lu, B Li, J Zhou, ...	arXiv preprint arXiv:2406.07543, 2024		2024
Needle In A Multimodal Haystack	W Wang, S Zhang, Y Ren, Y Duan, T Li, S Liu, M Hu, Z Chen, K Zhang, ...	arXiv preprint arXiv:2406.07230, 2024		2024
LLMs Meet Multimodal Generation and Editing: A Survey	Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ...	arXiv preprint arXiv:2405.19334, 2024	3	2024
VLG: General Video Recognition with Web Textual Knowledge	J Lin, Z Liu, W Wang, W Wu, L Wang	International Journal of Computer Vision, 1-26, 2024	1	2024
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites	Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...	arXiv preprint arXiv:2404.16821, 2024	54	2024
InternLM-XComposer2-4KHD: A pioneering large vision-language model handling resolutions from 336 pixels to 4K HD	X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...	arXiv preprint arXiv:2404.06512, 2024	36	2024
Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments	Y Yang, W Wang, Z Chen, J Dai, L Zheng	International Conference on Learning Representations (ICLR), 2024		2024
Vision-RWKV: Efficient and scalable visual perception with RWKV-like architectures	Y Duan, W Wang, Z Chen, X Zhu, L Lu, T Lu, Y Qiao, H Li, J Dai, W Wang	arXiv preprint arXiv:2403.02308, 2024	16	2024
The all-seeing project v2: Towards general relation comprehension of the open world	W Wang, Y Ren, H Luo, T Li, C Yan, Z Chen, W Wang, Q Li, L Lu, X Zhu, ...	arXiv preprint arXiv:2402.19474, 2024	12	2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis	Y Mu, J Chen, Q Zhang, S Chen, Q Yu, C Ge, R Chen, Z Liang, M Hu, ...	arXiv preprint arXiv:2402.16117, 2024	4	2024
MM-Interleaved: Interleaved image-text generative modeling via multi-modal feature synchronizer	C Tian, X Zhu, Y Xiong, W Wang, Z Chen, W Wang, Y Chen, L Lu, T Lu, ...	arXiv preprint arXiv:2401.10208, 2024	21	2024
Feature Selection Based on Intrusive Outliers Rather Than All Instances	L Yuan, C Mei, W Wang, T Lu	IEEE Transactions on Image Processing (TIP), 2024		2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications	Y Xiong, Z Li, Y Chen, F Wang, X Zhu, J Luo, W Wang, T Lu, H Li, Y Qiao, ...	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024	15	2024
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks	Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024	124*	2024
AVSegFormer: Audio-visual segmentation with transformer	S Gao, Z Chen, G Chen, W Wang, T Lu	AAAI Conference on Artificial Intelligence (AAAI), 2024	24	2024
The all-seeing project: Towards panoptic visual recognition and understanding of the open world	W Wang, M Shi, Q Li, W Wang, Z Huang, L Xing, Z Chen, H Li, X Zhu, ...	International Conference on Learning Representations (ICLR), 2024	40	2024
