Jiahui Yu

Cited by

	All	Since 2019
Citations	21754	21433
h-index	42	42
i10-index	53	53

7000

3500

1750

5250

2018201920202021202220232024234 818 1571 2840 3947 6357 5852

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yonghui WuGoogle BrainVerified email at google.com
Thomas S. HuangUniversity of Illinois, Urbana-ChampaignVerified email at ifp.uiuc.edu
Wei HanOpenAIVerified email at illinois.edu
Chung-Cheng ChiuAppleVerified email at apple.com
Jimei YangSenior Research Scientist, Adobe ResearchVerified email at adobe.com
Xin LuTikTok/BytedanceVerified email at bytedance.com
Ruoming Pang (庞若鸣)Apple AI/MLVerified email at apple.com
Zhe L. LinSenior Principal Scientist, Adobe ResearchVerified email at adobe.com
Anmol GulatiResearcher, Google DeepmindVerified email at google.com
James QinGoogleVerified email at google.com
Zirui WangResearch Scientist, Apple AI/MLVerified email at apple.com
Zhengdong ZhangResearch Scientist, Google BrainVerified email at csail.mit.edu
Yuchen FanUniversity of IllinoisVerified email at illinois.edu
Yu ZhangOpenAIVerified email at csail.mit.edu
Tara SainathPrincipal Research Scientist, GoogleVerified email at google.com
Vijay VasudevanGoogle, Inc.Verified email at google.com
Xiaohui ShenByteDance ResearchVerified email at bytedance.com
Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
Ding LiuMetaVerified email at meta.com
Jason BaldridgeResearch Scientist, GoogleVerified email at google.com

Jiahui Yu

Research Scientist, OpenAI

Verified email at openai.com - Homepage

Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Vector-Quantized Image Modeling YU Jiahui, X Li, H Zhang, V Vasudevan, AYS Ku, JM Baldridge, Y Xu, ... US Patent App. 18/520,083, 2024		2024
Module-wise adaptive distillation for multimodality foundation models C Liang, J Yu, MH Yang, M Brown, Y Cui, T Zhao, B Gong, T Zhou Advances in Neural Information Processing Systems 36, 2024	2	2024
Parrot: Pareto-optimal multi-reward reinforcement learning framework for text-to-image generation SH Lee, Y Li, J Ke, I Yoo, H Zhang, J Yu, Q Wang, F Deng, G Entis, J He, ... arXiv preprint arXiv:2401.05675, 2024	5	2024
De-diffusion makes text a strong cross-modal interface C Wei, C Liu, S Qiao, Z Zhang, A Yuille, J Yu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	2	2024
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1140	2023
Pyramid attention network for image restoration Y Mei, Y Fan, Y Zhang, J Yu, Y Zhou, D Liu, Y Fu, TS Huang, H Shi International Journal of Computer Vision 131 (12), 3207-3225, 2023	173	2023
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers C Yang, S Qiao, Y Cao, Y Zhang, T Zhu, A Yuille, J Yu arXiv preprint arXiv:2311.17072, 2023		2023
Contrastive captioning neural networks YU Jiahui, Z Wang, V Vasudevan, HM Yeung, SMS Tarzjani, Y Wu US Patent App. 18/141,340, 2023		2023
Combined scaling for zero-shot transfer learning H Pham, Z Dai, G Ghiasi, K Kawaguchi, H Liu, AW Yu, J Yu, YT Chen, ... Neurocomputing 555, 126658, 2023	151	2023
Systems and Methods for Pretraining Image Processing Models Z Wang, YU Jiahui, Y Cao, W Yu, Z Dai US Patent App. 17/685,774, 2023	1	2023
Systems and Methods for Training Dual-Mode Machine-Learned Speech Recognition Models YU Jiahui, R Pang, W Han, A Gulati, CC Chiu, B Li, TN Sainath, Y Hu US Patent App. 18/011,571, 2023		2023
Audiopalm: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023	104	2023
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	1098	2023
Optimizing Inference Performance for Conformer TN Sainath, R Botros, A Gulati, K Choromanski, R Pang, T Strohman, ... US Patent App. 17/936,547, 2023	1	2023
Predicting Word Boundaries for On-Device Batching of End-To-End Speech Recognition Models SJP Bijwadia, TN Sainath, YU Jiahui, S Chang, Y He US Patent App. 17/934,184, 2023		2023
Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR R Botros, A Gulati, TN Sainath, K Choromanski, R Pang, T Strohman, ... arXiv preprint arXiv:2304.00171, 2023	2	2023
Cobit: A contrastive bi-directional image-text generation model H You, M Guo, Z Wang, KW Chang, J Baldridge, J Yu arXiv preprint arXiv:2303.13455, 2023	14	2023
Noise2music: Text-conditioned music generation with diffusion models Q Huang, DS Park, T Wang, TI Denk, A Ly, N Chen, Z Zhang, Z Zhang, ... arXiv preprint arXiv:2302.03917, 2023	123	2023
Gemini: A family of highly capable multimodal models R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805 1, 2023	111	2023
Vila: Learning image aesthetics from user comments with vision-language pretraining J Ke, K Ye, J Yu, Y Wu, P Milanfar, F Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	38	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors