Follow
Yu Wu (吴俣)
Yu Wu (吴俣)
Microsoft Research Asia
Verified email at microsoft.com - Homepage
Title
Cited by
Year
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
Z Wang, D Chen, D Dai, R Xu, Z Li, Y Wu
arXiv preprint arXiv:2407.01906, 2024
2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ...
arXiv preprint arXiv:2406.11931, 2024
62024
Speechlm: Enhanced speech pre-training with unpaired textual data
Z Zhang, S Chen, L Zhou, Y Wu, S Ren, S Liu, Z Yao, X Gong, L Dai, J Li, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
362024
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Z Shao, P Wang, Q Zhu, R Xu, J Song, M Zhang, YK Li, Y Wu, D Guo
arXiv preprint arXiv:2402.03300, 2024
432024
DeepSeek-Coder: When the Large Language Model Meets Programming--The Rise of Code Intelligence
D Guo, Q Zhu, D Yang, Z Xie, K Dong, W Zhang, G Chen, X Bi, Y Wu, ...
arXiv preprint arXiv:2401.14196, 2024
1582024
Advanced Long-Content Speech Recognition with Factorized Neural Transducer
X Gong, Y Wu, J Li, S Liu, R Zhao, X Chen, Y Qian
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
32024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
472024
Deepseek llm: Scaling open-source language models with longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
992024
Math-shepherd: Verify and reinforce llms step-by-step without human annotations
P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
43*2024
UNIFIED SPEECH REPRESENTATION LEARNING
Y Qian, Y WU, K Kumatani, S Liu, F Wei, N Zeng, XD Huang, C Wang
US Patent App. 18/217,888, 2023
2023
WavMark: Watermarking for Audio Generation
G Chen, Y Wu, S Liu, T Liu, X Du, F Wei
arXiv preprint arXiv:2308.12770, 2023
122023
Unified speech representation learning
Y Qian, Y Wu, K Kumatani, S Liu, F Wei, N Zeng, XD Huang, C Wang
US Patent 11,735,171, 2023
2023
On decoder-only architecture for speech-to-text and large language model integration
J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ...
Proc. ASRU 2023, 2023
482023
Accelerating Transducers through Adjacent Token Merging
Y Li, Y Wu, J Li, S Liu
Proc. ASRU 2023, 2023
2023
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Y Li, Y Wu, J Li, S Liu
Proc. ASRU 2023, 2023
282023
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
T Wang, L Zhou, Z Zhang, Y Wu, S Liu, Y Gaur, Z Chen, J Li, F Wei
arXiv preprint arXiv:2305.16107, 2023
612023
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2303.03926, 2023
922023
Exploring WavLM on Speech Enhancement
H Song, S Chen, Z Chen, Y Wu, T Yoshioka, M Tang, JW Shin, S Liu
2022 IEEE Spoken Language Technology Workshop (SLT), 451-457, 2023
162023
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2301.02111, 2023
3902023
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers
P Wang, E Sun, J Xue, Y Wu, L Zhou, Y Gaur, S Liu, J Li
Proc. Interspeech, 57-61, 2023
13*2023
The system can't perform the operation now. Try again later.
Articles 1–20