Liumeng Xue
The Chinese University of Hong Kong, Shenzhen; Northwestern Polytechnical University
Verified email at cuhk.edu.cn
Title
Cited by
Year
Controllable emotion transfer for end-to-end speech synthesis
T Li, S Yang, L Xue, L Xie
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
Cited by: 85; Year: 2021
Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis
X Zhu, Y Zhang, S Yang, L Xue, L Xie
IEEE Access 7, 65955-65964, 2019
Cited by: 38; Year: 2019
Building a mixed-lingual neural TTS system with only monolingual data
L Xue, W Song, G Xu, L Xie, Z Wu
arXiv preprint arXiv:1904.06063, 2019
Cited by: 36; Year: 2019
On the localness modeling for the self-attention based end-to-end speech synthesis
S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu
Neural Networks 125, 121-130, 2020
Cited by: 33; Year: 2020
ParaTTS: Learning linguistic and prosodic cross-sentence information in paragraph-based TTS
L Xue, FK Soong, S Zhang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2854-2864, 2022
Cited by: 23; Year: 2022
Cycle consistent network for end-to-end style transfer TTS training
L Xue, S Pan, L He, L Xie, FK Soong
Neural Networks 140, 223-236, 2021
Cited by: 22; Year: 2021
Building a controllable expressive speech synthesis system with multiple emotion strengths
X Zhu, L Xue
Cognitive Systems Research 59, 151-159, 2020
Cited by: 20; Year: 2020
ChatMusician: Understanding and generating music intrinsically with LLM
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
arXiv preprint arXiv:2402.16153, 2024
Cited by: 19; Year: 2024
Expressive-VC: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 11; Year: 2023
Amphion: An open-source audio, music and speech generation toolkit
X Zhang, L Xue, Y Wang, Y Gu, X Chen, Z Fang, H Chen, L Zou, C Wang, ...
arXiv preprint arXiv:2312.09911, 2023
Cited by: 8; Year: 2023
A comparison of expressive speech synthesis approaches based on neural network
L Xue, X Zhu, X An, L Xie
Proceedings of the Joint Workshop of the 4th Workshop on Affective Social …, 2018
Cited by: 7; Year: 2018
WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
L Ma, D Guo, K Song, Y Jiang, S Wang, L Xue, W Xu, H Zhao, B Zhang, ...
arXiv preprint arXiv:2406.05763, 2024
Cited by: 5; Year: 2024
Multi-scale sub-band constant-Q transform discriminator for high-fidelity vocoder
Y Gu, X Zhang, L Xue, Z Wu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 5; Year: 2024
Leveraging content-based features from multiple acoustic models for singing voice conversion
X Zhang, Y Gu, H Chen, Z Fang, L Zou, L Xue, Z Wu
arXiv preprint arXiv:2310.11160, 2023
Cited by: 5; Year: 2023
Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation
H Li, L Xue, H Guo, X Zhu, Y Lv, L Xie, Y Chen, H Yin, Z Li
arXiv preprint arXiv:2406.07422, 2024
Cited by: 3; Year: 2024
An initial investigation of neural replay simulator for over-the-air adversarial perturbations to automatic speaker verification
J Li, L Wang, L Xue, L Wang, Z Wu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 3; Year: 2024
A Kullback-Leibler divergence based recurrent mixture density network for acoustic modeling in emotional statistical parametric speech synthesis
X An, Y Zhang, B Liu, L Xue, L Xie
Proceedings of the Joint Workshop of the 4th Workshop on Affective Social …, 2018
Cited by: 3; Year: 2018
HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS
D Guo, X Zhu, L Xue, T Li, Y Lv, Y Jiang, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
Cited by: 2; Year: 2023
Multi-level temporal-channel speaker retrieval for robust zero-shot voice conversion
Z Wang, L Xue, Q Kong, L Xie, Y Chen, Q Tian, Y Wang
arXiv preprint arXiv:2305.07204, 2023
Cited by: 2; Year: 2023
Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice Conversion
Z Wang, L Xue, Q Kong, L Xie, Y Chen, Q Tian, Y Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 1; Year: 2024