Title | Authors | Venue | Cited by | Year |
Phasen: A phase-and-harmonics-aware speech enhancement network | D Yin, C Luo, Z Xiong, W Zeng | Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9458-9465, 2020 | 316 | 2020 |
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph | D Yin, X Ren, C Luo, Y Wang, Z Xiong, W Zeng | International Conference on Learning Representations (ICLR), 2022 | 14 | 2022 |
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion | D Yin, C Tang, Y Liu, X Wang, Z Zhao, Y Zhao, Z Xiong, S Zhao, C Luo | Interspeech 2022, 2022 | 13 | 2022 |
TridentSE: Guiding speech enhancement with 32 global tokens | D Yin, Z Zhao, C Tang, Z Xiong, C Luo | arXiv preprint arXiv:2210.12995, 2022 | 10 | 2022 |
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration | C Tang, C Luo, Z Zhao, D Yin, Y Zhao, W Zeng | Interspeech 2021, 2021 | 8 | 2021 |
General-purpose speech representation learning through a self-supervised multi-granularity framework | Y Zhao, D Yin, C Luo, Z Zhao, C Tang, W Zeng, ZJ Zha | arXiv preprint arXiv:2102.01930, 2021 | 8 | 2021 |
ART-V: Auto-Regressive Text-to-Video Generation with Diffusion Models | W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 6 | 2024 |
Microcinema: A divide-and-conquer approach for text-to-video generation | Y Wang, J Bao, W Weng, R Feng, D Yin, T Yang, J Zhang, Q Dai, Z Zhao, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 4 | 2024 |
Learning trajectories are generalization indicators | J Fu, Z Zhang, D Yin, Y Lu, N Zheng | Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss | Z Zhao, L Wu, C Tang, D Yin, Y Zhao, C Luo | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
Decomposing style, content, and motion for videos | Y Hu, D Yin, Y Wang, Z Chen, C Luo | Journal of Visual Communication and Image Representation 89, 103686, 2022 | | 2022 |