Mohan Li
Toshiba Europe Ltd
Verified email at toshiba.eu
Title
Cited by
Year
End-to-end speech recognition with adaptive computation steps
M Li, M Liu, H Masanori
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 38 | Year: 2019
Transformer-based online speech recognition with decoder-end adaptive computation steps
M Li, C Zorilă, R Doddipatla
2021 IEEE spoken language technology workshop (SLT), 1-7, 2021
Cited by: 22 | Year: 2021
Head-synchronous decoding for transformer-based streaming asr
M Li, C Zorilă, R Doddipatla
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 21 | Year: 2021
Non-autoregressive end-to-end approaches for joint automatic speech recognition and spoken language understanding
M Li, R Doddipatla
2022 IEEE Spoken Language Technology Workshop (SLT), 390-397, 2023
Cited by: 8 | Year: 2023
Transformer-based streaming ASR with cumulative attention
M Li, S Zhang, C Zorilă, R Doddipatla
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 7 | Year: 2022
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results.
M Li, Y Cao, W Zhou, M Liu
Interspeech, 1641-1645, 2019
Cited by: 6 | Year: 2019
An investigation into the multi-channel time domain speaker extraction network
C Zorilă, M Li, R Doddipatla
2021 IEEE Spoken Language Technology Workshop (SLT), 793-800, 2021
Cited by: 4 | Year: 2021
Multiple-hypothesis RNN-T Loss for unsupervised fine-tuning and self-training of neural transducer
CT Do, M Li, R Doddipatla
arXiv preprint arXiv:2207.14736, 2022
Cited by: 3 | Year: 2022
Toshiba’s speech recognition system for the CHiME 2020 challenge
C Zorila, M Li, D Hayakawa, M Liu, N Ding, R Doddipatla
Proc. of The 6th Intl. Workshop on Speech Processing in Everyday …, 2020
Cited by: 3 | Year: 2020
Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding
M Li, S Keizer, R Doddipatla
arXiv preprint arXiv:2406.15209, 2024
Cited by: 2 | Year: 2024
Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs
M Li, C Zorilă, CT Do, R Doddipatla
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 2 | Year: 2023
Improving HS-DACS based streaming Transformer ASR with deep reinforcement learning
M Li, R Doddipatla
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 2 | Year: 2021
DiaLoc: An Iterative Approach to Embodied Dialog Localization
C Zhang, M Li, I Budvytis, S Liwicki
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 1 | Year: 2024
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and Rescoring
M Li, CT Do, R Doddipatla
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 1 | Year: 2023
Towards a speaker diarization system for the CHiME 2020 dinner party transcription
C Boeddeker, T Cord-Landwehr, J Heitkaemper, C Zorila, D Hayakawa, ...
Proc. 6th International Workshop on Speech Processing in Everyday …, 2020
Cited by: 1 | Year: 2020
Domain Adaptive Self-supervised Training of Automatic Speech Recognition
CT Do, R Doddipatla, M Li, T Hain
Cited by: 1
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding
M Li, CT Do, S Keizer, Y Farag, S Stoyanchev, R Doddipatla
arXiv preprint arXiv:2408.16423, 2024
2024
Speech recognition systems and methods
LI Mohan, T Zorila, RS Doddipatla
US Patent 12,002,450, 2024
2024
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition
M Li, R Doddipatla, C Zorila
Proc. Interspeech 2022, 2088-2092, 2022
2022