Mohan Li

Cited by

	All	Since 2019
Citations	122	121
h-index	6	6
i10-index	3	3

20182019202020212022202320241 6 12 26 27 31 19

Co-authors

Rama Sanand DoddipatlaToshiba Europe Ltd.Verified email at toshiba.eu
Tudor Catalin ZorilaResearch Engineer at Toshiba Cambridge Research Laboratory, Cambridge UKVerified email at toshiba.eu
Cong-Thanh DOToshiba Research Europe LimitedVerified email at crl.toshiba.co.uk
Shucong ZhangSamsung AI CambridgeVerified email at samsung.com
Yuanjiang CaoUnversity of New South WalesVerified email at unsw.edu.au
Tobias Cord-LandwehrPaderborn UniversityVerified email at mail.upb.de
Christoph BoeddekerPaderborn UniversityVerified email at mail.upb.de
Reinhold Haeb-UmbachProfessor of Communications Engineering, University of PaderbornVerified email at nt.uni-paderborn.de

Mohan Li

Toshiba Europe Ltd

Verified email at toshiba.eu

automatic speech recognition spoken language understanding


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
End-to-end speech recognition with adaptive computation steps M Li, M Liu, H Masanori ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	38	2019
Transformer-based online speech recognition with decoder-end adaptive computation steps M Li, C Zorilă, R Doddipatla 2021 IEEE spoken language technology workshop (SLT), 1-7, 2021	22	2021
Head-synchronous decoding for transformer-based streaming asr M Li, C Zorilă, R Doddipatla ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	21	2021
Non-autoregressive end-to-end approaches for joint automatic speech recognition and spoken language understanding M Li, R Doddipatla 2022 IEEE Spoken Language Technology Workshop (SLT), 390-397, 2023	8	2023
Transformer-based streaming ASR with cumulative attention M Li, S Zhang, C Zorilă, R Doddipatla ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	7	2022
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results. M Li, Y Cao, W Zhou, M Liu Interspeech, 1641-1645, 2019	6	2019
An investigation into the multi-channel time domain speaker extraction network C Zorilă, M Li, R Doddipatla 2021 IEEE Spoken Language Technology Workshop (SLT), 793-800, 2021	4	2021
Multiple-hypothesis RNN-T Loss for unsupervised fine-tuning and self-training of neural transducer CT Do, M Li, R Doddipatla arXiv preprint arXiv:2207.14736, 2022	3	2022
Toshiba’s speech recognition system for the CHiME 2020 challenge C Zorila, M Li, D Hayakawa, M Liu, N Ding, R Doddipatla Proc. of The 6th Intl. Workshop on Speech Processing in Everyday …, 2020	3	2020
Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding M Li, S Keizer, R Doddipatla arXiv preprint arXiv:2406.15209, 2024	2	2024
Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs M Li, C Zorilă, CT Do, R Doddipatla 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	2	2023
Improving HS-DACS based streaming Transformer ASR with deep reinforcement learning M Li, R Doddipatla 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	2	2021
DiaLoc: An Iterative Approach to Embodied Dialog Localization C Zhang, M Li, I Budvytis, S Liwicki Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	1	2024
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and Rescoring M Li, CT Do, R Doddipatla ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	1	2023
Towards a speaker diarization system for the CHiME 2020 dinner party transcription C Boeddeker, T Cord-Landwehr, J Heitkaemper, C Zorila, D Hayakawa, ... Proc. 6th International Workshop on Speech Processing in Everyday …, 2020	1	2020
Domain Adaptive Self-supervised Training of Automatic Speech Recognition CT Do, R Doddipatla, M Li, T Hain	1
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding M Li, CT Do, S Keizer, Y Farag, S Stoyanchev, R Doddipatla arXiv preprint arXiv:2408.16423, 2024		2024
Speech recognition systems and methods LI Mohan, T Zorila, RS Doddipatla US Patent 12,002,450, 2024		2024
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition M Li, R Doddipatla, C Zorila Proc. Interspeech 2022, 2088-2092, 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors