State-of-the-art speech recognition with sequence-to-sequence models CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ... International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 1449 | 2018 |
Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 732 | 2019 |
Exploring architectures, data and units for streaming end-to-end speech recognition with rnn-transducer K Rao, H Sak, R Prabhavalkar IEEE Automatic Speech Recognition and Understanding (ASRU), 2017 | 408 | 2017 |
A Comparison of Sequence-to-Sequence Models for Speech Recognition R Prabhavalkar, K Rao, TN Sainath, B Li, L Johnson, N Jaitly Interspeech, 939-943, 2017 | 388 | 2017 |
An analysis of incorporating an external language model into a sequence-to-sequence model A Kannan, Y Wu, P Nguyen, TN Sainath, Z Chen, R Prabhavalkar International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 284 | 2018 |
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition C Donahue, B Li, R Prabhavalkar International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 278 | 2018 |
Personalized speech recognition on mobile devices I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 226 | 2016 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 225 | 2020 |
Google usm: Scaling automatic speech recognition beyond 100 languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023 | 217 | 2023 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 209 | 2019 |
Deep context: end-to-end contextual speech recognition G Pundak, TN Sainath, R Prabhavalkar, A Kannan, D Zhao 2018 IEEE spoken language technology workshop (SLT), 418-425, 2018 | 197 | 2018 |
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models R Prabhavalkar, TN Sainath, Y Wu, P Nguyen, Z Chen, CC Chiu, ... International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 190 | 2018 |
Two-pass end-to-end speech recognition TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ... arXiv preprint arXiv:1908.10992, 2019 | 166 | 2019 |
From audio to semantics: Approaches to end-to-end spoken language understanding P Haghani, A Narayanan, M Bacchiani, G Chuang, N Gaur, P Moreno, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 720-726, 2018 | 166 | 2018 |
Recognizing long-form speech using streaming end-to-end models A Narayanan, R Prabhavalkar, CC Chiu, D Rybach, TN Sainath, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019 | 135 | 2019 |
On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition R Prabhavalkar, O Alsharif, A Bruguier, L McGraw IEEE International Conference on Acoustics, Speech and Signal Processing …, 2016 | 119 | 2016 |
Automatic Gain Control and Multi-Style Training For Robust Small-Footprint Keyword Spotting With Deep Neural Networks R Prabhavalkar, R Alvarez, C Parada, P Nakkiran, TN Sainath International Conference on Acoustics, Speech and Signal Processing, 2015 | 108 | 2015 |
End-to-end speech recognition: A survey R Prabhavalkar, T Hori, TN Sainath, R Schlüter, S Watanabe IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 103 | 2023 |
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw IEEE Automatic Speech Recognition and Understanding (ASRU), 2017 | 103 | 2017 |
Compressing deep neural networks using a rank-constrained topology P Nakkiran, R Alvarez, R Prabhavalkar, C Parada INTERSPEECH, 2015 | 96 | 2015 |