Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 736 | 2019 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 226 | 2020 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 209 | 2019 |
Two-pass end-to-end speech recognition TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ... arXiv preprint arXiv:1908.10992, 2019 | 167 | 2019 |
Towards fast and accurate streaming end-to-end ASR B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 135 | 2020 |
A better and faster end-to-end model for streaming ASR B Li, A Gulati, J Yu, TN Sainath, CC Chiu, A Narayanan, SY Chang, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 127 | 2021 |
Streaming small-footprint keyword spotting using sequence-to-sequence models Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 105 | 2017 |
VoiceFilter-Lite: Streaming targeted voice separation for on-device speech recognition Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ... arXiv preprint arXiv:2009.04323, 2020 | 101 | 2020 |
FastEmit: Low-latency streaming ASR with sequence-level emission regularization J Yu, CC Chiu, B Li, S Chang, TN Sainath, Y He, A Narayanan, W Han, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 95 | 2021 |
Deep Neural Network Based Spectral Feature Mapping for Robust Speech Recognition K Han, Y He, D Bagchi, E Fosler-Lussier, DL Wang INTERSPEECH 2015, 2015 | 77 | 2015 |
Tied & reduced RNN-T decoder R Botros, TN Sainath, R David, E Guzman, W Li, Y He arXiv preprint arXiv:2109.07513, 2021 | 58 | 2021 |
Confidence estimation for attention-based sequence-to-sequence models for speech recognition Q Li, D Qiu, Y Zhang, B Li, Y He, PC Woodland, L Cao, T Strohman ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 49 | 2021 |
Conditional random fields in speech, audio, and language processing E Fosler-Lussier, Y He, P Jyothi, R Prabhavalkar Proceedings of the IEEE 101 (5), 1054-1075, 2013 | 49 | 2013 |
Large-scale ASR domain adaptation using self- and semi-supervised learning D Hwang, A Misra, Z Huo, N Siddhartha, S Garg, D Qiu, KC Sim, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 48 | 2022 |
Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition D Bagchi, MI Mandel, Z Wang, Y He, A Plummer, E Fosler-Lussier 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 47 | 2015 |
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. TN Sainath, Y He, A Narayanan, R Botros, R Pang, D Rybach, C Allauzen, ... Interspeech 8, 1777-1781, 2021 | 45 | 2021 |
Joint endpointing and decoding with end-to-end models SY Chang, R Prabhavalkar, Y He, TN Sainath, G Simko ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 45 | 2019 |
Less is more: Improved RNN-T decoding using limited label context and path merging R Prabhavalkar, Y He, D Rybach, S Campbell, A Narayanan, T Strohman, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 36 | 2021 |
Subword-based modeling for handling OOV words in keyword spotting Y He, B Hutchinson, P Baumann, M Ostendorf, E Fosler-Lussier, ... Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International …, 2014 | 35 | 2014 |
Learning word-level confidence for subword end-to-end ASR D Qiu, Q Li, Y He, Y Zhang, B Li, L Cao, R Prabhavalkar, D Bhatia, W Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 31 | 2021 |