Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 584 | 2019 |
Deep learning for audio signal processing H Purwins, B Li, T Virtanen, J Schlüter, SY Chang, T Sainath IEEE Journal of Selected Topics in Signal Processing 13 (2), 206-219, 2019 | 536 | 2019 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 186 | 2020 |
Towards fast and accurate streaming end-to-end ASR B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 110 | 2020 |
A better and faster end-to-end model for streaming asr B Li, A Gulati, J Yu, TN Sainath, CC Chiu, A Narayanan, SY Chang, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 87 | 2021 |
Robust CNN-based speech recognition with Gabor filter kernels SY Chang, N Morgan Fifteenth annual conference of the international speech communication …, 2014 | 87 | 2014 |
Temporal modeling using dilated convolution and gating for voice-activity-detection SY Chang, B Li, G Simko, TN Sainath, A Tripathi, A van den Oord, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 70 | 2018 |
Fastemit: Low-latency streaming asr with sequence-level emission regularization J Yu, CC Chiu, B Li, S Chang, TN Sainath, Y He, A Narayanan, W Han, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 69 | 2021 |
Personal VAD: Speaker-conditioned voice activity detection S Ding, Q Wang, S Chang, L Wan, IL Moreno arXiv preprint arXiv:1908.04284, 2019 | 59 | 2019 |
Improved End-of-Query Detection for Streaming Speech Recognition. M Shannon, G Simko, SY Chang, C Parada Interspeech, 1909-1913, 2017 | 41 | 2017 |
Joint endpointing and decoding with end-to-end models SY Chang, R Prabhavalkar, Y He, TN Sainath, G Simko ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 37 | 2019 |
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition. SY Chang, B Li, TN Sainath, G Simko, C Parada Interspeech, 3812-3816, 2017 | 33 | 2017 |
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition. SY Chang, B Li, TN Sainath, G Simko, C Parada Interspeech, 3812-3816, 2017 | 33 | 2017 |
An efficient streaming non-recurrent on-device end-to-end model with improvements to rare-word modeling TN Sainath, YR He, A Narayanan, R Botros, R Pang, DJ Rybach, ... | 31 | 2021 |
The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions SHK Parthasarathi, SY Chang, J Cohen, N Morgan, S Wegmann 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 20 | 2013 |
Spectro-temporal features for noise-robust speech recognition using power-law nonlinearity and power-bias subtraction SY Chang, BT Meyer, N Morgan 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 20 | 2013 |
Improving the latency and quality of cascaded encoders TN Sainath, Y He, A Narayanan, R Botros, W Wang, D Qiu, CC Chiu, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 18 | 2022 |
On the importance of modeling and robustness for deep neural network feature SY Chang, S Wegmann 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 15 | 2015 |
Low Latency Speech Recognition Using End-to-End Prefetching. SY Chang, B Li, D Rybach, Y He, W Li, TN Sainath, T Strohman Interspeech, 1962-1966, 2020 | 12 | 2020 |
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification C Zhang, B Li, T Sainath, T Strohman, S Mavandadi, S Chang, P Haghani arXiv preprint arXiv:2209.06058, 2022 | 10 | 2022 |