Shuo-yiin Chang

Cited by

	All	Since 2019
Citations	3066	2953
h-index	18	17
i10-index	24	21

Citations per year

Year	Citations
2014	11
2015	22
2016	16
2017	29
2018	24
2019	109
2020	345
2021	569
2022	591
2023	743
2024	590

Public access

1 article available, 0 articles not available (based on funding mandates)

Senior Staff Research Scientist, Google

Verified email at google.com

deep learning, speech recognition, natural language processing


Title	Authors	Venue	Cited by	Year
Deep learning for audio signal processing	H Purwins, B Li, T Virtanen, J Schlüter, SY Chang, T Sainath	IEEE Journal of Selected Topics in Signal Processing 13 (2), 206-219, 2019	727	2019
Streaming end-to-end speech recognition for mobile devices	Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	688	2019
Gemini: a family of highly capable multimodal models	G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	442	2023
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency	TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ...	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	217	2020
Towards fast and accurate streaming end-to-end ASR	B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	124	2020
A better and faster end-to-end model for streaming ASR	B Li, A Gulati, J Yu, TN Sainath, CC Chiu, A Narayanan, SY Chang, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	115	2021
Robust CNN-based speech recognition with Gabor filter kernels	SY Chang, N Morgan	Interspeech, 905-909, 2014	98	2014
FastEmit: Low-latency streaming ASR with sequence-level emission regularization	J Yu, CC Chiu, B Li, S Chang, TN Sainath, Y He, A Narayanan, W Han, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	89	2021
Temporal modeling using dilated convolution and gating for voice-activity-detection	SY Chang, B Li, G Simko, TN Sainath, A Tripathi, A van den Oord, ...	2018 IEEE international conference on acoustics, speech and signal …, 2018	78	2018
Personal VAD: Speaker-conditioned voice activity detection	S Ding, Q Wang, S Chang, L Wan, IL Moreno	arXiv preprint arXiv:1908.04284, 2019	77	2019
Improved End-of-Query Detection for Streaming Speech Recognition	M Shannon, G Simko, SY Chang, C Parada	Interspeech, 1909-1913, 2017	47	2017
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling	TN Sainath, Y He, A Narayanan, R Botros, R Pang, D Rybach, C Allauzen, ...	Interspeech 8, 1777-1781, 2021	40	2021
Joint endpointing and decoding with end-to-end models	SY Chang, R Prabhavalkar, Y He, TN Sainath, G Simko	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	40	2019
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition	SY Chang, B Li, TN Sainath, G Simko, C Parada	Interspeech, 3812-3816, 2017	34	2017
Improving the latency and quality of cascaded encoders	TN Sainath, Y He, A Narayanan, R Botros, W Wang, D Qiu, CC Chiu, ...	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	25	2022
Spectro-temporal features for noise-robust speech recognition using power-law nonlinearity and power-bias subtraction	SY Chang, BT Meyer, N Morgan	2013 IEEE international conference on acoustics, speech and signal …, 2013	21	2013
Streaming end-to-end multilingual speech recognition with joint language identification	C Zhang, B Li, T Sainath, T Strohman, S Mavandadi, SY Chang, ...	arXiv preprint arXiv:2209.06058, 2022	20	2022
The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions	SHK Parthasarathi, SY Chang, J Cohen, N Morgan, S Wegmann	2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	20	2013
E2E segmenter: Joint segmenting and decoding for long-form ASR	WR Huang, S Chang, D Rybach, R Prabhavalkar, TN Sainath, C Allauzen, ...	arXiv preprint arXiv:2204.10749, 2022	18	2022
