ESPnet: End-to-end speech processing toolkit | S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... | arXiv preprint arXiv:1804.00015, 2018 | 1141 | 2018 |
A comparative study on Transformer vs RNN in speech applications | S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 582 | 2019 |
WaveGrad: Estimating gradients for waveform generation | N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi, W Chan | International Conference on Learning Representations, 2021 | 334 | 2021 |
Deep feature for text-dependent speaker verification | Y Liu, Y Qian, N Chen, T Fu, Y Zhang, K Yu | Speech Communication 73, 1-13, 2015 | 194 | 2015 |
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings | E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 132 | 2020 |
ASSERT: Anti-spoofing with squeeze-excitation and residual networks | CI Lai, N Chen, J Villalba, N Dehak | arXiv preprint arXiv:1904.01120, 2019 | 121 | 2019 |
Multi-task learning for text-dependent speaker verification | N Chen, Y Qian, K Yu | Proc. 16th Annual Conference of the International Speech Communication …, 2015 | 113 | 2015 |
Non-autoregressive transformer for speech recognition | N Chen, S Watanabe, J Villalba, P Żelasko, N Dehak | IEEE Signal Processing Letters 28, 121-125, 2020 | 106 | 2020 |
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and speakers in the wild evaluations | J Villalba, N Chen, D Snyder, D Garcia-Romero, A McCree, G Sell, ... | Computer Speech & Language 60, 101026, 2020 | 101 | 2020 |
Mask CTC: Non-autoregressive end-to-end ASR with CTC and mask predict | Y Higuchi, S Watanabe, N Chen, T Ogawa, T Kobayashi | arXiv preprint arXiv:2005.08700, 2020 | 99 | 2020 |
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. | J Villalba, N Chen, D Snyder, D Garcia-Romero, A McCree, G Sell, ... | Interspeech, 1488-1492, 2019 | 99 | 2019 |
Overview of BTAS 2016 speaker anti-spoofing competition | P Korshunov, S Marcel, H Muckenhirn, AR Gonçalves, AGS Mello, ... | 2016 IEEE 8th international conference on biometrics theory, applications …, 2016 | 90 | 2016 |
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition | R Pappagari, T Wang, J Villalba, N Chen, N Dehak | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 88 | 2020 |
Robust deep feature for spoofing detection—The SJTU system for ASVspoof 2015 challenge | N Chen, Y Qian, H Dinkel, B Chen, K Yu | Sixteenth Annual Conference of the International Speech Communication …, 2015 | 87 | 2015 |
Age estimation in short speech utterances based on LSTM recurrent neural networks | R Zazo, PS Nidadavolu, N Chen, J Gonzalez-Rodriguez, N Dehak | IEEE Access 6, 22524-22530, 2018 | 75 | 2018 |
End-to-end spoofing detection with raw waveform CLDNNs | H Dinkel, N Chen, Y Qian, K Yu | 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 70 | 2017 |
Deep features for automatic spoofing detection | Y Qian, N Chen, K Yu | Speech Communication 85, 43-52, 2016 | 68 | 2016 |
End-to-end Deep Neural Network Age Estimation. | P Ghahremani, PS Nidadavolu, N Chen, J Villalba, D Povey, ... | Interspeech, 277-281, 2018 | 55 | 2018 |
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. | D Snyder, J Villalba, N Chen, D Povey, G Sell, N Dehak, S Khudanpur | Interspeech, 2468-2472, 2019 | 36 | 2019 |
Deep feature engineering for noise robust spoofing detection | Y Qian, N Chen, H Dinkel, Z Wu | IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (10 …, 2017 | 33 | 2017 |