Zhenyao Zhu
Zhenyao Zhu
Verified email at google.com
TitleCited byYear
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
D Amodei, R Anubhai, E Battenberg, C Case, J Casper, B Catanzaro, ...
International Conference on Machine Learning (ICML), 2015
12372015
Deep learning identity-preserving face space
Z Zhu, P Luo, X Wang, X Tang
2013 IEEE International Conference on Computer Vision (ICCV), 113-120, 2013
2682013
Multi-view perceptron: a deep model for learning face identity and view representations
Z Zhu, P Luo, X Wang, X Tang
Advances in Neural Information Processing Systems (NIPS), 217-225, 2014
1702014
Deep Speaker: an End-to-End Neural Speaker Embedding System
C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao, A Kannan, Z Zhu
arXiv preprint arXiv:1705.02304, 2017
1432017
DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
W Ouyang, P Luo, X Zeng, S Qiu, Y Tian, H Li, S Yang, Z Wang, Y Xiong, ...
arXiv preprint arXiv:1409.3505, 2014
1072014
Recover canonical-view faces in the wild with deep neural networks
Z Zhu, P Luo, X Wang, X Tang
arXiv preprint arXiv:1404.3543, 2014
872014
Face Model Compression by Distilling Knowledge from Neurons.
P Luo, Z Zhu, Z Liu, X Wang, X Tang
The AAAI Conference on Artificial Intelligence (AAAI) 2016, 3560-3566, 2015
722015
Exploring Neural Transducers for End-to-End Speech Recognition
E Battenberg, J Chen, R Child, A Coates, Y Gaur, Y Li, H Liu, S Satheesh, ...
Automatic Speech Recognition and Understanding (ASRU) 2017, 2017
632017
Methods and systems for verifying face images based on canonical images
X Tang, ZHU Zhenyao, P Luo, X Wang
US Patent 10,037,457, 2018
46*2018
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
H Liu, Z Zhu, X Li, S Satheesh
International Conference on Machine Learning (ICML), 2017, 2017
402017
Fully supervised speaker diarization
A Zhang, Q Wang, Z Zhu, J Paisley, C Wang
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
272019
Deep learning multi-view representation for face recognition
Z Zhu, P Luo, X Wang, X Tang
arXiv preprint arXiv:1406.6947, 2014
272014
Learning Multiscale Features Directly From Waveforms
Z Zhu, JH Engel, A Hannun
International Speech Communication Association (Interspeech) 2016, 2016
242016
Deployed end-to-end speech recognition
B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ...
US Patent App. 15/358,083, 2017
152017
Methods and Systems for Verifying Face Images Based on Canonical Images
X Tang, Z Zhu, P Luo, X Wang
US Patent 20,170,083,754, 2017
72017
Reducing Bias in Production Speech Models
E Battenberg, R Child, A Coates, C Fougner, Y Gaur, J Huang, H Jun, ...
arXiv preprint arXiv:1705.04400, 2017
42017
End-to-end speech recognition
B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ...
US Patent 10,332,509, 2019
32019
Principled Hybrids of Generative and Discriminative Domain Adaptation
H Zhao, Z Zhu, J Hu, A Coates, G Gordon
arXiv preprint arXiv:1705.09011, 2017
32017
Deployed end-to-end speech recognition
B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ...
US Patent App. 10/319,374, 2019
22019
Systems and methods for principled bias reduction in production speech models
E Battenberg, R CHILD, A Coates, C Fougner, G Yashesh, J Huang, ...
US Patent App. 15/884,239, 2018
22018
The system can't perform the operation now. Try again later.
Articles 1–20