Deep voice: Real-time neural text-to-speech SÖ Arık, M Chrzanowski, A Coates, G Diamos, A Gibiansky, Y Kang, X Li, ... International conference on machine learning, 195-204, 2017 | 780 | 2017 |
Data-driven 3D visual pronunciation of Chinese IPA for language learning J Yu, A Li, F Hu, Q Fang, C Jiang, X Li, J Yang, Z Wang 2013 International Conference Oriental COCOSDA held jointly with 2013 …, 2013 | 19 | 2013 |
Realtime speech-driven facial animation using Gaussian Mixture Models C Luo, J Yu, X Li, Z Wang 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 1-6, 2014 | 15 | 2014 |
A HMM-based mandarin chinese singing voice synthesis system X Li, Z Wang IEEE/CAA Journal of Automatica Sinica 3 (2), 192-202, 2016 | 14 | 2016 |
ATST: Audio representation learning with teacher-student transformer X Li, X Li arXiv preprint arXiv:2204.12076, 2022 | 13 | 2022 |
An emotional harmony generation system B Xu, S Wang, X Li IEEE Congress on Evolutionary Computation, 1-7, 2010 | 10 | 2010 |
HMM based speech-driven 3D tongue animation C Luo, J Yu, X Li, L Zhang 2017 IEEE international conference on image processing (ICIP), 4377-4381, 2017 | 4 | 2017 |
Self-supervised audio teacher-student transformer for both clip-level and frame-level tasks X Li, N Shao, X Li IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 3 | 2024 |
DVQVC: An Unsupervised Zero-Shot Voice Conversion Framework D Li, X Li, X Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Frame correlation based autoregressive GMM method for voice conversion X Li, Z Wang The 9th International Symposium on Chinese Spoken Language Processing, 221-225, 2014 | 2 | 2014 |
Connecting the Dots in Self-Supervised Learning: A Brief Survey for Beginners PF Fang, X Li, Y Yan, S Zhang, QY Kang, XF Li, ZZ Lan Journal of Computer Science and Technology 37 (3), 507-526, 2022 | 1 | 2022 |
An evaluation framework for virtual articulatory movements based on medical video R LI, J YU, X LI, P FANG, Z WANG Chinese Journal of Electronics 28 (3), 585-592, 2019 | 1 | 2019 |
Fine-tune the pretrained ATST model for sound event detection N Shao, X Li, X Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |