Conversational end-to-end TTS for voice agents H Guo, S Zhang, FK Soong, L He, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021 | 69 | 2021 |
Exemplar-based sparse representation of timbre and prosody for voice conversion H Ming, D Huang, L Xie, S Zhang, M Dong, H Li 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 50 | 2016 |
Fundamental frequency modeling using wavelets for emotional voice conversion H Ming, D Huang, M Dong, H Li, L Xie, S Zhang 2015 International Conference on Affective Computing and Intelligent …, 2015 | 42 | 2015 |
ParaTTS: Learning linguistic and prosodic cross-sentence information in paragraph-based TTS L Xue, FK Soong, S Zhang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2854-2864, 2022 | 22 | 2022 |
Self-supervised context-aware style representation for expressive speech synthesis Y Wu, X Wang, S Zhang, L He, R Song, JY Nie arXiv preprint arXiv:2206.12559, 2022 | 17 | 2022 |
Non-negative matrix factorization using stable alternating direction method of multipliers for source separation S Zhang, D Huang, L Xie, ES Chng, H Li, M Dong 2015 Asia-Pacific Signal and Information Processing Association Annual …, 2015 | 10 | 2015 |
An automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarity DY Huang, L Xie, S Zhang, YSW Lee, J Wu, H Ming, X Tian, C Ding, M Li, ... | 9 | 2016 |
A hybrid virtual bass system with improved phase vocoder and high efficiency S Zhang, L Xie, ZH Fu, Y Yuan The 9th International Symposium on Chinese Spoken Language Processing, 401-405, 2014 | 7 | 2014 |
StyleSpeech: Self-supervised style enhancing with VQ-VAE-based pre-training for expressive audiobook speech synthesis X Chen, X Wang, S Zhang, L He, Z Wu, X Wu, H Meng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading Y Xiao, S Zhang, X Wang, X Tan, L He, S Zhao, FK Soong, T Lee arXiv preprint arXiv:2307.00782, 2023 | 5 | 2023 |
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023 Z Xu, S Zhang, X Wang, J Zhang, W Wei, L He, S Zhao arXiv preprint arXiv:2309.02743, 2023 | 2 | 2023 |
Paragraph synthesis with cross utterance features for neural TTS S Zhang, L He US Patent App. 17/631,695, 2022 | 1 | 2022 |
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation. S Zhang, DY Huang, L Xie, E Chng, H Li, M Dong INTERSPEECH, 1498-1502, 2015 | 1 | 2015 |
Large-Scale Automatic Audiobook Creation B Walsh, M Hamilton, G Newby, X Wang, S Ruan, S Zhao, L He, S Zhang, ... arXiv preprint arXiv:2309.03926, 2023 | | 2023 |