Yuancheng Wang

Cited by

	All	Since 2020
Citations	265	265
h-index	8	8
i10-index	7	7

240

120

180

20232024202523 235 6

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Zhizheng WuThe Chinese University of Hong Kong, Shenzhen, Mel LabVerified email at cuhk.edu.cn
Zeqian JuUniversity of Science and Technology of ChinaVerified email at mail.ustc.edu.cn
Xu TanPrincipal Researcher and Research Manager, MicrosoftVerified email at microsoft.com
Lei HePrincipal Scientist Manager, MicrosoftVerified email at microsoft.com
Kai Shen (沈锴)Zhejiang UniversityVerified email at zju.edu.cn
Haorui He（何昊睿）The University of Hong KongVerified email at connect.hku.hk
Yicheng GuAalto UniversityVerified email at aalto.fi
Jiaqi LiThe Chinese University of Hong Kong, ShenzhenVerified email at link.cuhk.edu.cn
Xueyao ZhangThe Chinese University of Hong Kong, ShenzhenVerified email at link.cuhk.edu.cn
Liumeng XueHong Kong University of Science and TechnologyVerified email at ust.hk

Yuancheng Wang

The Chinese University of Hong Kong, Shenzhen

Verified email at link.cuhk.edu.cn - Homepage

Deep Learning Speech Synthesis Music Generation Audio Generation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Naturalspeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... International Conference on Machine Learning (ICML 2024), 2024	109	2024
Audit: Audio editing by following instructions with latent diffusion models Y Wang, Z Ju, X Tan, L He, Z Wu, J Bian Advances in Neural Information Processing Systems (NeurIPS 2023), 2023	44	2023
Automated testing of image captioning systems B Yu, Z Zhong, X Qin, J Yao, Y Wang, P He Proceedings of the 31st ACM SIGSOFT International Symposium on Software …, 2022	26	2022
Amphion: An open-source audio, music and speech generation toolkit X Zhang, L Xue, Y Gu, Y Wang, J Li*, H He, C Wang, S Liu, X Chen, ... IEEE Spoken Language Technology Workshop (SLT 2024), 2023	21	2023
Foleycrafter: Bring silent videos to life with lifelike and synchronized sounds Y Zhang, Y Gu, Y Zeng, Z Xing, Y Wang, Z Wu, K Chen arXiv preprint arXiv:2407.01494, 2024	19	2024
Rall-e: Robust codec language modeling with chain-of-thought prompting for text-to-speech synthesis D Xin, X Tan, K Shen, Z Ju, D Yang, Y Wang, S Takamichi, H Saruwatari, ... arXiv preprint arXiv:2404.03204, 2024	19	2024
Emilia: An extensive, multilingual, and diverse speech dataset for large-scale speech generation H He, Z Shang, C Wang, X Li, Y Gu, H Hua, L Liu, C Yang, J Li, P Shi, ... IEEE Spoken Language Technology Workshop (SLT 2024), 2024	14	2024
Maskgct: Zero-shot text-to-speech with masked generative codec transformer Y Wang, H Zhan, L Liu, R Zeng, H Guo, J Zheng, Q Zhang, X Zhang, ... arXiv preprint arXiv:2409.00750, 2024	8	2024
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words J Ao, Y Wang, X Tian, D Chen, J Zhang, L Lu, Y Wang, H Li, Z Wu Advances in Neural Information Processing Systems (NeurIPS 2024), 2024	4	2024
Debatts: Zero-Shot Debating Text-to-Speech Synthesis Y Huang, Y Wang, J Li, H Guo, H He, S Zhang, Z Wu arXiv preprint arXiv:2411.06540, 2024	1	2024
Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities H He, Y Song, Y Wang, H Li, X Zhang, L Wang, G Huang, ES Chng, Z Wu arXiv preprint arXiv:2411.19770, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–11

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors