Kainan Peng

Cited by

	All	Since 2019
Citations	2707	2536
h-index	11	11
i10-index	12	12

Citations per year
Year	Citations
2017	15
2018	138
2019	370
2020	441
2021	547
2022	556
2023	510
2024	102

Co-authors

Wei Ping: Principal Research Scientist, NVIDIA (verified email at nvidia.com)
Sercan O. Arik: Google (verified email at google.com)
Gregory Diamos: Landing AI (verified email at landing.ai)
Yanqi Zhou: Google (verified email at google.com)
Sharan Narang: Research Engineer, Meta AI (verified email at meta.com)
Ajay Kannan: Google (verified email at google.com)
Jitong Chen: ByteDance (verified email at cse.ohio-state.edu)

Kainan Peng

Amazon

Verified email at alumni.cmu.edu

Text-to-Speech, Computer Engineering, Machine Learning


Title	Cited by	Year
Deep Voice 3: Scaling text-to-speech with convolutional sequence learning; W Ping, K Peng, A Gibiansky, SO Arik, A Kannan, S Narang, J Raiman, ...; ICLR 2018, 2018	842*	2018
Deep voice 2: Multi-speaker neural text-to-speech; A Gibiansky, S Arik, G Diamos, J Miller, K Peng, W Ping, J Raiman, ...; NIPS 2017, 2962-2970, 2017	614*	2017
Neural voice cloning with a few samples; S Arik, J Chen, K Peng, W Ping, Y Zhou; NeurIPS 2018, 10019-10029, 2018	395	2018
ClariNet: Parallel wave generation in end-to-end text-to-speech; W Ping, K Peng, J Chen; ICLR 2019, 2018	393	2018
Non-Autoregressive Neural Text-to-Speech; K Peng, W Ping, Z Song, K Zhao; ICML 2020, 2019	147*	2019
WaveFlow: A Compact Flow-based Model for Raw Audio; W Ping, K Peng, K Zhao, Z Song; ICML 2020, 2019	123	2019
Systems and methods for multi-speaker neural text-to-speech; G DIAMOS, A GIBIANSKY, J Miller, P Kainan, P Wei, J RAIMAN, Z Yanqi; US Patent 10,896,669, 2021	62	2021
Systems and methods for neural text-to-speech using convolutional sequence learning; P Wei, P Kainan, S NARANG, A KANNAN, A GIBIANSKY, J RAIMAN, ...; US Patent 10,796,686, 2020	38	2020
Incremental text-to-speech synthesis with prefix-to-prefix framework; M Ma, B Zheng, K Liu, R Zheng, H Liu, K Peng, K Church, L Huang; arXiv preprint arXiv:1911.02750, 2019	35	2019
Multi-speaker end-to-end speech synthesis; J Park, K Zhao, K Peng, W Ping; arXiv preprint arXiv:1907.04462, 2019	19	2019
Systems and methods for parallel wave generation in end-to-end text-to-speech; P Wei, P Kainan, C Jitong; US Patent 10,872,596, 2020	17	2020
Parallel neural text-to-speech; P Kainan, P Wei, S Zhao, Z Kexin; US Patent 11,017,761, 2021	10	2021
Neural voice cloning with a few samples; OA Sercan, C Jitong, P Kainan, P Wei, Y Zhou; Proc. 32nd Int. Conf. Neural Inf. Process. Syst., 10040-10050, 2018	5	2018
Systems and methods for neural voice cloning with a few samples; C Jitong, P Kainan, P Wei, Z Yanqi; US Patent 11,238,843, 2022	4	2022
Waveform generation using end-to-end text-to-waveform system; P Wei, P Kainan, C Jitong; US Patent 11,482,207, 2022	2	2022
Small-footprint flow-based models for raw audio; P Wei, P Kainan, Z Kexin, S Zhao; US Patent 11,521,592, 2022	1	2022
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing; P Anastassiou, Z Tang, K Peng, D Jia, J Li, M Tu, Y Wang, Y Wang, M Ma; arXiv preprint arXiv:2404.06674, 2024		2024
Multi-speaker neural text-to-speech; G DIAMOS, A GIBIANSKY, J Miller, P Kainan, P Wei, J RAIMAN, Z Yanqi; US Patent 11,651,763, 2023		2023
Zero-Shot Accent Conversion using Pseudo Siamese Disentanglement Network; D Jia, Q Tian, K Peng, J Li, Y Chen, M Ma, Y Wang, Y Wang; arXiv preprint arXiv:2212.05751, 2022		2022

Articles 1–19
