Follow
William Chan
William Chan
Ideogram
Verified email at ideogram.ai - Homepage
Title
Cited by
Cited by
Year
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le
INTERSPEECH, 2019
36252019
Listen, Attend and Spell: A Neural Network for Large Vocabulary Conversational Speech Recognition
W Chan, N Jaitly, QV Le, O Vinyals
ICASSP, 2016
3105*2016
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
C Saharia, W Chan, S Saxena, L Li, J Whang, E Denton, ...
NeurIPS, 2022
29362022
Image Super-Resolution via Iterative Refinement
C Saharia, J Ho, W Chan, T Salimans, D Fleet, M Norouzi
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
9552022
Palette: Image-to-Image Diffusion Models
C Saharia, W Chan, H Chang, C A. Lee, J Ho, D Tim Salimans, J. Fleet, ...
SIGGRAPH, 2022
7972022
Video Diffusion Models
J Ho, T Salimans, A Gritsenko, W Chan, M Norouzi, D Fleet
arXiv:2204.03458, 2022
6752022
Cascaded Diffusion Models for High Fidelity Image Generation
J Ho, C Saharia, W Chan, D Fleet, M Norouzi, T Salimans
Journal of Machine Learning Research 23 (47), 1-33, 2022
6662022
Imagen Video: High Definition Video Generation with Diffusion Models
J Ho, W Chan, C Saharia, J Whang, R Gao, A Gritsenko, D P. Kingma, ...
arXiv:2210.02303, 2022
6322022
WaveGrad: Estimating Gradients for Waveform Generation
N Chen, Y Zhang, H Zen, R Weiss, M Norouzi, W Chan
ICLR, 2021
5642021
Very Deep Convolutional Networks for End-to-End Speech Recognition
Y Zhang, W Chan, N Jaitly
ICASSP, 2017
5412017
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
INTERSPEECH, 2017
3382017
Insertion Transformer: Flexible Sequence Generation via Insertion Operations
M Stern, W Chan, J Kiros, J Uszkoreit
ICML, 2019
2382019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
1962019
Novel View Synthesis with Diffusion Models
D Watson, W Chan, R Martin-Brualla, J Ho, A Tagliasacchi, M Norouzi
ICLR, 2023
1522023
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
B Li, Y Zhang, T Sainath, Y Wu, W Chan
ICASSP, 2019
1462019
Predicting Collective Sentiment Dynamics from Time-series Social Media
L Nguyen, P Wu, W Chan, W Peng, Y Zhang
SIGKDD WISDOM, 2012
1372012
SpecAugment on Large Scale Datasets
D Park, Y Zhang, CC Chiu, Y Chen, B Li, W Chan, Q Le, Y Wu
ICASSP, 2020
1362020
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ...
IEEE Journal of Selected Topics in Signal Processing, 2021
1342021
Non-Autoregressive Machine Translation with Latent Alignments
C Saharia, W Chan, S Saxena, Norouzi, Mohammad
EMNLP, 2020
1252020
SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network
W Chan, D Park, C Lee, Y Zhang, Q Le, M Norouzi
INTERSPEECH: Workshop on Machine Learning in Speech and Language Processing, 2021
1232021
The system can't perform the operation now. Try again later.
Articles 1–20