Kyu Jeong Han
Amazon Web Services (AWS)
Verified email at amazon.com
Title
Cited by
Year
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
Cited by: 263 | Year: 2022
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion
M Li, KJ Han, S Narayanan
Computer Speech & Language 27 (1), 151-167, 2013
Cited by: 228 | Year: 2013
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap
TJ Park, KJ Han, M Kumar, S Narayanan
IEEE Signal Processing Letters 27, 381-385, 2019
Cited by: 113 | Year: 2019
The CAPIO 2017 conversational speech recognition system
KJ Han, A Chandrashekaran, J Kim, I Lane
arXiv preprint arXiv:1801.00059, 2017
Cited by: 88 | Year: 2017
Strategies to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization
KJ Han, S Kim, SS Narayanan
IEEE Transactions on Audio, Speech, and Language Processing 16 (8), 1590-1601, 2008
Cited by: 81 | Year: 2008
State-of-the-art speech recognition using multi-stream self-attention with dilated 1d convolutions
KJ Han, R Prieto, T Ma
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 54-61, 2019
Cited by: 73 | Year: 2019
Robust language identification using convolutional neural network features
S Ganapathy, K Han, S Thomas, M Omar, MV Segbroeck, SS Narayanan
Fifteenth annual conference of the international speech communication …, 2014
Cited by: 68 | Year: 2014
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system.
KJ Han, SS Narayanan
Interspeech, 1853-1856, 2007
Cited by: 58 | Year: 2007
E-branchformer: Branchformer with enhanced merging for speech recognition
K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe
2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023
Cited by: 44 | Year: 2023
Slue: New benchmark tasks for spoken language understanding evaluation on natural speech
S Shon, A Pasad, F Wu, P Brusco, Y Artzi, K Livescu, KJ Han
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 44 | Year: 2022
Combining five acoustic level modeling methods for automatic speaker age and gender recognition
M Li, CS Jung, KJ Han
Eleventh annual conference of the international speech communication association, 2010
Cited by: 44 | Year: 2010
Multistream CNN for robust acoustic modeling
KJ Han, J Pan, VKN Tadala, T Ma, D Povey
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 42 | Year: 2021
Deep Learning-Based Telephony Speech Recognition in the Wild
KJ Han, S Hahm, BH Kim, J Kim, IR Lane
INTERSPEECH, 1323-1327, 2017
Cited by: 37 | Year: 2017
Speaker diarization with lexical information
TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan
arXiv preprint arXiv:2004.06756, 2020
Cited by: 35 | Year: 2020
ASAPP-ASR: Multistream CNN and self-attentive SRU for SOTA speech recognition
J Pan, J Shapiro, J Wohlwend, KJ Han, T Lei, T Ma
arXiv preprint arXiv:2005.10469, 2020
Cited by: 32 | Year: 2020
Performance-efficiency trade-offs in unsupervised pre-training for speech recognition
F Wu, K Kim, J Pan, KJ Han, KQ Weinberger, Y Artzi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 31 | Year: 2022
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling
KJ Han, SS Narayanan
Ninth Annual Conference of the International Speech Communication Association, 2008
Cited by: 30 | Year: 2008
Identifying a driver of a vehicle
SV Myers, S Elwart, WJ Talamonti, JT Mullen, ZD Nelson, T Smith, ...
US Patent 9,707,911, 2017
Cited by: 23 | Year: 2017
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering
KJ Han, SS Narayanan
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
Cited by: 22 | Year: 2008
Multi-Stride Self-Attention for Speech Recognition.
KJ Han, J Huang, Y Tang, X He, B Zhou
Interspeech, 2788-2792, 2019
Cited by: 20 | Year: 2019
Articles 1–20