关注
Shujie Hu
Shujie Hu
在 se.cuhk.edu.hk 的电子邮件经过验证
标题
引用次数
引用次数
年份
Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition
M Geng, X Xie, Z Ye, T Wang, G Li, S Hu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2597-2611, 2022
222022
Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition
S Hu, S Liu, X Xie, M Geng, T Wang, S Hu, M Cui, X Liu, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
122022
Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition
S Hu, X Xie, Z Jin, M Geng, Y Wang, M Cui, J Deng, X Liu, H Meng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
112023
Adversarial data augmentation using vae-gan for disordered speech recognition
Z Jin, X Xie, M Geng, T Wang, S Hu, J Deng, G Li, X Liu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
92023
Personalized adversarial data augmentation for dysarthric and elderly speech recognition
Z Jin, M Geng, J Deng, T Wang, S Hu, G Li, X Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
82023
Two-pass decoding and cross-adaptation based system combination of end-to-end conformer and hybrid tdnn asr systems
M Cui, J Deng, S Hu, X Xie, T Wang, S Hu, M Geng, B Xue, X Liu, H Meng
arXiv preprint arXiv:2206.11596, 2022
72022
Confidence score based speaker adaptation of conformer speech recognition systems
J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, G Li, S Hu, X Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1175-1190, 2023
62023
Audio-visual end-to-end multi-channel speech separation, dereverberation and recognition
G Li, J Deng, M Geng, Z Jin, T Wang, S Hu, M Cui, H Meng, X Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
52023
Exploiting cross-domain and cross-lingual ultrasound tongue imaging features for elderly and dysarthric speech recognition
S Hu, X Xie, M Geng, M Cui, J Deng, G Li, T Wang, X Liu, H Meng
arXiv preprint arXiv:2206.07327, 2022
52022
Boosting large language model for speech synthesis: An empirical study
H Hao, L Zhou, S Liu, J Li, S Hu, R Wang, F Wei
arXiv preprint arXiv:2401.00246, 2023
32023
Use of Speech Impairment Severity for Dysarthric Speech Recognition
M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui, G Li, J Yu, X Xie, X Liu
arXiv preprint arXiv:2305.10659, 2023
22023
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition
M Geng, X Xie, R Su, J Yu, Z Jin, T Wang, S Hu, Z Ye, H Meng, X Liu
arXiv preprint arXiv:2203.14593, 2022
12022
Towards High-Performance and Low-Latency Feature-Based Speaker Adaptation of Conformer Speech Recognition Systems
J Deng, X Xie, G Li, M Cui, M Geng, Z Jin, T Wang, S Hu, Z Li, X Liu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Towards Automatic Data Augmentation for Disordered Speech Recognition
Z Jin, X Xie, T Wang, M Geng, J Deng, G Li, S Hu, X Liu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
WavLLM: Towards Robust and Adaptive Speech Large Language Model
S Hu, L Zhou, S Liu, S Chen, H Hao, J Pan, X Liu, J Li, S Sivasankaran, ...
arXiv preprint arXiv:2404.00656, 2024
2024
Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation
H Wang, Z Jin, M Geng, S Hu, G Li, T Wang, H Xu, X Liu
arXiv preprint arXiv:2401.00662, 2024
2024
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
J Deng, G Li, X Xie, Z Jin, M Cui, T Wang, S Hu, M Geng, X Liu
arXiv preprint arXiv:2306.14608, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–17