Audio DistilBERT: a distilled audio BERT for speech representation learning F Yu, J Guo, W Xi, Z Yang, R Jiang, C Zhang 2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021 | 3 | 2021 |
Dual acoustic linguistic self-supervised representation learning for cross-domain speech recognition Z Yang, D Ng, C Zhang, X Fu, R Jiang, W Xi, Y Ma, C Ni, ES Chng, B Ma, ... Proc. Inter-speech 2023, 2023 | 2 | 2023 |
Dual-decoder transformer for end-to-end mandarin chinese speech recognition with pinyin and character Z Yang, W Xi, R Wang, R Jiang, J Zhao arXiv preprint arXiv:2201.10792, 2022 | 2 | 2022 |
Dual-memory multi-modal learning for continual spoken keyword spotting with confidence selection and diversity enhancement Z Yang, D Ng, X Li, C Zhang, R Jiang, W Xi, Y Ma, C Ni, J Zhao, B Ma, ... Proc. INTERSPEECH, 2023 | 1 | 2023 |
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR Z Yang, D Ng, X Fu, L Han, W Xi, R Wang, R Jiang, J Zhao arXiv preprint arXiv:2201.10792, 2022 | 1 | 2022 |
Speech2Stroke: Generate Chinese Character Strokes Directly from Speech Y Zhang, W Xi, Z Yang, S Men, R Jiang, Y Yang, J Zhao International Conference on Collaborative Computing: Networking …, 2020 | | 2020 |
Balanced Multimodal Learning: An Integrated Framework for Multi-Task Learning in Audio-Visual Fusion X Fu, W Xi, J Yang, Y Bai, Z Yang, R Jiang, LI XIZHE, J Gao, J Zhao | | |
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions Z Yang, D Ng, C Zhang, R Jiang, W Xi, Y Ma, C Ni, J Zhao, B Ma, ... | | |