Sequential Deformation for Accurate Scene Text Detection S Xiao, L Peng, R Yan, K An, G Yao, J Min ECCV, 108-124, 2020 | 32 | 2020 |
CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency K An, H Xiang, Z Ou INTERSPEECH, 566-570, 2020 | 21 | 2020 |
Efficient neural architecture search for end-to-end speech recognition via straight-through gradients H Zheng, K An, Z Ou 2021 IEEE Spoken Language Technology Workshop (SLT), 60-67, 2021 | 20 | 2021 |
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao 2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021 | 18 | 2021 |
CUSIDE: chunking, simulating future context and decoding for streaming ASR K An, H Zheng, Z Ou, H Xiang, K Ding, G Wan arXiv preprint arXiv:2203.16758, 2022 | 16 | 2022 |
An empirical study of language model integration for transducer based speech recognition H Zheng, K An, Z Ou, C Huang, K Ding, G Wan arXiv preprint arXiv:2203.16776, 2022 | 7 | 2022 |
Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings C Zhu, K An, H Zheng, Z Ou 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 6 | 2021 |
CAT: crf-based ASR toolkit K An, H Xiang, Z Ou arXiv preprint arXiv:1911.08747, 2019 | 6 | 2019 |
Deformable TDNN with adaptive receptive fields for speech recognition K An, Y Zhang, Z Ou arXiv preprint arXiv:2104.14791, 2021 | 5 | 2021 |
BAT: Boundary aware transducer for memory-efficient and low-latency ASR K An, X Shi, S Zhang arXiv preprint arXiv:2305.11571, 2023 | 3 | 2023 |
Exploiting single-channel speech for multi-channel end-to-end speech recognition: A comparative study K An, J Xiao, Z Ou 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 2 | 2022 |
Exploring RWKV for Memory Efficient and Low Latency Streaming ASR K An, S Zhang arXiv preprint arXiv:2309.14758, 2023 | 1 | 2023 |
Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures L Zuo, K An, S Zhang, Z Yan arXiv preprint arXiv:2312.14860, 2023 | | 2023 |
Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition K An, Z Ou arXiv preprint arXiv:2107.02670, 2021 | | 2021 |