Follow
Keyu An
Keyu An
Alibaba
Verified email at alibaba-inc.com
Title
Cited by
Cited by
Year
Sequential Deformation for Accurate Scene Text Detection
S Xiao, L Peng, R Yan, K An, G Yao, J Min
ECCV, 108-124, 2020
322020
CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency
K An, H Xiang, Z Ou
INTERSPEECH, 566-570, 2020
212020
Efficient neural architecture search for end-to-end speech recognition via straight-through gradients
H Zheng, K An, Z Ou
2021 IEEE Spoken Language Technology Workshop (SLT), 60-67, 2021
202021
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines
F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao
2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021
182021
CUSIDE: chunking, simulating future context and decoding for streaming ASR
K An, H Zheng, Z Ou, H Xiang, K Ding, G Wan
arXiv preprint arXiv:2203.16758, 2022
162022
An empirical study of language model integration for transducer based speech recognition
H Zheng, K An, Z Ou, C Huang, K Ding, G Wan
arXiv preprint arXiv:2203.16776, 2022
72022
Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
C Zhu, K An, H Zheng, Z Ou
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
62021
CAT: crf-based ASR toolkit
K An, H Xiang, Z Ou
arXiv preprint arXiv:1911.08747, 2019
62019
Deformable TDNN with adaptive receptive fields for speech recognition
K An, Y Zhang, Z Ou
arXiv preprint arXiv:2104.14791, 2021
52021
BAT: Boundary aware transducer for memory-efficient and low-latency ASR
K An, X Shi, S Zhang
arXiv preprint arXiv:2305.11571, 2023
32023
Exploiting single-channel speech for multi-channel end-to-end speech recognition: A comparative study
K An, J Xiao, Z Ou
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
22022
Exploring RWKV for Memory Efficient and Low Latency Streaming ASR
K An, S Zhang
arXiv preprint arXiv:2309.14758, 2023
12023
Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures
L Zuo, K An, S Zhang, Z Yan
arXiv preprint arXiv:2312.14860, 2023
2023
Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition
K An, Z Ou
arXiv preprint arXiv:2107.02670, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–14