Recent developments on espnet toolkit boosted by conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 290 | 2021 |
Vlp: A survey on vision-language pre-training FL Chen, DZ Zhang, ML Han, XY Chen, J Shi, S Xu, B Xu Machine Intelligence Research 20 (1), 38-56, 2023 | 173 | 2023 |
An exploration of self-supervised pretrained representations for end-to-end speech recognition X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 82 | 2021 |
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021 | 82 | 2021 |
X-llm: Bootstrapping advanced large language models by treating multi-modalities as foreign languages F Chen, M Han, H Zhao, Q Zhang, J Shi, S Xu, B Xu arXiv preprint arXiv:2305.04160, 2023 | 76 | 2023 |
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... 2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021 | 54 | 2021 |
Neural speaker diarization with speaker-wise chain rule Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu arXiv preprint arXiv:2006.01796, 2020 | 43 | 2020 |
Speaker-conditional chain model for speech separation and extraction J Shi, J Xu, Y Fujita, S Watanabe, B Xu arXiv preprint arXiv:2006.14149, 2020 | 28 | 2020 |
Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation. J Shi, J Xu, G Liu, B Xu IJCAI, 4353-4360, 2018 | 28 | 2018 |
Modeling attention and memory for auditory selection in a cocktail party environment J Xu, J Shi, G Liu, X Chen, B Xu Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 28 | 2018 |
Distilled binary neural network for monaural speech separation X Chen, G Liu, J Shi, J Xu, B Xu 2018 International Joint Conference on Neural Networks (IJCNN), 1-8, 2018 | 27 | 2018 |
Sequence to multi-sequence learning via conditional chain mapping for mixture signals J Shi, X Chang, P Guo, S Watanabe, Y Fujita, J Xu, B Xu, L Xie Advances in Neural Information Processing Systems 33, 3735-3747, 2020 | 25 | 2020 |
Closing the gap between time-domain multi-channel speech enhancement on real and simulation conditions W Zhang, J Shi, C Li, S Watanabe, Y Qian 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021 | 24 | 2021 |
Discretization and re-synthesis: an alternative method to solve the cocktail party problem J Shi, X Chang, T Hayashi, YJ Lu, S Watanabe, B Xu arXiv preprint arXiv:2112.09382, 2021 | 18 | 2021 |
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments. Y Hao, J Xu, J Shi, P Zhang, L Qin, B Xu Interspeech, 1431-1435, 2020 | 18 | 2020 |
Ensemble of feature sets and classification methods for stance detection J Xu, S Zheng, J Shi, Y Yao, B Xu Natural Language Understanding and Intelligent Applications: 5th CCF …, 2016 | 18 | 2016 |
Hierarchical memory networks for answer selection on unknown words J Xu, J Shi, Y Yao, S Zheng, B Xu arXiv preprint arXiv:1609.08843, 2016 | 17 | 2016 |
Train from scratch: Single-stage joint training of speech separation and recognition J Shi, X Chang, S Watanabe, B Xu Computer Speech & Language 76, 101387, 2022 | 12 | 2022 |
Unsupervised and pseudo-supervised vision-language alignment in visual dialog F Chen, D Zhang, X Chen, J Shi, S Xu, B Xu Proceedings of the 30th ACM International Conference on Multimedia, 4142-4153, 2022 | 12 | 2022 |
Training noisy single-channel speech separation with noisy oracle sources: A large gap and a small step M Maciejewski, J Shi, S Watanabe, S Khudanpur ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 12 | 2021 |