Neural Dialogue Context Online End-of-Turn Detection R Masumura, T Tanaka, A Ando, R Ishii, R Higashinaka, Y Aono Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue …, 2018 | 35 | 2018 |
Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
Large Context End-to-end Automatic Speech Recognition via Extension of Hierarchical Recurrent Encoder-decoder Models R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 34 | 2019 |
Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy T Moriya, T Tanaka, T Shinozaki, S Watanabe, K Duh 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 31 | 2015 |
Neural Error Corrective Language Models for Automatic Speech Recognition. T Tanaka, R Masumura, H Masataki, Y Aono Interspeech, 401-405, 2018 | 29 | 2018 |
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020 | 23 | 2020 |
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models T Ashihara, T Moriya, K Matsuura, T Tanaka arXiv preprint arXiv:2207.06867, 2022 | 22 | 2022 |
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. T Tanaka, R Masumura, T Moriya, T Oba, Y Aono INTERSPEECH, 2210-2214, 2019 | 20 | 2019 |
Automated structure discovery and parameter tuning of neural network language model based on evolution strategy T Tanaka, T Moriya, T Shinozaki, S Watanabe, T Hori, K Duh 2016 IEEE Spoken Language Technology Workshop (SLT), 665-671, 2016 | 20 | 2016 |
Evolution-strategy-based automation of system development for high-performance speech recognition T Moriya, T Tanaka, T Shinozaki, S Watanabe, K Duh IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (1), 77-88, 2018 | 16 | 2018 |
Multi-task and Multi-lingual Joint Learning of Neural Lexical Utterance Classification based on Partially-shared Modeling R Masumura, T Tanaka, R Higashinaka, H Masataki, Y Aono Proceedings of the 27th International Conference on Computational …, 2018 | 14 | 2018 |
Distilling Attention Weights for CTC-Based ASR Systems T Moriya, H Sato, T Tanaka, T Ashihara, R Masumura, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 13 | 2020 |
Neural Speech-to-Text Language Models for Rescoring Hypotheses of DNN-HMM Hybrid Automatic Speech Recognition Systems T Tanaka, R Masumura, T Moriya, Y Aono 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 12 | 2018 |
Evolutionary optimization of long short-term memory neural network language model T Tanaka, T Moriya, T Shinozaki, S Watanabe, T Hori, K Duh The Journal of the Acoustical Society of America 140 (4), 3062-3062, 2016 | 10 | 2016 |
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition. R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi INTERSPEECH, 2822-2826, 2020 | 9 | 2020 |
Role Play Dialogue Aware Language Models Based on Conditional Hierarchical Recurrent Encoder-Decoder. R Masumura, T Tanaka, A Ando, H Masataki, Y Aono Interspeech, 1259-1263, 2018 | 9 | 2018 |
Leveraging Large Text Corpora For End-To-End Speech Summarization K Matsuura, T Ashihara, T Moriya, T Tanaka, A Ogawa, M Delcroix, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition T Tanaka, R Masumura, M Ihori, A Takashima, T Moriya, T Ashihara, ... arXiv preprint arXiv:2107.01569, 2021 | 8 | 2021 |
SimpleFlat: A Simple Whole-Network Pre-Training Approach for RNN Transducer-Based End-to-End Speech Recognition T Moriya, T Ashihara, T Tanaka, T Ochiai, H Sato, A Ando, Y Ijima, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi, R Masumura ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |