The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines J Barker, S Watanabe, E Vincent, J Trmal arXiv preprint arXiv:1803.10609, 2018 | 430 | 2018 |
Improving deep neural network acoustic models using generalized maxout networks X Zhang, J Trmal, D Povey, S Khudanpur 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 406 | 2014 |
A pitch extraction algorithm tuned for automatic speech recognition P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ... | 394 | 2014 |
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020 | 344 | 2020 |
Multi-task self-supervised learning for Robust Speech Recognition M Ravanelli, J Zhong, S Pascual, P Swietojanski, J Monteiro, J Trmal, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 324 | 2020 |
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021 | 220 | 2021 |
Using proxies for OOV keywords in the keyword search task G Chen, O Yilmaz, J Trmal, D Povey, S Khudanpur 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 416-421, 2013 | 125 | 2013 |
Quantifying the value of pronunciation lexicons for keyword search in low resource languages G Chen, S Khudanpur, D Povey, J Trmal, D Yarowsky, O Yilmaz Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International …, 2013 | 66 | 2013 |
A keyword search system using open source software J Trmal, G Chen, D Povey, S Khudanpur, P Ghahremani, X Zhang, ... Proceedings of SLT 2014; (accepted), 2014 | 52 | 2014 |
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ... INTERSPEECH, 3597-3601, 2017 | 48 | 2017 |
DiPCo--Dinner Party Corpus M Van Segbroeck, A Zaid, K Kutsenko, C Huerta, T Nguyen, X Luo, ... arXiv preprint arXiv:1909.13447, 2019 | 47 | 2019 |
Adaptation of a feedforward artificial neural network using a linear transform J Trmal, J Zelinka, L Müller Text, Speech and Dialogue, 423-430, 2010 | 45 | 2010 |
Adversarial Attacks and Defenses for Speech Recognition Systems P Żelasko, S Joshi, Y Shao, J Villalba, J Trmal, N Dehak, S Khudanpur arXiv preprint arXiv:2103.17122, 2021 | 31 | 2021 |
Optimized acoustic likelihoods computation for NVIDIA and ATI/AMD graphics processors J Vaněk, J Trmal, JV Psutka, J Psutka Audio, Speech, and Language Processing, IEEE Transactions on 20 (6), 1818-1828, 2012 | 26 | 2012 |
Using ASR methods for OCR A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ... 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019 | 25 | 2019 |
Lhotse: a speech data representation library for the modern deep learning ecosystem P Żelasko, D Povey, J Trmal, S Khudanpur arXiv preprint arXiv:2110.12561, 2021 | 22 | 2021 |
Topic Identification for Speech without ASR C Liu, J Trmal, M Wiesner, C Harman, S Khudanpur arXiv preprint arXiv:1703.07476, 2017 | 22 | 2017 |
Voice-supported electronic health record for temporomandibular joint disorders. R Hippmann, T Dostálová, J Zvárová, M Nagy, M Seydlová, P Hanzlicek, ... Methods of information in medicine 49 (2), 168-172, 2009 | 22 | 2009 |
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs. A Prazák, Z Loose, J Trmal, JV Psutka, J Psutka INTERSPEECH, 2012 | 21 | 2012 |
Combination of FST and CN search in spoken term detection J Chiu, Y Wang, J Trmal, D Povey, G Chen, A Rudnicky Proc. Interspeech, 2014 | 20 | 2014 |