Language ID in the wild: Unexpected challenges on the path to a thousand-language web text corpus I Caswell, T Breiner, D van Esch, A Bapna arXiv preprint arXiv:2010.14571, 2020 | 79 | 2020 |
Building machine translation systems for the next thousand languages A Bapna, I Caswell, J Kreutzer, O Firat, D van Esch, A Siddhant, M Niu, ... arXiv preprint arXiv:2205.03983, 2022 | 69 | 2022 |
Scaling language model size in cross-device federated learning JH Ro, T Breiner, L McConnaughey, M Chen, AT Suresh, S Kumar, ... arXiv preprint arXiv:2204.09715, 2022 | 24 | 2022 |
Writing across the world's languages: Deep internationalization for Gboard, the Google keyboard D van Esch, E Sarbar, T Lucassen, J O'Brien, T Breiner, M Prasad, E Crew, ... arXiv preprint arXiv:1912.01218, 2019 | 24 | 2019 |
UserLibri: A dataset for ASR personalization using only text T Breiner, S Ramaswamy, E Variani, S Garg, R Mathews, KC Sim, ... arXiv preprint arXiv:2207.00706, 2022 | 17 | 2022 |
Mining Training Data for Language Modeling Across the World's Languages. M Prasad, T Breiner, D van Esch SLTU, 61-65, 2018 | 12 | 2018 |
Automatic keyboard layout design for low-resource Latin-script languages T Breiner, C Nguyen, D van Esch, J O'Brien arXiv preprint arXiv:1901.06039, 2019 | 3 | 2019 |
Personalizing Speech Recognition Based on User-entered Text S Ramaswamy, T Breiner, I Pisarev, D Zivkovic, M Chen, R Mathews, ... | 1 | 2022 |
A large scale low-resource pronunciation data set mined from Wikipedia T Chakraborty, M Prasad, T Breiner, S Ritchie, D van Esch arXiv 2101, 2021 | 1 | 2021 |
UserLibri: A Dataset for ASR Personalization with Only Text E Variani, KC Sim, K Gupta, L McConnaughey, M Chen, R Mathews, ... | | 2022 |
Mining Large-Scale Low-Resource Pronunciation Data From Wikipedia T Chakraborty, M Prasad, T Breiner, S Ritchie, D van Esch arXiv preprint arXiv:2101.11575, 2021 | | 2021 |