fairseq: A Fast, Extensible Toolkit for Sequence Modeling M Ott, S Edunov, A Baevski, A Fan, S Gross, N Ng, D Grangier, M Auli arXiv preprint arXiv:1904.01038, 2019 | 2271 | 2019 |
Language modeling with gated convolutional networks YN Dauphin, A Fan, M Auli, D Grangier Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017 | 2037 | 2017 |
Hierarchical Neural Story Generation A Fan, M Lewis, Y Dauphin arXiv preprint arXiv:1805.04833, 2018 | 1020 | 2018 |
Wizard of Wikipedia: Knowledge-Powered Conversational agents E Dinan, S Roller, K Shuster, A Fan, M Auli, J Weston arXiv preprint arXiv:1811.01241, 2018 | 662 | 2018 |
Pay Less Attention with Lightweight and Dynamic Convolutions F Wu, A Fan, A Baevski, YN Dauphin, M Auli arXiv preprint arXiv:1901.10430, 2019 | 539 | 2019 |
Reducing Transformer Depth on Demand with Structured Dropout A Fan, E Grave, A Joulin arXiv preprint arXiv:1909.11556, 2019 | 392 | 2019 |
Beyond english-centric multilingual machine translation A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ... Journal of Machine Learning Research 22 (107), 1-48, 2021 | 362 | 2021 |
Controllable abstractive summarization A Fan, D Grangier, M Auli arXiv preprint arXiv:1711.05217, 2017 | 274 | 2017 |
Multilingual Translation from Denoising Pre-Training Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021 | 255* | 2021 |
KILT: a benchmark for knowledge intensive language tasks F Petroni, A Piktus, A Fan, P Lewis, M Yazdani, N De Cao, J Thorne, ... arXiv preprint arXiv:2009.02252, 2020 | 241 | 2020 |
ELI5: Long Form Question Answering A Fan, Y Jernite, E Perez, D Grangier, J Weston, M Auli arXiv preprint arXiv:1907.09190, 2019 | 230 | 2019 |
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022 | 211 | 2022 |
Strategies for Structuring Story Generation A Fan, M Lewis, Y Dauphin arXiv preprint arXiv:1902.01109, 2019 | 188 | 2019 |
Training with quantization noise for extreme model compression A Fan, P Stock, B Graham, E Grave, R Gribonval, H Jégou, A Joulin arXiv e-prints, arXiv: 2004.07320, 2020 | 161 | 2020 |
Nearest Neighbor Machine Translation U Khandelwal, A Fan, D Jurafsky, L Zettlemoyer, M Lewis arXiv preprint arXiv:2010.00710, 2020 | 157 | 2020 |
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ... Transactions of the Association for Computational Linguistics 10, 522-538, 2022 | 148 | 2022 |
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web H Schwenk, G Wenzek, S Edunov, E Grave, A Joulin, A Fan Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 143 | 2021 |
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation E Dinan, A Fan, A Williams, J Urbanek, D Kiela, J Weston arXiv preprint arXiv:1911.03842, 2019 | 135 | 2019 |
Learning to Speak and Act in a Fantasy Text Adventure Game J Urbanek, A Fan, S Karamcheti, S Jain, S Humeau, E Dinan, ... arXiv preprint arXiv:1903.03094, 2019 | 131 | 2019 |
Integration of responses within and across Arabidopsis natural accessions uncovers loci controlling root systems architecture U Rosas, A Cibrian-Jaramillo, D Ristova, JA Banta, ML Gifford, AH Fan, ... Proceedings of the National Academy of Sciences 110 (37), 15133-15138, 2013 | 98 | 2013 |