Bloom: A 176b-parameter open-access multilingual language model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022 | 497 | 2022 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... TMLR, 2022 | 361 | 2022 |
A framework for few-shot language model evaluation L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ... GitHub Repository, 2021 | 149* | 2021 |
Crosslingual generalization through multitask finetuning N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ... ACL 2023, 2022 | 111 | 2022 |
StarCoder: may the source be with you! R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ... arXiv preprint arXiv:2305.06161, 2023 | 69* | 2023 |
SantaCoder: don't reach for the stars! LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ... ICLR 2023, 2023 | 42* | 2023 |
SGPT: GPT sentence embeddings for semantic search N Muennighoff arXiv preprint arXiv:2202.08904, 2022 | 41 | 2022 |
Nl-augmenter: A framework for task-sensitive natural language augmentation KD Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ... NEJLT 2023, 2021 | 41 | 2021 |
The hateful memes challenge: Competition report D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ... NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021 | 40 | 2021 |
Vilio: state-of-the-art Visio-Linguistic models applied to hateful memes N Muennighoff NeurIPS 2020 Competition and Demonstration Track, 344-360, 2020 | 40 | 2020 |
What Language Model to Train if You Have One Million GPU Hours? TL Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, S Bideman, ... EMNLP 2022, 2022 | 36 | 2022 |
MTEB: Massive text embedding benchmark N Muennighoff, N Tazi, L Magne, N Reimers EACL 2023, 2022 | 32 | 2022 |
BLOOM+ 1: Adding Language Support to BLOOM for Zero-Shot Prompting ZX Yong, H Schoelkopf, N Muennighoff, AF Aji, DI Adelani, K Almubarak, ... ACL 2023, 2022 | 12 | 2022 |
Scaling Data-Constrained Language Models N Muennighoff, AM Rush, B Barak, TL Scao, A Piktus, N Tazi, S Pyysalo, ... NeurIPS 2023, 2023 | 8 | 2023 |
Octopack: Instruction tuning code large language models N Muennighoff, Q Liu, A Zebaze, Q Zheng, B Hui, TY Zhuo, S Singh, ... arXiv preprint arXiv:2308.07124, 2023 | 3 | 2023 |
A framework for the evaluation of code generation models LB Allal, N Muennighoff, L Von Werra | 2 | 2022 |
C-Pack: Packaged Resources To Advance General Chinese Embedding S Xiao, Z Liu, P Zhang, N Muennighoff arXiv preprint arXiv:2309.07597, 2023 | | 2023 |
Diagnosing the Impact of AI on Radiology in China N Muennighoff Peking University, 2021 | | 2021 |
The Data Provenance Project S Longpre, R Mahari, N Muennighoff, A Chen, K Perisetla, W Brannon, ... | | |