Niklas Muennighoff
Cited by
Cited by
Bloom: A 176b-parameter open-access multilingual language model
TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
arXiv preprint arXiv:2211.05100, 2022
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
TMLR, 2022
A framework for few-shot language model evaluation
L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ...
GitHub Repository, 2021
Crosslingual generalization through multitask finetuning
N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ...
ACL 2023, 2022
StarCoder: may the source be with you!
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 2023
SantaCoder: don't reach for the stars!
LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ...
ICLR 2023, 2023
SGPT: GPT sentence embeddings for semantic search
N Muennighoff
arXiv preprint arXiv:2202.08904, 2022
Nl-augmenter: A framework for task-sensitive natural language augmentation
KD Dhole, V Gangal, S Gehrmann, A Gupta, Z Li, S Mahamood, ...
NEJLT 2023, 2021
The hateful memes challenge: Competition report
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ...
NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021
Vilio: state-of-the-art Visio-Linguistic models applied to hateful memes
N Muennighoff
NeurIPS 2020 Competition and Demonstration Track, 344-360, 2020
What Language Model to Train if You Have One Million GPU Hours?
TL Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, S Bideman, ...
EMNLP 2022, 2022
MTEB: Massive text embedding benchmark
N Muennighoff, N Tazi, L Magne, N Reimers
EACL 2023, 2022
BLOOM+ 1: Adding Language Support to BLOOM for Zero-Shot Prompting
ZX Yong, H Schoelkopf, N Muennighoff, AF Aji, DI Adelani, K Almubarak, ...
ACL 2023, 2022
Scaling Data-Constrained Language Models
N Muennighoff, AM Rush, B Barak, TL Scao, A Piktus, N Tazi, S Pyysalo, ...
NeurIPS 2023, 2023
Octopack: Instruction tuning code large language models
N Muennighoff, Q Liu, A Zebaze, Q Zheng, B Hui, TY Zhuo, S Singh, ...
arXiv preprint arXiv:2308.07124, 2023
A framework for the evaluation of code generation models
LB Allal, N Muennighoff, L Von Werra
C-Pack: Packaged Resources To Advance General Chinese Embedding
S Xiao, Z Liu, P Zhang, N Muennighoff
arXiv preprint arXiv:2309.07597, 2023
Diagnosing the Impact of AI on Radiology in China
N Muennighoff
Peking University, 2021
The Data Provenance Project
S Longpre, R Mahari, N Muennighoff, A Chen, K Perisetla, W Brannon, ...
The system can't perform the operation now. Try again later.
Articles 1–19