Gpt-neox-20b: An open-source autoregressive language model S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 2022 | 513 | 2022 |
Pythia: A suite for analyzing large language models across training and scaling S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ... International Conference on Machine Learning, 2397-2430, 2023 | 443 | 2023 |
Roentgen: Vision-language foundation model for chest x-ray generation P Chambon, C Bluethgen, JB Delbrouck, R Van der Sluijs, M Połacin, ... arXiv preprint arXiv:2211.12737, 2022 | 59 | 2022 |
Emergent and predictable memorization in large language models S Biderman, U PRASHANTH, L Sutawika, H Schoelkopf, Q Anthony, ... Advances in Neural Information Processing Systems 36, 2024 | 56 | 2024 |
Gpt-neox-20b: An open-source autoregressive language model, 2022 S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... URL https://arxiv. org/abs/2204.06745, 2022 | 27 | 2022 |
GPT-NeoX-20B: An open-source autoregressive language model. arXiv S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 2022 | 8 | 2022 |