Alicia Parrish
Google
Verified email at nyu.edu
Title
Cited by
Year
PaLM 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
Cited by: 699; Year: 2023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
Cited by: 693; Year: 2022
Gemini: a family of highly capable multimodal models
Gemini Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 348; Year: 2023
BLiMP: The benchmark of linguistic minimal pairs for English
A Warstadt, A Parrish, H Liu, A Mohananey, W Peng, SF Wang, ...
Transactions of the Association for Computational Linguistics 8, 377-392, 2020
Cited by: 318; Year: 2020
Investigating BERT's knowledge of language: five analysis methods with NPIs
A Warstadt, Y Cao, I Grosu, W Peng, H Blix, Y Nie, A Alsop, S Bordia, ...
arXiv preprint arXiv:1909.02597, 2019
Cited by: 124; Year: 2019
BBQ: A hand-built bias benchmark for question answering
A Parrish, A Chen, N Nangia, V Padmakumar, J Phang, J Thompson, ...
arXiv preprint arXiv:2110.08193, 2021
Cited by: 121; Year: 2021
DataPerf: Benchmarks for data-centric AI development
M Mazumder, C Banbury, X Yao, B Karlaš, W Gaviria Rojas, S Diamos, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 71; Year: 2024
QuALITY: Question answering with long input texts, yes!
RY Pang, A Parrish, N Joshi, N Nangia, J Phang, A Chen, V Padmakumar, ...
arXiv preprint arXiv:2112.08608, 2021
Cited by: 58; Year: 2021
Inverse Scaling: When Bigger Isn't Better
IR McKenzie, A Lyzhov, M Pieler, A Parrish, A Mueller, A Prabhu, ...
arXiv preprint arXiv:2306.09479, 2023
Cited by: 35; Year: 2023
Does putting a linguist in the loop improve NLU data collection?
A Parrish, W Huang, O Agha, SH Lee, N Nangia, A Warstadt, K Aggarwal, ...
arXiv preprint arXiv:2104.07179, 2021
Cited by: 26; Year: 2021
NOPE: A corpus of naturally-occurring presuppositions in English
A Parrish, S Schuster, A Warstadt, O Agha, SH Lee, Z Zhao, SR Bowman, ...
arXiv preprint arXiv:2109.06987, 2021
Cited by: 21; Year: 2021
What do NLP researchers believe? Results of the NLP community metasurvey
J Michael, A Holtzman, A Parrish, A Mueller, A Wang, A Chen, D Madaan, ...
arXiv preprint arXiv:2208.12852, 2022
Cited by: 18; Year: 2022
Single-turn debate does not help humans answer hard reading-comprehension questions
A Parrish, H Trivedi, E Perez, A Chen, N Nangia, J Phang, SR Bowman
arXiv preprint arXiv:2204.05212, 2022
Cited by: 11; Year: 2022
Two failures of self-consistency in the multi-step reasoning of LLMs
A Chen, J Phang, A Parrish, V Padmakumar, C Zhao, SR Bowman, K Cho
arXiv preprint arXiv:2305.14279, 2023
Cited by: 10; Year: 2023
Conceptual combination in the LATL with and without syntactic composition
A Parrish, L Pylkkänen
Neurobiology of Language 3 (1), 46-66, 2022
Cited by: 10; Year: 2022
DICES dataset: Diversity in conversational AI evaluation for safety
L Aroyo, A Taylor, M Diaz, C Homan, A Parrish, G Serapio-García, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 8; Year: 2024
Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions
A Parrish, H Trivedi, N Nangia, V Padmakumar, J Phang, AS Saimbhi, ...
arXiv preprint arXiv:2210.10860, 2022
Cited by: 4; Year: 2022
Adversarial Nibbler: A data-centric challenge for improving the safety of text-to-image models
A Parrish, HR Kirk, J Quaye, C Rastogi, M Bartolo, O Inel, J Ciro, ...
arXiv preprint arXiv:2305.14384, 2023
Cited by: 3; Year: 2023
DMLR: Data-centric Machine Learning Research--Past, Present and Future
L Oala, M Maskey, L Bat-Leah, A Parrish, NM Gürel, TS Kuo, Y Liu, R Dror, ...
arXiv preprint arXiv:2311.13028, 2023
Cited by: 2; Year: 2023
Intersectionality in Conversational AI Safety: How Bayesian Multilevel Models Help Understand Diverse Perceptions of Safety
CM Homan, G Serapio-García, L Aroyo, M Diaz, A Parrish, V Prabhakaran, ...
arXiv preprint arXiv:2306.11530, 2023
Cited by: 2; Year: 2023