Human evaluation of text-to-image models on a multi-task benchmark V Petsiuk, AE Siemenn, S Surbehera, Z Chin, K Tyser, G Hunter, ... arXiv preprint arXiv:2211.12112, 2022 | 18 | 2022 |
Exploring the mit mathematics and eecs curriculum using large language models SJ Zhang, S Florin, AN Lee, E Niknafs, A Marginean, A Wang, K Tyser, ... arXiv preprint arXiv:2306.08997, 2023 | 16 | 2023 |
From human days to machine seconds: Automatically answering and generating machine learning final exams MU Iddo Drori, Sarah Zhang, Reece Shuttleworth, Zad Chin, Pedro Lantigua ... International Conference on Knowledge Discovery and Data Mining (KDD), 2023 | 4* | 2023 |
A dataset and benchmark for automatically answering and generating machine learning final exams S Zhang, R Shuttleworth, D Austin, Y Hicke, L Tang, S Karnik, ... arXiv preprint arXiv:2206.05442, 2022 | 4 | 2022 |
Automatically Answering and Generating Machine Learning Final Exams S Zhang, RS Shuttleworth, Z Chin, P Lantigua, S Surbehera, G Hunter, ... | 2 | 2022 |
A dataset for learning university STEM courses at scale and generating questions at a human level Educational Advances in Artificial Intelligence (EAAI), 2023 | 1* | 2023 |
Text to graphics by program synthesis with error correction ID Ivan Nikitovic, Trisha Anil, Showndarya Madhavan, Arvind Raghavan, Zad ... CVPR Generative Models for Computer Vision Workshop (GCV), 2023 | | 2023 |
Identifying Structure in the MIMIC ICU Dataset Z Chin, S Raval, F Doshi-Velez, M Wattenberg, LA Celi NeurIPS 2022 Workshop on Learning from Time Series for Health, 2022 | | 2022 |