Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ... 🏆 CVPR (Best Paper Finalist), 9556-9567, 2024 | 554 | 2024 |
Agentbench: Evaluating llms as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023 | 435* | 2023 |
Graph embedding on biomedical networks: methods, applications and evaluations X Yue, Z Wang, J Huang, S Parthasarathy, S Moosavinasab, Y Huang, ... Bioinformatics 36 (4), 1241-1251, 2020 | 413 | 2020 |
Mind2web: Towards a generalist agent for the web X Deng, Y Gu, B Zheng, S Chen, S Stevens, B Wang, H Sun, Y Su 🏆 Advances in Neural Information Processing Systems (Spotlight) 36, 2024 | 344 | 2024 |
Turl: Table understanding through representation learning X Deng*, H Sun*, A Lees, Y Wu, C Yu 🏆 VLDB (ACM SIGMOD Research Highlight Award) 51 (1), 33-40, 2021 | 324 | 2021 |
Mammoth: Building math generalist models through hybrid instruction tuning X Yue, X Qu, G Zhang, Y Fu, W Huang, H Sun, Y Su, W Chen 🏆 ICLR (Spotlight), 2024 | 278 | 2024 |
Interpreting the public sentiment variations on twitter S Tan, Y Li, H Sun, Z Guan, X Yan, J Bu, C Chen, X He IEEE transactions on knowledge and data engineering 26 (5), 1158-1170, 2013 | 230 | 2013 |
Towards understanding chain-of-thought prompting: An empirical study of what matters B Wang, S Min, X Deng, J Shen, Y Wu, L Zettlemoyer, H Sun 🏆 ACL (Honorable Mention for Best Paper Awards), 2023 | 210* | 2023 |
Magicbrush: A manually annotated dataset for instruction-guided image editing K Zhang, L Mo, W Chen, H Sun, Y Su Advances in Neural Information Processing Systems 36, 2024 | 179 | 2024 |
Thinking about gpt-3 in-context learning for biomedical ie? think again BJ Gutiérrez, N McNeal, C Washington, Y Chen, L Li, H Sun, Y Su arXiv preprint arXiv:2203.08410, 2022 | 160* | 2022 |
Table cell search for question answering H Sun, H Ma, X He, W Yih, Y Su, X Yan Proceedings of the 25th International Conference on World Wide Web, 771-782, 2016 | 153 | 2016 |
Gpt-4v (ision) is a generalist web agent, if grounded B Zheng, B Gou, J Kil, H Sun, Y Su arXiv preprint arXiv:2401.01614, 2024 | 150 | 2024 |
Structure-grounded pretraining for text-to-SQL X Deng, AH Awadallah, C Meek, O Polozov, H Sun, M Richardson arXiv preprint arXiv:2010.12773, 2020 | 144 | 2020 |
On generating characteristic-rich question sets for qa evaluation Y Su, H Sun, B Sadler, M Srivatsa, I Gür, Z Yan, X Yan Proceedings of the 2016 Conference on Empirical Methods in Natural Language …, 2016 | 144 | 2016 |
Coacor: Code annotation for code retrieval with reinforcement learning Z Yao, JR Peddamail, H Sun The world wide web conference, 2203-2214, 2019 | 125 | 2019 |
Open domain question answering via semantic enrichment H Sun, H Ma, W Yih, CT Tsai, J Liu, MW Chang Proceedings of the 24th International Conference on World Wide Web, 1045-1055, 2015 | 125 | 2015 |
Iteratively prompt pre-trained language models for chain of thought B Wang, X Deng, H Sun arXiv preprint arXiv:2203.08383, 2022 | 124 | 2022 |
Multitask prompt tuning enables parameter-efficient transfer learning Z Wang, R Panda, L Karlinsky, R Feris, H Sun, Y Kim arXiv preprint arXiv:2303.02861, 2023 | 114 | 2023 |
Staqc: A systematically mined question-code dataset from stack overflow Z Yao, DS Weld, WP Chen, H Sun Proceedings of the 2018 World Wide Web Conference, 1693-1703, 2018 | 113 | 2018 |
Schemaless and structureless graph querying S Yang, Y Wu, H Sun, X Yan Proceedings of the VLDB Endowment 7 (7), 565-576, 2014 | 100 | 2014 |