OpenMEVA: A benchmark for evaluating open-ended story generation metrics J Guan, Z Zhang, Z Feng, Z Liu, W Ding, X Mao, C Fan, M Huang arXiv preprint arXiv:2105.08920, 2021 | 41 | 2021 |
LOT: A story-centric benchmark for evaluating Chinese long text understanding and generation J Guan, Z Feng, Y Chen, R He, X Mao, C Fan, M Huang Transactions of the Association for Computational Linguistics 10, 434-451, 2022 | 28 | 2022 |
Alignbench: Benchmarking chinese alignment of large language models X Liu, X Lei, S Wang, Y Huang, Z Feng, B Wen, J Cheng, P Ke, Y Xu, ... arXiv preprint arXiv:2311.18743, 2023 | 14 | 2023 |
Critiquellm: Scaling llm-as-critic for effective and explainable evaluation of large language model generation P Ke, B Wen, Z Feng, X Liu, X Lei, J Cheng, S Wang, A Zeng, Y Dong, ... arXiv preprint arXiv:2311.18702, 2023 | 14 | 2023 |
Lot: A benchmark for evaluating chinese long text understanding and generation J Guan, Z Feng, Y Chen, R He, X Mao, C Fan, M Huang arXiv preprint arXiv:2108.12960, 2021 | 6 | 2021 |
Networking Chemicals Flows: Efficiency–Value–Environment Functionalized Symbiosis Algorithms and Application Y Lyu, ZA Feng, T Ji, J Tian, L Chen Environmental Science & Technology 57 (46), 18225-18235, 2023 | 1 | 2023 |