Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data. Y Li*, P Yuan*, S Feng, B Pan, B Sun, X Wang, H Wang, K Li. AAAI 2024, 2023 | 6 | 2023 |
BatchEval: Towards Human-like Text Evaluation. P Yuan, S Feng, Y Li, X Wang, B Pan, H Wang, K Li. ACL 2024 main, 2023 | 3 | 2023 |
Generative Dense Retrieval: Memory Can Be a Burden. P Yuan*, X Wang*, S Feng, B Pan, Y Li, H Wang, X Miao, K Li. EACL 2024, 2024 | 2 | 2024 |
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning. Y Li*, P Yuan*, S Feng, B Pan, X Wang, B Sun, H Wang, K Li. ICLR 2024, 2024 | 2 | 2024 |
Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation. X Wang, Y Li, S Feng, P Yuan, B Pan, H Wang, Y Hu, K Li. arXiv preprint arXiv:2407.02056, 2024 | | 2024 |
Better correlation and robustness: a distribution-balanced self-supervised learning framework for automatic dialogue evaluation. P Yuan, X Wang, J Shi, B Sun, Y Li. Advances in Neural Information Processing Systems 36, 2024 | | 2024 |
Parallel Corpora Alignment Framework for Multilingual and Robust Automatic Dialogue Evaluation. X Wang*, J Shi*, P Yuan*, K Li. Proceedings of The Eleventh Dialog System Technology Challenge, 123-132, 2023 | | 2023 |
Mode: A Benchmark and a Probe into Multimodal Open-Domain Dialogue Evaluation. H Yin, X Wang, Y Zhang, P Lu, B Sun, P Yuan, K Li. Available at SSRN 4888542 | | |
Tracking Cognitive Development of Large Language Models. X Wang, P Yuan, S Feng, B Pan, Y Li, B Sun, H Wang, K Li | | |