Inception convolution with efficient dilation search J Liu, C Li, F Liang, C Lin, M Sun, J Yan, W Ouyang, D Xu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 34 | 2021 |
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency C Li, J Liu, Y Zhang, Y Wei, Y Niu, Y Yang, Y Liu, W Ouyang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2023), 2023 | 16 | 2023 |
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization Z Zhou, J Liu, C Yang, J Shao, Y Liu, X Yue, W Ouyang, Y Qiao arXiv preprint arXiv:2310.03708, 2023 | 10 | 2023 |
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors L Wang, J Liu, H Shao, W Wang, R Chen, Y Liu, SL Waslander Robotics: Science and Systems (RSS 2023), 2023 | 10 | 2023 |
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues G Bai, J Liu, X Bu, Y He, J Liu, Z Zhou, Z Lin, W Su, T Ge, B Zheng, ... arXiv preprint arXiv:2402.14762, 2024 | 2 | 2024 |
Masked Pretraining for Multi-Agent Decision Making J Liu, Y Zhang, C Li, C Yang, Y Yang, Y Liu, W Ouyang arXiv preprint arXiv:2310.11846, 2023 | 1 | 2023 |
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning C Li, R Jia, J Liu, Y Zhang, Y Niu, Y Yang, Y Liu, W Ouyang Proceedings of the European Conference on Artificial Intelligence, 2023 | 1 | 2023 |
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models Y Wu, J Liu, X Bu, J Liu, Z Zhou, Y Zhang, C Zhang, Z Bai, H Chen, T Ge, ... arXiv preprint arXiv:2402.14660, 2024 | | 2024 |
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! Z Zhou, J Liu, Z Dong, J Liu, C Yang, W Ouyang, Y Qiao arXiv preprint arXiv:2402.12343, 2024 | | 2024 |
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning Y Zhang, J Liu, C Li, Y Niu, Y Yang, Y Liu, W Ouyang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2024), 2023 | | 2023 |
Adaptive Gradient Method with Resilience and Momentum J Liu, C Lin, C Li, L Sheng, M Sun, J Yan, W Ouyang arXiv preprint arXiv:2010.11041, 2020 | | 2020 |