Jie Liu (刘杰)

Cited by

	All	Since 2019
Citations	74	74
h-index	4	4
i10-index	4	4

Citations per year: 2021: 9, 2022: 7, 2023: 28, 2024: 30

Public access (based on funding mandates): 2 articles available, 0 articles not available

Co-authors

Wanli Ouyang (欧阳万里) - Shanghai AI Lab - Verified email at sydney.edu.au
Yu Liu - SenseTime - Verified email at sensetime.com
Yaodong Yang - BOYA (博雅) Assistant Professor at Peking University - Verified email at pku.edu.cn
Ming Sun - Kuaishou Tech - Verified email at kuaishou.com
Feng (Jeff) Liang - PhD Student, The University of Texas at Austin - Verified email at utexas.edu
Chen Lin - Torr Vision Group, University of Oxford - Verified email at eng.ox.ac.uk
Dong Xu, Professor & IEEE Fellow - University of Hong Kong - Verified email at hku.hk
Letian Wang - University of Toronto | Carnegie Mellon University | UC Berkeley - Verified email at mail.utoronto.ca
Chuming Li - University of Sydney


Jie Liu (刘杰)

The Chinese University of Hong Kong

Verified email at link.cuhk.edu.hk

Large Language Model, Reinforcement Learning


Title	Cited by	Year
Inception convolution with efficient dilation search. J Liu, C Li, F Liang, C Lin, M Sun, J Yan, W Ouyang, D Xu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	34	2021
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency. C Li, J Liu, Y Zhang, Y Wei, Y Niu, Y Yang, Y Liu, W Ouyang. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2023), 2023	16	2023
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization. Z Zhou, J Liu, C Yang, J Shao, Y Liu, X Yue, W Ouyang, Y Qiao. arXiv preprint arXiv:2310.03708, 2023	10	2023
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors. L Wang, J Liu, H Shao, W Wang, R Chen, Y Liu, SL Waslander. Robotics: Science and Systems (RSS 2023), 2023	10	2023
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues. G Bai, J Liu, X Bu, Y He, J Liu, Z Zhou, Z Lin, W Su, T Ge, B Zheng, ... arXiv preprint arXiv:2402.14762, 2024	2	2024
Masked Pretraining for Multi-Agent Decision Making. J Liu, Y Zhang, C Li, C Yang, Y Yang, Y Liu, W Ouyang. arXiv preprint arXiv:2310.11846, 2023	1	2023
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. C Li, R Jia, J Liu, Y Zhang, Y Niu, Y Yang, Y Liu, W Ouyang. Proceedings of the European Conference on Artificial Intelligence, 2023	1	2023
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models. Y Wu, J Liu, X Bu, J Liu, Z Zhou, Y Zhang, C Zhang, Z Bai, H Chen, T Ge, ... arXiv preprint arXiv:2402.14660, 2024		2024
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! Z Zhou, J Liu, Z Dong, J Liu, C Yang, W Ouyang, Y Qiao. arXiv preprint arXiv:2402.12343, 2024		2024
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. Y Zhang, J Liu, C Li, Y Niu, Y Yang, Y Liu, W Ouyang. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2024), 2023		2023
Adaptive Gradient Method with Resilience and Momentum. J Liu, C Lin, C Li, L Sheng, M Sun, J Yan, W Ouyang. arXiv preprint arXiv:2010.11041, 2020		2020
