关注
Chengqian Gao
Chengqian Gao
MBZUAI
在 mbzuai.ac.ae 的电子邮件经过验证
标题
引用次数
引用次数
年份
Value penalized q-learning for recommender systems
C Gao, K Xu, K Zhou, L Li, X Wang, B Yuan, P Zhao
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
182022
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
C Gao, K Xu, L Liu, D Ye, P Zhao, Z Xu
arXiv preprint arXiv:2210.10469, 2022
12022
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
C Gao, W de Vazelhes, H Zhang, B Gu, Z Xu
arXiv preprint arXiv:2405.01615, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–3