Rui Yang
Title
Cited by
Year
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, L Han, C Zhang
International Conference on Learning Representations (ICLR) 2022, 2022
Cited by: 65, Year: 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
R Yang*, C Bai*, X Ma, Z Wang, C Zhang, L Han
Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022
Cited by: 64, Year: 2022
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
L Li, R Yang, D Luo
International Conference on Learning Representations (ICLR) 2021, 2020
Cited by: 62, Year: 2020
Exploiting Reward Shifting in Value-Based Deep RL
H Sun, L Han, R Yang, X Ma, J Guo, B Zhou
Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022
Cited by: 34*, Year: 2022
Arithmetic control of LLMs for diverse user preferences: Directional preference alignment with multi-objective rewards
H Wang, Y Lin, W Xiong, R Yang, S Diao, S Qiu, H Zhao, T Zhang
Annual Meeting of the Association for Computational Linguistics (ACL) 2024, 2024
Cited by: 24, Year: 2024
MHER: Model-based Hindsight Experience Replay
R Yang, M Fang, L Han, Y Du, F Luo, X Li
NeurIPS 2021 Deep RL Workshop, 2021
Cited by: 23, Year: 2021
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
R Yang, Y Lin, X Ma, H Hu, C Zhang, T Zhang
International Conference on Machine Learning (ICML) 2023, 2023
Cited by: 18, Year: 2023
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
R Yang*, X Pan*, F Luo*, S Qiu*, H Zhong, D Yu, J Chen
International Conference on Machine Learning (ICML) 2024, 2024
Cited by: 17, Year: 2024
A survey on sparse reward algorithms in reinforcement learning: theory and experiment
Rui Yang, Jiangpeng Yan, Xiu Li
CAAI Transactions on Intelligent Systems 15 (5), 888-899, 2020
Cited by: 17*, Year: 2020
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
C Ye*, R Yang*, Q Gu, T Zhang
Advances in Neural Information Processing Systems (NeurIPS) 2023, 2023
Cited by: 10, Year: 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
R Yang*, H Zhong*, J Xu*, A Zhang, C Zhang, L Han, T Zhang
International Conference on Learning Representations (ICLR) 2024, 2023
Cited by: 9, Year: 2023
Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning
R Yang, J Lyu, Y Yang, J Yan, F Luo, D Luo, L Li, X Li
arXiv preprint arXiv:2102.12962, 2021
Cited by: 9*, Year: 2021
Towards robust offline-to-online reinforcement learning via uncertainty and smoothness
X Wen, X Yu, R Yang, C Bai, Z Wang
Journal of Artificial Intelligence Research, 2024, 2023
Cited by: 7, Year: 2023
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
R Yang, R Ding, Y Lin, H Zhang, T Zhang
Advances in Neural Information Processing Systems (NeurIPS) 2024, 2024
Cited by: 6, Year: 2024
Efficient multi-goal reinforcement learning via value consistency prioritization
J Xu, S Li, R Yang, C Yuan, L Han
Journal of Artificial Intelligence Research 77, 355-376, 2023
Cited by: 4, Year: 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
M Wang*, R Yang*, X Chen, H Sun, M Fang, M Giovanni
Transactions on Machine Learning Research (TMLR) 2024, 2023
Cited by: 3, Year: 2023
Combining hindsight with goal-enhanced prediction for multi-goal reinforcement learning
R Yang, F Luo, X Li
2021 IEEE 33rd International Conference on Tools with Artificial …, 2021
Cited by: 2, Year: 2021
Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
S Qiu, D Zhang, R Yang, B Lyu, T Zhang
arXiv preprint arXiv:2407.17466, 2024
2024
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling
J Xu*, R Yang*, F Luo, M Fang, B Wang, L Han
arXiv preprint arXiv:2407.04285, 2024
2024
Robot control method, apparatus and device, storage medium and program product
R Yang, L Li, D Luo
US Patent App. 17/957,710, 2023
2023