Discovering generalizable multi-agent coordination skills from multi-task offline data F Zhang, C Jia, YC Li, L Yuan, Y Yu, Z Zhang The Eleventh International Conference on Learning Representations, 2023 | 25 | 2023 |
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning Y Ran, YC Li, F Zhang, Z Zhang, Y Yu Proceedings of the 40th International Conference on Machine Learning 202 ¡K, 2023 | 16 | 2023 |
Efficient Human-AI Coordination via Preparatory Language-based Convention C Guan, L Zhang, C Fan, Y Li, F Chen, L Li, Y Tian, L Yuan, Y Yu arXiv preprint arXiv:2311.00416, 2023 | 3 | 2023 |
Cost-aware offline safe meta reinforcement learning with robust in-distribution online task adaptation C Guan, R Xue, Z Zhang, L Li, YC Li, L Yuan, Y Yu Proceedings of the 23rd International Conference on Autonomous Agents and ¡K, 2024 | 2 | 2024 |
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation C Jia, F Zhang, YC Li, CX Gao, XH Liu, L Yuan, Z Zhang, Y Yu arXiv preprint arXiv:2403.07261, 2024 | 2 | 2024 |
Dynamics Adaptive Safe Reinforcement Learning with a Misspecified Simulator R Xue, Z Zhang, L Li, F Chen, YC Li, Y Yu, L Yuan Joint European Conference on Machine Learning and Knowledge Discovery in ¡K, 2024 | | 2024 |
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning H Lin, YY Xu, Y Sun, Z Zhang, YC Li, C Jia, J Ye, J Zhang, Y Yu arXiv preprint arXiv:2405.17031, 2024 | | 2024 |
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics X Zhang, W Qiu, YC Li, L Yuan, C Jia, Z Zhang, Y Yu arXiv preprint arXiv:2402.11317, 2024 | | 2024 |
Learning generalizable batch active learning strategies via deep Q-networks (student abstract) YC Li, WJ Shen, B Zhang, F Mao, Z Zhang, Y Yu Proceedings of the AAAI Conference on Artificial Intelligence 37 (13), 16258 ¡K, 2023 | | 2023 |
Deep Demonstration Tracing: Learning Generalized Imitator for Runtime Imitation from a Single Demonstration XH Chen, J Ye, H Zhao, YC Li, XH Liu, H Shi, YY Xu, Z Ye, SH Yang, Y Yu, ... Forty-first International Conference on Machine Learning, 0 | | |
Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal L Li, R Chen, Z Zhang, Z Wu, YC Li, C Guan, Y Yu, L Yuan | | |