Hao Hu
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
Generalizable episodic memory for deep reinforcement learning
H Hu, J Ye, G Zhu, Z Ren, C Zhang
Thirty-eighth International Conference on Machine Learning (ICML 2021), 2021
Cited by: 37; Year: 2021
Metacure: Meta reinforcement learning with empowerment-driven exploration
J Zhang, J Wang, H Hu, T Chen, Y Chen, C Fan, C Zhang
Thirty-eighth International Conference on Machine Learning (ICML 2021 …, 2021
Cited by: 32*; Year: 2021
Offline Reinforcement Learning with Value-based Episodic Memory
X Ma*, Y Yang*, H Hu*, Q Liu, J Yang, C Zhang, Q Zhao, B Liang
Tenth International Conference on Learning Representations (ICLR 2022), 2021
Cited by: 31; Year: 2021
On the Estimation Bias in Double Q-Learning
Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS …, 2021
Cited by: 12; Year: 2021
One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang
arXiv preprint arXiv:2305.18258, 2023
Cited by: 9; Year: 2023
On the Role of Discount Factor in Offline Reinforcement Learning
H Hu, Y Yang, Q Zhao, C Zhang
Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022
Cited by: 9; Year: 2022
What is essential for unseen goal generalization of offline goal-conditioned RL?
R Yang, L Yong, X Ma, H Hu, C Zhang, T Zhang
International Conference on Machine Learning, 39543-39571, 2023
Cited by: 7; Year: 2023
The provable benefits of unsupervised data sharing for offline reinforcement learning
H Hu, Y Yang, Q Zhao, C Zhang
arXiv preprint arXiv:2302.13493, 2023
Cited by: 6; Year: 2023
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery
Y Yang*, H Hu*, W Li*, S Li, J Yang, Q Zhao, C Zhang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023, 2022
Cited by: 6; Year: 2022
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang
arXiv preprint arXiv:2309.17382, 2023
Cited by: 4; Year: 2023
Maximize to explore: One objective function fusing estimation, planning, and exploration
Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang
Advances in Neural Information Processing Systems 36, 2024
Cited by: 2; Year: 2024
Unsupervised Behavior Extraction via Random Intent Priors
H Hu, Y Yang, J Ye, Z Mai, C Zhang
Advances in Neural Information Processing Systems 36, 2024
2024
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets
Y Mao, C Wu, X Chen, H Hu, J Jiang, T Zhou, T Lv, C Fan, Z Hu, Y Wu, ...
The Twelfth International Conference on Learning Representations, 2023
2023
Bayesian Offline-to-Online Reinforcement Learning: A Realist Approach
H Hu, Y Yang, J Ye, Z Mai, Y Hu, T Lv, C Fan, Q Zhao, C Zhang
2023
Query-Efficient Offline Preference-Based Reinforcement Learning via In-Dataset Exploration
H Hu, Y Yang, J Zhang, S Wang, B Liu, Y Gao, C Zhang
2023
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents
Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang
2023
Intrinsically Guided Exploration in Meta Reinforcement Learning
J Zhang, J Wang, H Hu, T Chen, Y Chen, C Fan, C Zhang
2020