Hao Hu

Cited by

	All	Since 2019
Citations	194	194
h-index	8	8
i10-index	8	8

Citations per year: 2021: 10, 2022: 37, 2023: 78, 2024: 67

Public access

2 articles available
0 articles not available

Based on funding mandates

Co-authors

Chongjie Zhang, Washington University in St. Louis (verified email at wustl.edu)
Yiqin Yang, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Qianchuan Zhao, Center for Intelligent and Networked Systems, Dept. Automation, Tsinghua University, Beijing, China (verified email at tsinghua.edu.cn)
Shenao Zhang, Northwestern University (verified email at gatech.edu)
Zhihan Liu, Ph.D. (verified email at u.northwestern.edu)
Zhaoran Wang, Assistant Professor at Northwestern University (verified email at northwestern.edu)
Yingfeng Chen (陈赢峰), Fuxi AI Lab in Netease (verified email at mail.ustc.edu.cn)
Zhizhou Ren, University of Illinois at Urbana-Champaign (verified email at illinois.edu)
Guangxiang Zhu, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Xiaoteng Ma (马骁腾), Center for Intelligent and Networked Systems, Dept. Automation, Tsinghua University, Beijing, China (verified email at mails.tsinghua.edu.cn)
Jianhao Wang, PhD of Computer Science, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Jin Zhang, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Zhuoran Yang, Yale University (verified email at yale.edu)
Wei Xiong, Computer Science, University of Illinois Urbana-Champaign (verified email at illinois.edu)
Han Zhong, Peking University (verified email at stu.pku.edu.cn)
Miao Lu, Stanford University (verified email at stanford.edu)
Qihan Liu, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Boyi Liu, Northwestern University (verified email at u.northwestern.edu)
Shuqi Ke, Carnegie Mellon University (verified email at andrew.cmu.edu)
Beining Han, Computer Science, Princeton University (verified email at princeton.edu)

Hao Hu

Tsinghua University

Verified email at mails.tsinghua.edu.cn

Reinforcement Learning


Title	Cited by	Year
Generalizable episodic memory for deep reinforcement learning. H Hu, J Ye, G Zhu, Z Ren, C Zhang. Thirty-eighth International Conference on Machine Learning (ICML 2021), 2021	39	2021
Metacure: Meta reinforcement learning with empowerment-driven exploration. J Zhang, J Wang, H Hu, T Chen, Y Chen, C Fan, C Zhang. Thirty-eighth International Conference on Machine Learning (ICML 2021 …, 2021	38*	2021
Offline Reinforcement Learning with Value-based Episodic Memory. X Ma, Y Yang, H Hu*, Q Liu, J Yang, C Zhang, Q Zhao, B Liang. Tenth International Conference on Learning Representations (ICLR 2022), 2021	34	2021
Maximize to explore: One objective function fusing estimation, planning, and exploration. Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang. Advances in Neural Information Processing Systems 36, 2024	20*	2024
On the Estimation Bias in Double Q-Learning. Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang. Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS …, 2021	13	2021
On the Role of Discount Factor in Offline Reinforcement Learning. H Hu, Y Yang, Q Zhao, C Zhang. Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022	12	2022
What is essential for unseen goal generalization of offline goal-conditioned RL? R Yang, L Yong, X Ma, H Hu, C Zhang, T Zhang. International Conference on Machine Learning, 39543-39571, 2023	11	2023
Reason for future, act for now: A principled framework for autonomous llm agents with provable sample efficiency. Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang. arXiv preprint arXiv:2309.17382, 2023	10*	2023
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery. Y Yang, H Hu, W Li*, S Li, J Yang, Q Zhao, C Zhang. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023, 2022	8	2022
The provable benefits of unsupervised data sharing for offline reinforcement learning. H Hu, Y Yang, Q Zhao, C Zhang. arXiv preprint arXiv:2302.13493, 2023	7	2023
Unsupervised behavior extraction via random intent priors. H Hu, Y Yang, J Ye, Z Mai, C Zhang. Advances in Neural Information Processing Systems 36, 2024	2	2024
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets. Y Mao, C Wu, X Chen, H Hu, J Jiang, T Zhou, T Lv, C Fan, Z Hu, Y Wu, ... The Twelfth International Conference on Learning Representations, 2023		2023
Bayesian Offline-to-Online Reinforcement Learning: A Realist Approach. H Hu, Y Yang, J Ye, Z Mai, Y Hu, T Lv, C Fan, Q Zhao, C Zhang		2023
Query-Efficient Offline Preference-Based Reinforcement Learning via In-Dataset Exploration. H Hu, Y Yang, J Zhang, S Wang, B Liu, Y Gao, C Zhang		2023
