关注
Hao Liu
Hao Liu
在 cs.berkeley.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Behavior From the Void: Unsupervised Active Pre-Training
H Liu, P Abbeel
arXiv preprint arXiv:2103.04551, 2021
1032021
Taming MAML: Efficient unbiased meta-reinforcement learning
H Liu, R Socher, C Xiong
International Conference on Machine Learning, 4061-4071, 2019
832019
Action-depedent Control Variates for Policy Optimization via Stein's Identity
H Liu, Y Feng, Y Mao, D Zhou, J Peng, Q Liu
arXiv preprint arXiv:1710.11198, 2017
802017
APS: Active Pretraining with Successor Features
H Liu, P Abbeel
International Conference on Machine Learning, 6736-6747, 2021
662021
URLB: Unsupervised Reinforcement Learning Benchmark
M Laskin, D Yarats, H Liu, K Lee, A Zhan, K Lu, C Cang, L Pinto, P Abbeel
arXiv preprint arXiv:2110.15191, 2021
572021
Competitive Experience Replay
H Liu, A Trott, R Socher, C Xiong
International Conference on Learning Representations(ICLR) 2019, 2019
512019
Variational inference with tail-adaptive f-divergence
D Wang, H Liu, Q Liu
Advances in Neural Information Processing Systems 31, 2018
482018
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
D Yarats, D Brandfonbrener, H Liu, M Laskin, P Abbeel, A Lazaric, L Pinto
arXiv preprint arXiv:2201.13425, 2022
362022
Hybrid discriminative-generative training via contrastive learning
H Liu, P Abbeel
arXiv preprint arXiv:2007.09070, 2020
352020
Multimodal Masked Autoencoders Learn Transferable Representations
X Geng, H Liu, L Lee, D Schuurams, S Levine, P Abbeel
arXiv preprint arXiv:2205.14204, 2022
302022
Structured Inference for Recurrent Hidden Semi-markov Model.
H Liu, L He, H Bai, B Dai, K Bai, Z Xu
IJCAI, 2447-2453, 2018
272018
Masked world models for visual control
Y Seo, D Hafner, H Liu, F Liu, S James, K Lee, P Abbeel
Conference on Robot Learning, 1332-1344, 2023
252023
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
M Laskin, H Liu, XB Peng, D Yarats, A Rajeswaran, P Abbeel
arXiv preprint arXiv:2202.00161, 2022
212022
Koala: A dialogue model for academic research
X Geng, A Gudibande, H Liu, E Wallace, P Abbeel, S Levine, D Song
Blog post, April 1, 2023
152023
Aligning Text-to-Image Models using Human Feedback
K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ...
arXiv preprint arXiv:2302.12192, 2023
112023
Chain of Hindsight Aligns Language Models with Feedback
H Liu, C Sferrazza, P Abbeel
arXiv preprint arXiv:2302.02676, 2023
11*2023
Efficient Off-Policy Credit Assignment
H Liu, R Socher, C Xiong
US Patent App. 16/653,890, 2020
92020
Meta-Reinforcement Learning Gradient Estimation with Variance Reduction
H Liu
US Patent App. 16/395,083, 2020
82020
InstructRL: Simple yet Effective Instruction-Following Agents with Multimodal Transformer
H Liu, L Lee, K Lee, P Abbeel
arXiv preprint arXiv:2210.13431, 2022
7*2022
Stochastic sequential neural networks with structured inference
H Liu, H Bai, L He, Z Xu
arXiv preprint arXiv:1705.08695, 2017
52017
系统目前无法执行此操作,请稍后再试。
文章 1–20