Behavior From the Void: Unsupervised Active Pre-Training H Liu, P Abbeel arXiv preprint arXiv:2103.04551, 2021 | 103 | 2021 |
Taming MAML: Efficient unbiased meta-reinforcement learning H Liu, R Socher, C Xiong International Conference on Machine Learning, 4061-4071, 2019 | 83 | 2019 |
Action-depedent Control Variates for Policy Optimization via Stein's Identity H Liu, Y Feng, Y Mao, D Zhou, J Peng, Q Liu arXiv preprint arXiv:1710.11198, 2017 | 80 | 2017 |
APS: Active Pretraining with Successor Features H Liu, P Abbeel International Conference on Machine Learning, 6736-6747, 2021 | 66 | 2021 |
URLB: Unsupervised Reinforcement Learning Benchmark M Laskin, D Yarats, H Liu, K Lee, A Zhan, K Lu, C Cang, L Pinto, P Abbeel arXiv preprint arXiv:2110.15191, 2021 | 57 | 2021 |
Competitive Experience Replay H Liu, A Trott, R Socher, C Xiong International Conference on Learning Representations(ICLR) 2019, 2019 | 51 | 2019 |
Variational inference with tail-adaptive f-divergence D Wang, H Liu, Q Liu Advances in Neural Information Processing Systems 31, 2018 | 48 | 2018 |
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning D Yarats, D Brandfonbrener, H Liu, M Laskin, P Abbeel, A Lazaric, L Pinto arXiv preprint arXiv:2201.13425, 2022 | 36 | 2022 |
Hybrid discriminative-generative training via contrastive learning H Liu, P Abbeel arXiv preprint arXiv:2007.09070, 2020 | 35 | 2020 |
Multimodal Masked Autoencoders Learn Transferable Representations X Geng, H Liu, L Lee, D Schuurams, S Levine, P Abbeel arXiv preprint arXiv:2205.14204, 2022 | 30 | 2022 |
Structured Inference for Recurrent Hidden Semi-markov Model. H Liu, L He, H Bai, B Dai, K Bai, Z Xu IJCAI, 2447-2453, 2018 | 27 | 2018 |
Masked world models for visual control Y Seo, D Hafner, H Liu, F Liu, S James, K Lee, P Abbeel Conference on Robot Learning, 1332-1344, 2023 | 25 | 2023 |
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery M Laskin, H Liu, XB Peng, D Yarats, A Rajeswaran, P Abbeel arXiv preprint arXiv:2202.00161, 2022 | 21 | 2022 |
Koala: A dialogue model for academic research X Geng, A Gudibande, H Liu, E Wallace, P Abbeel, S Levine, D Song Blog post, April 1, 2023 | 15 | 2023 |
Aligning Text-to-Image Models using Human Feedback K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ... arXiv preprint arXiv:2302.12192, 2023 | 11 | 2023 |
Chain of Hindsight Aligns Language Models with Feedback H Liu, C Sferrazza, P Abbeel arXiv preprint arXiv:2302.02676, 2023 | 11* | 2023 |
Efficient Off-Policy Credit Assignment H Liu, R Socher, C Xiong US Patent App. 16/653,890, 2020 | 9 | 2020 |
Meta-Reinforcement Learning Gradient Estimation with Variance Reduction H Liu US Patent App. 16/395,083, 2020 | 8 | 2020 |
InstructRL: Simple yet Effective Instruction-Following Agents with Multimodal Transformer H Liu, L Lee, K Lee, P Abbeel arXiv preprint arXiv:2210.13431, 2022 | 7* | 2022 |
Stochastic sequential neural networks with structured inference H Liu, H Bai, L He, Z Xu arXiv preprint arXiv:1705.08695, 2017 | 5 | 2017 |