Action-depedent Control Variates for Policy Optimization via Stein's Identity H Liu, Y Feng, Y Mao, D Zhou, J Peng, Q Liu arXiv preprint arXiv:1710.11198, 2017 | 46 | 2017 |
Competitive Experience Replay H Liu, A Trott, R Socher, C Xiong International Conference on Learning Representations(ICLR) 2019, 2019 | 33* | 2019 |
Variational inference with tail-adaptive f-divergence D Wang, H Liu, Q Liu arXiv preprint arXiv:1810.11943, 2018 | 23 | 2018 |
Taming MAML: Efficient unbiased meta-reinforcement learning H Liu, R Socher, C Xiong International Conference on Machine Learning, 4061-4071, 2019 | 22 | 2019 |
Structured Inference for Recurrent Hidden Semi-markov Model. H Liu, L He, H Bai, B Dai, K Bai, Z Xu IJCAI, 2447-2453, 2018 | 15 | 2018 |
Hybrid discriminative-generative training via contrastive learning H Liu, P Abbeel arXiv preprint arXiv:2007.09070, 2020 | 6 | 2020 |
Sample-efficient policy optimization with stein control variate H Liu, Y Feng, Y Mao, D Zhou, J Peng, Q Liu arXiv preprint arXiv:1710.11198, 2017 | 4 | 2017 |
Stochastic sequential neural networks with structured inference H Liu, H Bai, L He, Z Xu arXiv preprint arXiv:1705.08695, 2017 | 4 | 2017 |
Behavior From the Void: Unsupervised Active Pre-Training L Hao, A Pieter arXiv preprint arXiv:2103.04551, 2021 | | 2021 |
Shrinkage-based Bias-Variance Trade-off for Deep Reinforcement Learning Y Feng, H Liu, J Peng, Q Liu | | 2018 |