Reducing variance in temporal-difference value estimation via ensemble of deep networks L Liang, Y Xu, S McAleer, D Hu, A Ihler, P Abbeel, R Fox International Conference on Machine Learning (ICML 2022), 2022 | 22 | 2022 |
Modular Framework for Visuomotor Language Grounding K Nottingham, L Liang, D Shin, CC Fowlkes, R Fox, S Singh Embodied AI Workshop @ CVPR 2021, 2021 | 16 | 2021 |
Target Entropy Annealing for Discrete Soft Actor-Critic Y Xu, D Hu, L Liang, S McAleer, P Abbeel, R Fox Deep Reinforcement Learning Worshop @ NeurIPS 2021, 2021 | 13 | 2021 |
Reparameterized Policy Learning for Multimodal Trajectory Optimization Z Huang, L Liang, Z Ling, X Li, C Gan, H Su International Conference on Machine Learning (ICML 2023), 2023 | 9 | 2023 |
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates L Liang, Y Xu, S McAleer, D Hu, A Ihler, P Abbeel, R Fox Deep Reinforcement Learning Workshop @ NeurIPS 2021, 2021 | 5 | 2021 |
Robo360: A 3D Omnispective Multi-Material Robotic Manipulation Dataset L Liang, L Bian, C Xiao, J Zhang, L Chen, I Liu, F Xiang, Z Huang, H Su arXiv preprint arXiv:2312.06686, 2023 | 4 | 2023 |
Variational Reparameterized Policy Learning with Differentiable Physics Z Huang, L Liang, Z Ling, X Li, C Gan, H Su Deep Reinforcement Learning Workshop @ NeurIPS 2022, 2022 | 2* | 2022 |
When should we prefer state-to-visual dagger over visual reinforcement learning? T Mu, Z Li, SW Strzelecki, X Yuan, Y Yao, L Liang, H Su arXiv preprint arXiv:2412.13662, 2024 | | 2024 |