关注
Ching-An Cheng
Ching-An Cheng
Microsoft Research
在 microsoft.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Agile autonomous driving using end-to-end deep imitation learning
Y Pan, CA Cheng, K Saigol, K Lee, X Yan, E Theodorou, B Boots
Robotics: science and systems, 2018
3772018
Bellman-consistent pessimism for offline reinforcement learning
T Xie, CA Cheng, N Jiang, P Mineiro, A Agarwal
Advances in neural information processing systems 34, 6683-6694, 2021
2922021
Truncated back-propagation for bilevel optimization
A Shaban, CA Cheng, N Hatch, B Boots
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
2842019
Imitation learning for agile autonomous driving
Y Pan, CA Cheng, K Saigol, K Lee, X Yan, EA Theodorou, B Boots
The International Journal of Robotics Research 39 (2-3), 286-302, 2020
1722020
Adversarially trained actor critic for offline reinforcement learning
CA Cheng, T Xie, N Jiang, A Agarwal
International Conference on Machine Learning, 3852-3878, 2022
1412022
Fast policy learning through imitation and reinforcement
CA Cheng, X Yan, N Wagener, B Boots
arXiv preprint arXiv:1805.10413, 2018
972018
RMPflow: A Computational Graph for Automatic Motion Policy Generation
CA Cheng, M Mukadam, J Issac, S Birchfield, D Fox, B Boots, N Ratliff
Algorithmic Foundations of Robotics XIII: Proceedings of the 13th Workshop …, 2020
932020
Variational inference for Gaussian process models with linear complexity
CA Cheng, B Boots
Advances in Neural Information Processing Systems 30, 2017
922017
An online learning approach to model predictive control
N Wagener, CA Cheng, J Sacks, B Boots
arXiv preprint arXiv:1902.08967, 2019
892019
Intra order-preserving functions for calibration of multi-class neural networks
A Rahimi, A Shaban, CA Cheng, R Hartley, B Boots
Advances in Neural Information Processing Systems 33, 13456-13467, 2020
722020
Direct nash optimization: Teaching language models to self-improve with general preferences
C Rosset, CA Cheng, A Mitra, M Santacroce, A Awadallah, T Xie
arXiv preprint arXiv:2404.03715, 2024
652024
Heuristic-guided reinforcement learning
CA Cheng, A Kolobov, A Swaminathan
Advances in Neural Information Processing Systems 34, 13550-13563, 2021
642021
Cautiously optimistic policy optimization and exploration with linear function approximation
A Zanette, CA Cheng, A Agarwal
Conference on Learning Theory, 4473-4525, 2021
622021
Safe reinforcement learning using advantage-based intervention
NC Wagener, B Boots, CA Cheng
International Conference on Machine Learning, 10630-10640, 2021
602021
Orthogonally decoupled variational Gaussian processes
H Salimbeni, CA Cheng, B Boots, M Deisenroth
Advances in neural information processing systems 31, 2018
532018
Incremental variational sparse Gaussian process regression
CA Cheng, B Boots
Advances in Neural Information Processing Systems 29, 2016
532016
Virtual impedance control for safe human-robot interaction
SY Lo, CA Cheng, HP Huang
Journal of Intelligent & Robotic Systems 82, 3-19, 2016
442016
RMPflow: A Geometric Framework for Generation of Multitask Motion Policies
CA Cheng, M Mukadam, J Issac, S Birchfield, D Fox, B Boots, N Ratliff
IEEE Transactions on Automation Science and Engineering 18 (3), 968-987, 2021
392021
Policy improvement via imitation of multiple oracles
CA Cheng, A Kolobov, A Agarwal
Advances in Neural Information Processing Systems 33, 5587-5598, 2020
342020
Convergence of value aggregation for imitation learning
CA Cheng, B Boots
International Conference on Artificial Intelligence and Statistics, 1801-1809, 2018
342018
系统目前无法执行此操作,请稍后再试。
文章 1–20