关注
Andrey Kolobov
Andrey Kolobov
Microsoft Research
在 microsoft.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
BLOG: Probabilistic Models with Unknown Objects
B Milch, B Marthi, S Russell, D Sontag, DL Ong, A Kolobov
Statistical relational learning, 373, 2007
6192007
Open x-embodiment: Robotic learning datasets and rt-x models
A O'Neill, A Rehman, A Gupta, A Maddukuri, A Gupta, A Padalkar, A Lee, ...
arXiv preprint arXiv:2310.08864, 2023
361*2023
Parallel task routing for crowdsourcing
J Bragg, A Kolobov, M Mausam, D Weld
Proceedings of the AAAI Conference on Human Computation and Crowdsourcing 2 …, 2014
221*2014
Planning with Markov decision processes: An AI perspective
Mausam, A Kolobov
Synthesis Lectures on Artificial Intelligence and Machine Learning 6 (1), 1-210, 2012
205*2012
Introduction to statistical relational learning
D Koller, N Friedman, S Džeroski, C Sutton, A McCallum, A Pfeffer, ...
MIT press, 2007
1802007
Interactive teaching strategies for agent training
O Amir, E Kamar, A Kolobov, B Grosz
IJCAI 2016, 2016
1492016
Heuristic search for generalized stochastic shortest path MDPs
A Kolobov, Mausam, DS Weld, H Geffner
Twenty-First International Conference on Automated Planning and Scheduling, 2011
113*2011
Safe reinforcement learning via curriculum induction
M Turchetta, A Kolobov, S Shah, A Krause, A Agarwal
Advances in Neural Information Processing Systems 33, 12151-12162, 2020
1092020
A Theory of Goal-Oriented MDPs with Dead Ends
A Kolobov, Mausam, DS Weld
UAI, 2012
103*2012
LRTDP vs. UCT for Online Probabilistic Planning
A Kolobov, Mausam, DS Weld
Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012
75*2012
Approximate inference for infinite contingent Bayesian networks
B Milch, B Marthi, D Sontag, S Russell, DL Ong, A Kolobov
AISTATS, 2005
732005
Heuristic-guided reinforcement learning
CA Cheng, A Kolobov, A Swaminathan
Advances in Neural Information Processing Systems 34, 13550-13563, 2021
642021
Reverse Iterative Deepening for Finite-Horizon MDPs with Large Branching Factors
A Kolobov, P Dai, Mausam, DS Weld
Proceedings of the 22nd International Conference on Automated Planning and …, 2012
53*2012
Metareasoning for planning under uncertainty
CH Lin, A Kolobov, E Kamar, E Horvitz
arXiv preprint arXiv:1505.00399, 2015
462015
ReTrASE: Integrating paradigms for approximate probabilistic planning
A Kolobov, Mausam, DS Weld
Twenty-First International Joint Conference on Artificial Intelligence, 1746 …, 2009
432009
SixthSense: Fast and reliable recognition of dead ends in MDPs
A Kolobov, Mausam, DS Weld
Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
37*2010
TODTLER: Two-order-deep transfer learning
J Van Haaren, A Kolobov, J Davis
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
362015
Classical Planning in MDP Heuristics: With a Little Help from Generalization
A Kolobov, Mausam, DS Weld
Twentieth International Conference on Automated Planning and Scheduling, 97-104, 2010
35*2010
Policy improvement via imitation of multiple oracles
CA Cheng, A Kolobov, A Agarwal
Advances in Neural Information Processing Systems 33, 5587-5598, 2020
342020
Cross-trajectory representation learning for zero-shot generalization in RL
B Mazoure, AM Ahmed, P MacAlpine, RD Hjelm, A Kolobov
arXiv preprint arXiv:2106.02193, 2021
292021
系统目前无法执行此操作,请稍后再试。
文章 1–20