Satinder Singh

Cited by

	All	Since 2019
Citations	42304	21474
h-index	78	59
i10-index	210	146

Citations per year

Year	Citations
1994	169
1995	146
1996	207
1997	238
1998	330
1999	312
2000	321
2001	398
2002	517
2003	551
2004	680
2005	806
2006	906
2007	1079
2008	1005
2009	1012
2010	991
2011	1016
2012	994
2013	1051
2014	1059
2015	1057
2016	1350
2017	1538
2018	2475
2019	3081
2020	3809
2021	4023
2022	4452
2023	4677
2024	1424

Public access

41 articles available
0 articles not available

Based on funding mandates

Co-authors

Name	Affiliation	Verified email at
Richard L. Lewis	Professor of Psychology, Linguistics and Cognitive Science, University of Michigan	umich.edu
Richard S. Sutton	Keen, Amii, and University of Alberta	richsutton.com
Michael Kearns	Professor of Computer Science, University of Pennsylvania	cis.upenn.edu
Doina Precup	DeepMind and McGill University	cs.mcgill.ca
Andrew Barto	University of Massachusetts Amherst	cs.umass.edu
Junhyuk Oh	Research Scientist, DeepMind	google.com
Yishay Mansour	Tel Aviv University	tauex.tau.ac.il
Michael Littman	Brown University	brown.edu
David McAllester	Professor, Toyota Technological Institute at Chicago	ttic.edu
Honglak Lee	LG AI Research / U. Michigan	umich.edu
Tommi Jaakkola	MIT	csail.mit.edu
David Silver	DeepMind, UCL	google.com
Michael Wellman	Professor of Computer Science & Engineering, University of Michigan	umich.edu
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	google.com
Yevgeniy Vorobeychik	Washington University in Saint Louis	wustl.edu
Edmund Durfee	Professor Emeritus of Computer Science and Engineering, University of Michigan	umich.edu
Tom Zahavy	Staff Research Scientist, Google DeepMind	deepmind.com
Nan Jiang	Assistant Professor of Computer Science, UIUC	illinois.edu
Peter Stone	Professor of Computer Science, The University of Texas at Austin	cs.utexas.edu
Xiaoxiao Guo	LinkedIn	fb.com

Satinder Singh

Google DeepMind / U. of Michigan

Verified email at umich.edu

Reinforcement Learning, Computational Game Theory, Artificial Intelligence


Title	Authors	Venue	Cited by	Year
Policy Gradient Methods for Reinforcement Learning with Function Approximation	R Sutton, D McAllester, S Singh, Y Mansour	Neural Information Processing Systems, 1999	8130	1999
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning	RS Sutton, D Precup, S Singh	Artificial intelligence 112 (1-2), 181-211, 1999	4308	1999
Learning to act using real-time dynamic programming	AG Barto, SJ Bradtke, SP Singh	Artificial intelligence 72 (1-2), 81-138, 1995	1643	1995
Near-optimal reinforcement learning in polynomial time	M Kearns, S Singh	Machine learning 49, 209-232, 2002	1309	2002
Convergence of stochastic iterative dynamic programming algorithms	T Jaakkola, M Jordan, S Singh	Advances in neural information processing systems 6, 1993	1300	1993
Action-conditional video prediction using deep networks in Atari games	J Oh, X Guo, H Lee, RL Lewis, S Singh	Advances in neural information processing systems 28, 2015	1029	2015
Reinforcement learning with replacing eligibility traces	SP Singh, RS Sutton	Machine learning 22 (1), 123-158, 1996	1024	1996
Intrinsically motivated reinforcement learning	N Chentanez, A Barto, S Singh	Advances in neural information processing systems 17, 2004	1014	2004
Convergence results for single-step on-policy reinforcement-learning algorithms	S Singh, T Jaakkola, ML Littman, C Szepesvári	Machine learning 38, 287-308, 2000	988	2000
Eligibility traces for off-policy policy evaluation	D Precup, R Sutton, S Singh	Computer Science Department Faculty Publication Series, 80, 2000	920	2000
Graphical models for game theory	M Kearns, ML Littman, S Singh	arXiv preprint arXiv:1301.2281, 2013	811	2013
Predictive representations of state	ML Littman, RS Sutton, S Singh	Advances in neural information processing systems, 1555-1561, 2002	714	2002
Learning without state-estimation in partially observable Markovian decision processes	SP Singh, T Jaakkola, MI Jordan	Machine Learning Proceedings 1994, 284-292, 1994	612	1994
Intrinsically motivated learning of hierarchical collections of skills	AG Barto, S Singh, N Chentanez	Proceedings of the 3rd International Conference on Development and Learning …, 2004	560	2004
Intrinsically motivated reinforcement learning: An evolutionary perspective	S Singh, RL Lewis, AG Barto, J Sorg	IEEE Transactions on Autonomous Mental Development 2 (2), 70-82, 2010	555	2010
Reward is enough	D Silver, S Singh, D Precup, RS Sutton	Artificial Intelligence 299, 103535, 2021	515	2021
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system	S Singh, D Litman, M Kearns, M Walker	Journal of Artificial Intelligence Research 16, 105-133, 2002	513	2002
Transfer of learning by composing solutions of elemental sequential tasks	SP Singh	Machine learning 8, 323-339, 1992	495	1992
Reinforcement Learning with Soft State Aggregation	S Singh, T Jaakkola, M Jordan	Neural Information Processing Systems, 1995	464	1995
Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning	X Guo, S Singh, H Lee, RL Lewis, X Wang	Advances in neural information processing systems 27, 2014	446	2014

Articles 1–20