Joel Z Leibo

引用次数

	总计	2019 年至今
引用	13452	11524
h 指数	41	36
i10 指数	66	56

2700

1350

675

2025

20132014201520162017201820192020202120222023202462 84 92 155 435 890 1300 1743 2130 2262 2696 1373

开放获取的出版物数量

查看全部

10 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCL在 ucl.ac.uk 的电子邮件经过验证
TOMASO POGGIOMcDermott Professor in Brain Sciences, MIT在 ai.mit.edu 的电子邮件经过验证
Edward HughesStaff Research Engineer, DeepMind在 google.com 的电子邮件经过验证
Marc LanctotResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Edgar A. Duéñez-GuzmánGoogle DeepMind在 oeb.harvard.edu 的电子邮件经过验证
Karl TuylsFounder at H company, ex-Google DeepMind, Prof at University of Liverpool在 hcompany.ai 的电子邮件经过验证
Wojciech Marian Czarnecki.在 google.com 的电子邮件经过验证
Matthew BotvinickGoogle DeepMind, Yale Law School, University College London在 google.com 的电子邮件经过验证
Charlie BeattieSoftware Engineer, DeepMind在 google.com 的电子邮件经过验证
Peter SunehagGoogle - DeepMind在 google.com 的电子邮件经过验证
Tom SchaulSenior Staff Scientist, DeepMind在 nyu.edu 的电子邮件经过验证
Kevin R. McKeeStaff Research Scientist, Google DeepMind在 deepmind.com 的电子邮件经过验证
Raphael KösterGoogle DeepMind在 google.com 的电子邮件经过验证
Audrūnas Gruslys在 gruslys.com 的电子邮件经过验证
Jane X. WangStaff Research Scientist, DeepMind在 google.com 的电子邮件经过验证
Max JaderbergChief AI Scientist, Isomorphic Labs在 robots.ox.ac.uk 的电子邮件经过验证
Fabio AnselmiAssistant professor at University of Trieste, MIT affiliate在 units.it 的电子邮件经过验证
Vinicius ZambaldiGoogle Deepmind在 google.com 的电子邮件经过验证
Dharshan KumaranGoogle DeepMind在 fil.ion.ucl.ac.uk 的电子邮件经过验证
Zeb Kurth-NelsonDeepMind, UCL在 google.com 的电子邮件经过验证

关注

Joel Z Leibo

Research scientist

在 google.com 的电子邮件经过验证 - 首页

Cooperation in AI & Neuroscience Multi-Agent Reinforcement Learning Machine Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017	1703	2017
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1389*	2018
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1388	2016
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1065	2016
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019	945	2019
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017	883	2017
Prefrontal cortex as a meta-reinforcement learning system JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ... Nature neuroscience 21 (6), 860-868, 2018	634	2018
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016	601	2016
Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019	525	2019
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016	296	2016
The dynamics of invariant object recognition in the human visual system L Isik, EM Meyers, JZ Leibo, T Poggio Journal of neurophysiology 111 (1), 91-102, 2014	279	2014
Using fast weights to attend to the recent past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances in neural information processing systems 29, 2016	268	2016
Inequity aversion improves cooperation in intertemporal social dilemmas E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ... Advances in neural information processing systems 31, 2018	246	2018
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in neural information processing systems 30, 2017	217	2017
Open problems in cooperative ai A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ... arXiv preprint arXiv:2012.08630, 2020	197	2020
Unsupervised predictive memory in a goal-directed agent G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ... arXiv preprint arXiv:1803.10760, 2018	196	2018
Emergent communication through negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	184	2018
How important is weight symmetry in backpropagation? Q Liao, J Leibo, T Poggio Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	177	2016
Unsupervised learning of invariant representations F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio Theoretical Computer Science 633, 112-121, 2016	144	2016
Kickstarting deep reinforcement learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... arXiv preprint arXiv:1803.03835, 2018	143	2018

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

关注

引用次数

合著作者