Rémi Munos

引用次数

	总计	2019 年至今
引用	39209	30501
h 指数	87	77
i10 指数	191	156

8000

4000

2000

6000

200720082009201020112012201320142015201620172018201920202021202220232024168 242 225 349 465 537 593 763 788 897 1121 2008 3020 4107 5757 7005 7886 2706

开放获取的出版物数量

查看全部

20 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind在 meta.com 的电子邮件经过验证
Mohammad Gheshlaghi AzarCohere在 google.com 的电子邮件经过验证
Marc G. BellemareGoogle Brain在 google.com 的电子邮件经过验证
Csaba SzepesvariDeepMind & University of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence Research在 inria.fr 的电子邮件经过验证
koray kavukcuogluDeepMind在 kavukcuoglu.org 的电子邮件经过验证
Odalric-Ambrym MaillardInria Lille - Nord Europe在 inria.fr 的电子邮件经过验证
Sebastien BubeckVP GenAI Research, Microsoft AI在 microsoft.com 的电子邮件经过验证
Andrew MooreDean, School of Computer Science, Carnegie Mellon在 cs.cmu.edu 的电子邮件经过验证
Anna HarutyunyanDeepMind在 google.com 的电子邮件经过验证
Marc LanctotResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Tom SchaulSenior Staff Scientist, DeepMind在 nyu.edu 的电子邮件经过验证
András AntosBudapest University of Technology and Economics在 cs.bme.hu 的电子邮件经过验证
Volodymyr MnihDeepMind在 cs.toronto.edu 的电子邮件经过验证
Hilbert Johan KappenRadboud University在 science.ru.nl 的电子邮件经过验证
David SilverDeepMind, UCL在 google.com 的电子邮件经过验证
Lucian BusoniuProfessor and Group Lead, Automation Department, Technical University of Cluj-Napoca在 aut.utcluj.ro 的电子邮件经过验证
Andre BarretoResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Olivier Teytaudfacebook在 fb.com 的电子邮件经过验证
Sylvain GellyGoogle Brain Zurich在 m4x.org 的电子邮件经过验证

关注

Rémi Munos

DeepMind

在 inria.fr 的电子邮件经过验证 - 首页

Reinforcement learning deep learning bandit theory statistical learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Bootstrap your own latent-a new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Advances in neural information processing systems 33, 21271-21284, 2020	5867	2020
A distributional perspective on reinforcement learning MG Bellemare, W Dabney, R Munos International conference on machine learning, 449-458, 2017	1652	2017
Unifying count-based exploration and intrinsic motivation M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos Advances in neural information processing systems 29, 2016	1614	2016
Impala: Scalable distributed deep-rl with importance weighted actor-learner architectures L Espeholt, H Soyer, R Munos, K Simonyan, V Mnih, T Ward, Y Doron, ... International conference on machine learning, 1407-1416, 2018	1524	2018
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	997	2016
Sample efficient actor-critic with experience replay Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ... arXiv preprint arXiv:1611.01224, 2016	958	2016
Best arm identification in multi-armed bandits JY Audibert, S Bubeck COLT-23th Conference on learning theory-2010, 13 p., 2010	903	2010
Minimax regret bounds for reinforcement learning MG Azar, I Osband, R Munos International conference on machine learning, 263-272, 2017	779	2017
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits JY Audibert, R Munos, C Szepesvári Theoretical Computer Science 410 (19), 1876-1902, 2009	761	2009
Thompson sampling: An asymptotically optimal finite-time analysis E Kaufmann, N Korda, R Munos International conference on algorithmic learning theory, 199-213, 2012	758	2012
Distributional reinforcement learning with quantile regression W Dabney, M Rowland, M Bellemare, R Munos Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	746	2018
Count-based exploration with neural density models G Ostrovski, MG Bellemare, A Oord, R Munos International conference on machine learning, 2721-2730, 2017	690	2017
Safe and efficient off-policy reinforcement learning R Munos, T Stepleton, A Harutyunyan, M Bellemare Advances in neural information processing systems 29, 2016	683	2016
Finite-Time Bounds for Fitted Value Iteration. R Munos, C Szepesvári Journal of Machine Learning Research 9 (5), 2008	612	2008
Automated curriculum learning for neural networks A Graves, MG Bellemare, J Menick, R Munos, K Kavukcuoglu international conference on machine learning, 1311-1320, 2017	587	2017
Pure exploration in multi-armed bandits problems S Bubeck, R Munos, G Stoltz Algorithmic Learning Theory: 20th International Conference, ALT 2009, Porto …, 2009	585	2009
Successor features for transfer in reinforcement learning A Barreto, W Dabney, R Munos, JJ Hunt, T Schaul, HP van Hasselt, ... Advances in neural information processing systems 30, 2017	582	2017
Implicit quantile networks for distributional reinforcement learning W Dabney, G Ostrovski, D Silver, R Munos International conference on machine learning, 1096-1105, 2018	547	2018
Modiﬁcation of UCT with patterns in Monte-Carlo Go S Gelly, Y Wang, R Munos, O Teytaud INRIA, 2006	539	2006
Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney International conference on learning representations, 2018	511	2018

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

关注

引用次数

合著作者