Robert Dadashi

Citations

	All	Since 2019
Citations	2270	2270
h-index	15	15
i10-index	16	16

Citations per year

Year	Citations
2019	26
2020	55
2021	151
2022	228
2023	302
2024	1503

Co-authors

Léonard Hussenot	Google DeepMind	Verified email at google.com
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Marc G. Bellemare	Reliant AI, prev. Google Brain, DeepMind	Verified email at reliant.ai
Dale Schuurmans	University of Alberta, Google DeepMind	Verified email at cs.ualberta.ca
Nicolas Le Roux	Microsoft Research, McGill, UdeM	Verified email at le-roux.name
Saurabh Kumar	Stanford	Verified email at stanford.edu


Robert Dadashi

Google DeepMind

Verified email at google.com

Reinforcement Learning


Title	Authors	Venue	Cited by	Year
Gemini: A family of highly capable multimodal models	G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	1042	2023
Gemma: Open Models Based on Gemini Research and Technology	G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...	arXiv preprint arXiv:2403.08295, 2024	254	2024
Acme: A research framework for distributed reinforcement learning	MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...	arXiv preprint arXiv:2006.00979, 2020	239	2020
Primal Wasserstein Imitation Learning	R Dadashi, L Hussenot, M Geist, O Pietquin	International Conference on Learning Representations (ICLR), 2021	130	2021
A Geometric Perspective on Optimal Representations for Reinforcement Learning	M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ...	Neural Information Processing Systems (NeurIPS), 2019	104	2019
Statistics and Samples in Distributional Reinforcement Learning	M Rowland, R Dadashi, S Kumar, R Munos, MG Bellemare, W Dabney	International Conference on Machine Learning (ICML), 2019	95	2019
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning	W Dabney, A Barreto, M Rowland, R Dadashi, J Quan, MG Bellemare, ...	AAAI Conference on Artificial Intelligence, 2021	68	2021
What Matters for Adversarial Imitation Learning?	M Orsini, A Raichuk, L Hussenot, D Vincent, R Dadashi, S Girgin, M Geist, ...	Neural Information Processing Systems (NeurIPS), 2021	67	2021
Offline Reinforcement Learning as Anti-Exploration	S Rezaeifar, R Dadashi, N Vieillard, L Hussenot, O Bachem, O Pietquin, ...	AAAI Conference on Artificial Intelligence, 2022	51	2022
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback	P Roit, J Ferret, L Shani, R Aharoni, G Cideron, R Dadashi, M Geist, ...	Annual Meeting of the Association for Computational Linguistics (ACL), 2023	44	2023
The Value Function Polytope in Reinforcement Learning	R Dadashi, AA Taïga, NL Roux, D Schuurmans, MG Bellemare	International Conference on Machine Learning (ICML), 2019	41	2019
Offline Reinforcement Learning with Pseudometric Learning	R Dadashi, S Rezaeifar, N Vieillard, L Hussenot, O Pietquin, M Geist	International Conference on Machine Learning (ICML), 2021	38	2021
Continuous Control with Action Quantization from Demonstrations	R Dadashi, L Hussenot, D Vincent, S Girgin, A Raichuk, M Geist, ...	International Conference on Machine Learning (ICML), 2022	29	2022
WARM: On the Benefits of Weight Averaged Reward Models	A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ...	arXiv preprint arXiv:2401.12187, 2024	26	2024
Hyperparameter Selection for Imitation Learning	L Hussenot, M Andrychowicz, D Vincent, R Dadashi, A Raichuk, ...	International Conference on Machine Learning (ICML), 2021	17	2021
Show me the Way: Intrinsic Motivation from Demonstrations	L Hussenot, R Dadashi, M Geist, O Pietquin	International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	10	2020
Learning Energy Networks with Generalized Fenchel-Young Losses	M Blondel, F Llinares-López, R Dadashi, L Hussenot, M Geist	Neural Information Processing Systems (NeurIPS), 2022	6	2022
Get Back Here: Robust Imitation by Return-to-Distribution Planning	G Cideron, B Tabanpour, S Curi, S Girgin, L Hussenot, G Dulac-Arnold, ...	arXiv preprint arXiv:2305.01400, 2023	4	2023
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models	A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ...	arXiv preprint arXiv:2404.07839, 2024	2	2024
Generalized policy updates for policy optimization	S Kumar, R Dadashi, Z Ahmed, D Schuurmans, MG Bellemare	NeurIPS 2019 Optimization Foundations for Reinforcement Learning Workshop, 2019	2	2019
