Jakub Grudzien Kuba - Google 学术搜索

创建我的个人资料

引用次数

	总计	2019 年至今
引用	778	778
h 指数	8	8
i10 指数	8	8

0

460

230

115

345

20212022202320248 72 252 442

开放获取的出版物数量

5 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Yaodong YangBOYA (博雅) Assistant Professor at Peking University在 pku.edu.cn 的电子邮件经过验证
Jun WangProfessor, Computer Science, University College London在 cs.ucl.ac.uk 的电子邮件经过验证
Jakob FoersterAssociate Professor, University of Oxford在 eng.ox.ac.uk 的电子邮件经过验证
Linghui MengInstitute of Automation, Chinese Academy of Sciences, China在 ia.ac.cn 的电子邮件经过验证
Michał GrudzieńUndergraduate student at The university of Oxford在 worc.ox.ac.uk 的电子邮件经过验证

Jakub Grudzien Kuba

Jakub Grudzien Kuba

在 berkeley.edu 的电子邮件经过验证 - 首页

Reinforcement Learning Multi-Agent Reinforcement Learning Meta Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Trust region policy optimisation in multi-agent reinforcement learning JG Kuba, R Chen, M Wen, Y Wen, F Sun, J Wang, Y Yang International Conference on Learning Representations 2022, 2021	226	2021
Multi-agent reinforcement learning is a sequence modeling problem M Wen, J Kuba, R Lin, W Zhang, Y Wen, J Wang, Y Yang Advances in Neural Information Processing Systems 35, 16509-16521, 2022	167	2022
Safe multi-agent reinforcement learning for multi-robot control S Gu, JG Kuba, Y Chen, Y Du, L Yang, A Knoll, Y Yang Artificial Intelligence 319, 103905, 2023	95*	2023
Idql: Implicit q-learning as an actor-critic method with diffusion policies P Hansen-Estruch, I Kostrikov, M Janner, JG Kuba, S Levine arXiv preprint arXiv:2304.10573, 2023	91	2023
Discovered policy optimisation C Lu, J Kuba, A Letcher, L Metz, C Schroeder de Witt, J Foerster Advances in Neural Information Processing Systems 35, 16455-16468, 2022	66	2022
Settling the variance of multi-agent policy gradients JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang Advances in Neural Information Processing Systems 34, 13458-13470, 2021	58	2021
Heterogeneous-agent mirror learning: A continuum of solutions to cooperative marl JG Kuba, X Feng, S Ding, H Dong, J Wang, Y Yang arXiv preprint arXiv:2208.01682, 2022	41*	2022
Mirror learning: A unifying framework of policy optimisation J Grudzien, CAS De Witt, J Foerster International Conference on Machine Learning, 7825-7844, 2022	23*	2022
Understanding value decomposition algorithms in deep cooperative multi-agent reinforcement learning Z Dou, JG Kuba, Y Yang arXiv preprint arXiv:2202.04868, 2022	8	2022
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization K Grudzien, M Uehara, S Levine, P Abbeel International Conference on Artificial Intelligence and Statistics, 2449-2457, 2024	3	2024
Cliqueformer: Model-Based Optimization with Structured Transformers JG Kuba, P Abbeel, S Levine arXiv preprint arXiv:2410.13106, 2024		2024
Advantage-Conditioned Diffusion: Offline RL via Generalization JG Kuba, P Abbeel, S Levine

系统目前无法执行此操作，请稍后再试。

文章 1–12