Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... nature 529 (7587), 484-489, 2016 | 18630 | 2016 |
Mastering the game of go without human knowledge D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ... nature 550 (7676), 354-359, 2017 | 10581 | 2017 |
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... Science 362 (6419), 1140-1144, 2018 | 4258 | 2018 |
Mastering atari, go, chess and shogi by planning with a learned model J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ... Nature 588 (7839), 604-609, 2020 | 2192 | 2020 |
Mastering chess and shogi by self-play with a general reinforcement learning algorithm D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... arXiv preprint arXiv:1712.01815, 2017 | 2151 | 2017 |
Starcraft ii: A new challenge for reinforcement learning O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ... arXiv preprint arXiv:1708.04782, 2017 | 1025 | 2017 |
Competition-level code generation with alphacode Y Li, D Choi, J Chung, N Kushman, J Schrittwieser, R Leblond, T Eccles, ... Science 378 (6624), 1092-1097, 2022 | 646 | 2022 |
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016 | 588 | 2016 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 548 | 2023 |
Discovering faster matrix multiplication algorithms with reinforcement learning A Fawzi, M Balog, A Huang, T Hubert, B Romera-Paredes, M Barekatain, ... Nature 610 (7930), 47-53, 2022 | 446 | 2022 |
OpenSpiel: A framework for reinforcement learning in games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019 | 241 | 2019 |
Cyprien de Masson d’Autume, Igor Babuschkin, Xinyun Chen, Po-Sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J Y Li, D Choi, J Chung, N Kushman, J Schrittwieser, R Leblond, T Eccles, ... Science 378 (6624), 1092-1097, 2022 | 202 | 2022 |
loannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... Mastering the game of go with deep neural networks and tree search. nature …, 2016 | 181 | 2016 |
Bayesian optimization in alphago Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ... arXiv preprint arXiv:1812.06855, 2018 | 147 | 2018 |
& Hassabis, D.(2016). Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche Nature 529 (7587), 484-489, 0 | 114 | |
Online and offline reinforcement learning by planning with a learned model J Schrittwieser, T Hubert, A Mandhane, M Barekatain, I Antonoglou, ... Advances in Neural Information Processing Systems 34, 27580-27591, 2021 | 102 | 2021 |
Adrian Bolton και others D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ... Mastering the game of go without human knowledge. nature 550 (7676), 354-359, 2017 | 90 | 2017 |
Faster sorting algorithms discovered using deep reinforcement learning DJ Mankowitz, A Michi, A Zhernov, M Gelmi, M Selvi, C Paduraru, ... Nature 618 (7964), 257-263, 2023 | 86 | 2023 |
Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv 2017 D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... arXiv preprint arXiv:1712.01815, 2017 | 86 | 2017 |
Learning and planning in complex action spaces T Hubert, J Schrittwieser, I Antonoglou, M Barekatain, S Schmitt, D Silver International Conference on Machine Learning, 4476-4486, 2021 | 62 | 2021 |