关注
Noah Y. Siegel
Noah Y. Siegel
DeepMind
在 google.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning
NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ...
arXiv preprint arXiv:2002.08396, 2020
2742020
Critic regularized regression
Z Wang, A Novikov, K Zolna, JS Merel, JT Springenberg, SE Reed, ...
Advances in Neural Information Processing Systems 33, 7768-7778, 2020
2732020
Figureseer: Parsing result-figures in research papers
N Siegel, Z Horvitz, R Levin, S Divvala, A Farhadi
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
1812016
Extracting scientific figures with distantly supervised neural networks
N Siegel, N Lourie, R Power, W Ammar
Proceedings of the 18th ACM/IEEE on joint conference on digital libraries …, 2018
1382018
From motor control to team play in simulated humanoid football
S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ...
Science Robotics 7 (69), eabo0235, 2022
1012022
Solving math word problems with process-and outcome-based feedback
J Uesato, N Kushman, R Kumar, F Song, N Siegel, L Wang, A Creswell, ...
arXiv preprint arXiv:2211.14275, 2022
722022
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models
A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ...
Conference on Robot Learning, 566-589, 2020
422020
Data-efficient hindsight off-policy option learning
M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ...
International Conference on Machine Learning, 11340-11350, 2021
402021
Learning agile soccer skills for a bipedal robot with deep reinforcement learning
T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, M Wulfmeier, ...
arXiv preprint arXiv:2304.13653, 2023
362023
Compositional transfer in hierarchical reinforcement learning
M Wulfmeier, A Abdolmaleki, R Hafner, JT Springenberg, M Neunert, ...
arXiv preprint arXiv:1906.11228, 2019
332019
Imitate and repurpose: Learning reusable robot movement skills from human and animal behaviors
S Bohez, S Tunyasuvunakool, P Brakel, F Sadeghi, L Hasenclever, ...
arXiv preprint arXiv:2203.17138, 2022
312022
Regularized hierarchical policies for compositional transfer in robotics
M Wulfmeier, A Abdolmaleki, R Hafner, JT Springenberg, M Neunert, ...
arXiv preprint arXiv:1906.11228, 2019
272019
Towards real robot learning in the wild: A case study in bipedal locomotion
M Bloesch, J Humplik, V Patraucean, R Hafner, T Haarnoja, A Byravan, ...
Conference on Robot Learning, 1502-1511, 2022
212022
Simple sensor intentions for exploration
T Hertweck, M Riedmiller, M Bloesch, JT Springenberg, N Siegel, ...
arXiv preprint arXiv:2005.07541, 2020
52020
Solving math word problems with process-based and outcome-based feedback
J Uesato, N Kushman, R Kumar, HF Song, NY Siegel, L Wang, A Creswell, ...
32022
Understanding charts in research papers: A learning approach
N Siegel
Technical report, 2015
22015
Learning agile soccer skills for a bipedal robot with deep reinforcement learning
T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, J Humplik, ...
Science Robotics 9 (89), eadi8022, 2024
2024
The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models
NY Siegel, OM Camburu, N Heess, M Perez-Ortiz
arXiv preprint arXiv:2404.03189, 2024
2024
Challenging systematic prejudices. An investigation into bias against women and girls in large language models.
D van Niekerk, M Pérez-Ortiz, J Shawe-Taylor, D Orlič, I Drobnjak, J Kay, ...
Education Journal Review 30 (1), 2024
2024
Challenging Systematic Prejudices: An Investigation into Bias Against Women and Girls
D Van Niekerk, M Peréz-Ortiz, J Shawe-Taylor, D Orlic, J Kay, N Siegel, ...
UNESCO, IRCAI, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20