Nathan Lambert

Cited by

	All	Since 2019
Citations	2037	2030
h-index	22	22
i10-index	30	30

Citations per year: 2019: 16, 2020: 44, 2021: 121, 2022: 200, 2023: 532, 2024: 1109

Public access

2 articles available
0 articles not available

Based on funding mandates

Co-authors

Roberto Calandra, Professor, TU Dresden, Centre for Tactile Internet with Human-in-the-Loop (CeTI). Verified email at tu-dresden.de
Kristofer PISTER, UC Berkeley. Verified email at berkeley.edu
Tom Zick, Harvard. Verified email at berkeley.edu
Daniel S. Drew, University of Utah. Verified email at utah.edu
Thomas Krendl Gilbert, New York Academy of Sciences. Verified email at nyas.org
Brandon Amos, Meta. Verified email at fb.com
Sarah Dean, Cornell. Verified email at cornell.edu
Luis Pineda, Research Engineer, Facebook AI Research. Verified email at fb.com
Craig B. Schindler, University of California, Berkeley. Verified email at berkeley.edu
Lydia Lee, Sandia National Laboratories. Verified email at sandia.gov

Nathan Lambert

Research Scientist, Allen AI

Verified email at allenai.org

Reinforcement Learning, Machine Learning, Robotics, Responsible AI


Title	Cited by	Year
[Github] Diffusers: State-of-the-art diffusion models. P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ... https://github.com/huggingface/diffusers, 2022	292*	2022
Zephyr: Direct distillation of lm alignment. L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	245	2023
Open LLM Leaderboard. E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... URL https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023	196	2023
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning. N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister. IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019	171	2019
[Github] Trl: Transformer reinforcement learning. L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert. https://github.com/lvwerra/trl, 2020	112*	2020
On the importance of hyperparameter optimization for model-based reinforcement learning. B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ... International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021	107	2021
Objective Mismatch in Model-based Reinforcement Learning. N Lambert, B Amos, O Yadan, R Calandra. Learning for Dynamics and Control (L4DC), 2020	94	2020
Camels in a changing climate: Enhancing lm adaptation with tulu 2. H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ... arXiv preprint arXiv:2311.10702, 2023	88	2023
[Blog] Illustrating reinforcement learning from human feedback (RLHF). N Lambert, L Castricato, L von Werra, A Havrilla. https://hf.co/blog/rlhf, 2022	87*	2022
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts. D Drew, N Lambert, C Schindler, K Pister. IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018	76	2018
Learning generalizable locomotion skills with hierarchical reinforcement learning. T Li, N Lambert, R Calandra, F Meier, A Rai. IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020	48	2020
Mbrl-lib: A modular library for model-based reinforcement learning. L Pineda, B Amos, A Zhang, NO Lambert, R Calandra. arXiv preprint arXiv:2104.10159, 2021	45	2021
Dolma: An open corpus of three trillion tokens for language model pretraining research. L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... arXiv preprint arXiv:2402.00159, 2024	38	2024
The challenges of exploration for offline reinforcement learning. N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ... arXiv preprint arXiv:2201.11861, 2022	38	2022
Olmo: Accelerating the science of language models. D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... arXiv preprint arXiv:2402.00838, 2024	35	2024
Reward reports for reinforcement learning. TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta. Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023	34	2023
Rewardbench: Evaluating reward models for language modeling. N Lambert, V Pyatkin, J Morrison, LJ Miranda, BY Lin, K Chandu, N Dziri, ... arXiv preprint arXiv:2403.13787, 2024	32	2024
The alignment handbook. L Tunstall, E Beeching, N Lambert, N Rajani, S Huang, K Rasul, AM Rush, ...	27	2023
A survey on data selection for language models. A Albalak, Y Elazar, SM Xie, S Longpre, N Lambert, X Wang, ... arXiv preprint arXiv:2402.16827, 2024	24	2024
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning. N Lambert, A Wilcox, H Zhang, K Pister, R Calandra. IEEE Conference on Decision and Control (CDC), 2880-2887, 2021	24	2021
