Nathan Lambert
Research Scientist, Allen AI
Verified email at allenai.org
Title
Cited by
Year
[Github] Diffusers: State-of-the-art diffusion models
P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ...
https://github.com/huggingface/diffusers, 2022
Cited by 207* (2022)
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning
N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister
IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019
Cited by 162 (2019)
Zephyr: Direct distillation of lm alignment
L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ...
arXiv preprint arXiv:2310.16944, 2023
Cited by 130 (2023)
Open LLM Leaderboard
E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ...
URL https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023
Cited by 105 (2023)
On the importance of hyperparameter optimization for model-based reinforcement learning
B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ...
International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021
Cited by 100 (2021)
Objective Mismatch in Model-based Reinforcement Learning
N Lambert, B Amos, O Yadan, R Calandra
Learning for Dynamics and Control (L4DC), 2020
Cited by 86 (2020)
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts
D Drew, N Lambert, C Schindler, K Pister
IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018
Cited by 73 (2018)
[Blog] Illustrating reinforcement learning from human feedback (RLHF)
N Lambert, L Castricato, L von Werra, A Havrilla
https://hf.co/blog/rlhf, 2022
Cited by 68* (2022)
[Github] Trl: Transformer reinforcement learning
L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert
https://github.com/lvwerra/trl, 2020
Cited by 55* (2020)
Mbrl-lib: A modular library for model-based reinforcement learning
L Pineda, B Amos, A Zhang, NO Lambert, R Calandra
arXiv preprint arXiv:2104.10159, 2021
Cited by 47 (2021)
Learning generalizable locomotion skills with hierarchical reinforcement learning
T Li, N Lambert, R Calandra, F Meier, A Rai
IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020
Cited by 43 (2020)
Camels in a changing climate: Enhancing lm adaptation with tulu 2
H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ...
arXiv preprint arXiv:2311.10702, 2023
Cited by 37 (2023)
The challenges of exploration for offline reinforcement learning
N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ...
arXiv preprint arXiv:2201.11861, 2022
Cited by 36 (2022)
Reward reports for reinforcement learning
TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023
Cited by 28 (2023)
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning
N Lambert, A Wilcox, H Zhang, K Pister, R Calandra
IEEE Conference on Decision and Control (CDC), 2880-2887, 2021
Cited by 24 (2021)
Investigating compounding prediction errors in learned dynamics models
N Lambert, K Pister, R Calandra
arXiv preprint arXiv:2203.09637, 2022
Cited by 16 (2022)
Stackllama: An rl fine-tuned llama model for stack exchange question and answering
E Beeching, Y Belkada, K Rasul, L Tunstall, L von Werra, N Rajani, ...
URL https://huggingface.co/blog/stackllama, 2023
Cited by 14 (2023)
[HuggingFace] H4 Stack Exchange Preference Dataset
N Lambert, NR Lewis Tunstall, T Thrush
https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences, 2023
Cited by 13* (2023)
[Blog] Stable Diffusion with 🧨 Diffusers
S Patil, P Cuenca, N Lambert, P von Platen
Hugging Face–The AI community building the future. https://huggingface.co …, 2022
Cited by 13* (2022)
Enhanced lithium niobate pyroelectric ionizer for chip-scale ion mobility-based gas sensing
KB Vinayakumar, V Gund, N Lambert, S Lodha, A Lal
IEEE SENSORS, 1-3, 2016
Cited by 13 (2016)