Nathan Lambert

Cytowane przez

	Wszystkie	Od 2019
Cytowania	1377	1371
h-indeks	16	16
i10-indeks	24	24

560

280

140

420

20182019202020212022202320245 16 43 120 201 549 437

Dostęp publiczny

Wyświetl wszystko

2 artykuły

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Współautorzy

Roberto CalandraProfessor, TU Dresden, Centre for Tactile Internet with Human-in-the-Loop (CeTI)Zweryfikowany adres z tu-dresden.de
Kristofer PISTERUC BerkeleyZweryfikowany adres z berkeley.edu
Daniel S. DrewUniversity of UtahZweryfikowany adres z utah.edu
Tom ZickHarvardZweryfikowany adres z berkeley.edu
Thomas Krendl GilbertNew York Academy of SciencesZweryfikowany adres z nyas.org
Brandon AmosMetaZweryfikowany adres z fb.com
Sarah DeanCornellZweryfikowany adres z cornell.edu
Luis PinedaResearch Engineer, Facebook AI ResearchZweryfikowany adres z fb.com
Craig B. SchindlerUniversity of California, BerkeleyZweryfikowany adres z berkeley.edu
Lydia LeeSandia National LaboratoriesZweryfikowany adres z sandia.gov

Obserwuj

Nathan Lambert

Research Scientist, Allen AI

Zweryfikowany adres z allenai.org - Strona główna

Reinforcement Learning Machine Learning Robotics Responsible AI


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
[Github] Diffusers: State-of-the-art diffusion models P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ... https://github.com/huggingface/diffusers, 2022	209*	2022
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019	162	2019
Zephyr: Direct distillation of lm alignment L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	131	2023
Open LLM Leaderboard E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... URL https://huggingface. co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023	105	2023
On the importance of hyperparameter optimization for model-based reinforcement learning B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ... International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021	100	2021
Objective Mismatch in Model-based Reinforcement Learning N Lambert, B Amos, O Yadan, R Calandra Learning for Dynamics and Control (L4DC), 2020	86	2020
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts D Drew, N Lambert, C Schindler, K Pister IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018	73	2018
[Blog] Illustrating reinforcement learning from human feedback (RLHF) N Lambert, L Castricato, L von Werra, A Havrilla https://hf.co/blog/rlhf, 2022	68*	2022
[Github] Trl: Transformer reinforcement learning L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert https://github.com/lvwerra/trl, 2020	54*	2020
Mbrl-lib: A modular library for model-based reinforcement learning L Pineda, B Amos, A Zhang, NO Lambert, R Calandra arXiv preprint arXiv:2104.10159, 2021	47	2021
Learning generalizable locomotion skills with hierarchical reinforcement learning T Li, N Lambert, R Calandra, F Meier, A Rai IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020	44	2020
Camels in a changing climate: Enhancing lm adaptation with tulu 2 H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ... arXiv preprint arXiv:2311.10702, 2023	39	2023
The challenges of exploration for offline reinforcement learning N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ... arXiv preprint arXiv:2201.11861, 2022	37	2022
Reward reports for reinforcement learning TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023	28	2023
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning N Lambert, A Wilcox, H Zhang, K Pister, R Calandra IEEE Conference on Decision and Control (CDC), 2880-2887, 2021	24	2021
Investigating compounding prediction errors in learned dynamics models N Lambert, K Pister, R Calandra arXiv preprint arXiv:2203.09637, 2022	16	2022
Stackllama: An rl fine-tuned llama model for stack exchange question and answering E Beeching, Y Belkada, K Rasul, L Tunstall, L von Werra, N Rajani, ... URL https://huggingface.co/blog/stackllama, 2023	14	2023
[HuggingFace] H4 Stack Exchange Preference Dataset N Lambert, NR Lewis Tunstall, T Thrush https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences, 2023	13*	2023
[Blog] Stable Diffusion with 🧨 Diffusers S Patil, P Cuenca, N Lambert, P von Platen Hugging Face–The AI community building the future. https://huggingface.co …, 2022	13*	2022
Enhanced lithium niobate pyroelectric ionizer for chip-scale ion mobility-based gas sensing KB Vinayakumar, V Gund, N Lambert, S Lodha, A Lal IEEE SENSORS, 1-3, 2016	13	2016

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez

Współautorzy