‪Eric Hambro‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	8725	8716
h-index	9	9
i10-index	8	8

0

6000

3000

1500

4500

20222023202472 3533 5069

Co-authors

Roberta RaileanuResearch Scientist, MetaVerified email at fb.com
Heinrich KüttlerxAIVerified email at math.lmu.de
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindVerified email at cs.ucl.ac.uk
Sharath Chandra RaparthyMeta AIVerified email at mila.quebec
Mikayel SamvelyanMeta AI, UCLVerified email at meta.com

Eric Hambro

Eric Hambro

Anthropic

Verified email at anthropic.com - Homepage

Machine Learning Reinforcement Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
LLaMA: Open and efficient foundation language models H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ... arXiv preprint arXiv:2302.13971, 2023	7576	2023
Toolformer: Language models can teach themselves to use tools T Schick, J Dwivedi-Yu, R Dessì, R Raileanu, M Lomeli, E Hambro, ... Advances in Neural Information Processing Systems 36, 2024	937	2024
Minihack the planet: A sandbox for open-ended reinforcement learning research M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... NeurIPS 2021 Datasets and Benchmarks, 2021	77	2021
Understanding the effects of rlhf on llm generalisation and diversity R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... arXiv preprint arXiv:2310.06452, 2023	40	2023
GPflux: A library for deep Gaussian processes V Dutordoir, H Salimbeni, E Hambro, J McLeod, F Leibfried, A Artemev, ... arXiv preprint arXiv:2104.05674, 2021	27	2021
Insights from the Neurips 2021 Nethack Challenge E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	18	2022
Rainbow teaming: Open-ended generation of diverse adversarial prompts M Samvelyan, SC Raparthy, A Lupu, E Hambro, AH Markosyan, M Bhatt, ... arXiv preprint arXiv:2402.16822, 2024	13	2024
Dungeons and Data: A Large-Scale NetHack Dataset E Hambro, R Raileanu, D Rothermel, V Mella, T Rocktäschel, H Küttler, ... Advances in Neural Information Processing Systems 35, 24864-24878, 2022	12	2022
Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024	9	2024
moolib: A Platform for Distributed RL. 2022 V Mella, E Hambro, D Rothermel, H Küttler URL https://github. com/facebookresearch/moolib 8, 18, 2022	7*	2022
Generalization to new sequential decision making tasks with in-context learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu arXiv preprint arXiv:2312.03801, 2023	5	2023
Glore: When, where, and how to improve llm reasoning via global and local refinements A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ... arXiv preprint arXiv:2402.10963, 2024	4	2024
Know When To Stop: A Study of Semantic Drift in Text Generation A Spataru, E Hambro, E Voita, N Cancedda arXiv preprint arXiv:2404.05411, 2024		2024
Learning to Solve New sequential decision-making Tasks with In-Context Learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu NeurIPS 2023 Foundation Models for Decision Making Workshop, 0

The system can't perform the operation now. Try again later.

Articles 1–14