John Aslanides

Cited by

	All	Since 2019
Citations	3226	3173
h-index	15	14
i10-index	15	15

Citations per year
Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	13	16	87	180	264	534	1188	914

Co-authors

Nat McAleese - OpenAI - Verified email at openai.com
Ian Osband - OpenAI - Verified email at openai.com
Geoffrey Irving - UK AI Safety Institute (AISI) - Verified email at naml.us
H Francis Song - DeepMind - Verified email at google.com
Benjamin Van Roy - Stanford University - Verified email at stanford.edu
Ethan Perez - Anthropic; New York University - Verified email at anthropic.com
Tor Lattimore - DeepMind - Verified email at google.com
David Silver - DeepMind, UCL - Verified email at google.com
Yotam Doron - DeepMind - Verified email at google.com
Richard S. Sutton - Keen, Amii, and University of Alberta - Verified email at richsutton.com
Eren Sezener - DeepMind - Verified email at google.com
Craig M Savage - Professor of Physics, Australian National University - Verified email at anu.edu.au
Jan Leike - OpenAI - Verified email at openai.com
Marcus Hutter - Researcher@DeepMind & Professor at ANU - Verified email at anu.edu.au
Silvia Chiappa - Senior Staff Research Scientist, Google DeepMind; Honorary Professor, UCL - Verified email at google.com
Nando de Freitas - CIFAR & DeepMind - Verified email at google.com
Bobak Shahriari - DeepMind - Verified email at google.com

John Aslanides

DeepMind

Verified email at google.com

Interests: Machine Learning, Artificial Intelligence


Title	Cited by	Year
Scaling Language Models: Methods, Analysis & Insights from Training Gopher. JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021	746	2021
Gemini: a family of highly capable multimodal models. G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	427	2023
Randomized Prior Functions for Deep Reinforcement Learning. I Osband, J Aslanides, A Cassirer. Neural Information Processing Systems 32, 2018	395	2018
Red Teaming Language Models with Language Models. E Perez, S Huang, F Song, T Cai, R Ring, J Aslanides, A Glaese, ... arXiv preprint arXiv:2202.03286, 2022	298	2022
Improving alignment of dialogue agents via targeted human judgements. A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ... arXiv preprint arXiv:2209.14375, 2022	286	2022
Acme: A Research Framework for Distributed Reinforcement Learning. M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ... arXiv preprint arXiv:2006.00979, 2020	229	2020
When to use parametric models in reinforcement learning? H van Hasselt, M Hessel, J Aslanides. Neural Information Processing Systems 33, 2019	196	2019
Behaviour Suite for Reinforcement Learning. I Osband, Y Doron, M Hessel, J Aslanides, E Sezener, A Saraiva, ... International Conference on Learning Representations 8, 2020	174	2020
Teaching language models to support answers with verified quotes. J Menick, M Trebacz, V Mikulik, J Aslanides, F Song, M Chadwick, ... arXiv preprint arXiv:2203.11147, 2022	137	2022
Fine-tuning language models to find agreement among humans with diverse preferences. M Bakker, M Chadwick, H Sheahan, M Tessler, L Campbell-Gillingham, ... Advances in Neural Information Processing Systems 35, 38176-38189, 2022	96	2022
Relativity concept inventory: Development, analysis, and results. JS Aslanides, CM Savage. Physical Review Special Topics-Physics Education Research 9 (1), 010118, 2013	78	2013
A general approach to fairness with optimal transport. S Chiappa, R Jiang, T Stepleton, A Pacchiano, H Jiang, J Aslanides. AAAI, 2020	67*	2020
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning. G Parascandolo, L Buesing, J Merel, L Hasenclever, J Aslanides, ... arXiv preprint arXiv:2004.11410, 2020	30	2020
Universal Reinforcement Learning Algorithms: Survey and Experiments. J Aslanides, J Leike, M Hutter. International Joint Conference on Artificial Intelligence 26, 1403-1410, 2017	25	2017
TF-Replicator: Distributed Machine Learning for Researchers. P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ... arXiv preprint arXiv:1902.00465, 2019	24	2019
Fine-Tuning Language Models via Epistemic Neural Networks. I Osband, SM Asghari, B Van Roy, N McAleese, J Aslanides, G Irving. arXiv preprint arXiv:2211.01568, 2022	7	2022
AIXIjs: A software demo for general reinforcement learning. J Aslanides. arXiv preprint arXiv:1705.07615, 2017	6	2017
Generalised discount functions applied to a Monte-Carlo AImu implementation. S Lamont, J Aslanides, J Leike, M Hutter. Autonomous Agents and Multiagent Systems, 2017, 2017	5	2017
