Arthur Guez
Arthur Guez
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
nature 529 (7587), 484-489, 2016
111672016
Mastering the game of go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
nature 550 (7676), 354-359, 2017
60462017
Deep reinforcement learning with double q-learning
H Van Hasselt, A Guez, D Silver
Proceedings of the AAAI conference on artificial intelligence 30 (1), 2016
38622016
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
17032018
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
11252017
Mastering atari, go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
Nature 588 (7839), 604-609, 2020
4712020
Imagination-augmented agents for deep reinforcement learning
S Racanière, T Weber, DP Reichert, L Buesing, A Guez, D Rezende, ...
Proceedings of the 31st International Conference on Neural Information …, 2017
2532017
The predictron: End-to-end learning and planning
D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
International Conference on Machine Learning, 3191-3199, 2017
2142017
Imagination-augmented agents for deep reinforcement learning
T Weber, S Racanière, DP Reichert, L Buesing, A Guez, DJ Rezende, ...
arXiv preprint arXiv:1707.06203, 2017
1992017
Efficient Bayes-adaptive reinforcement learning using sample-based search
A Guez, D Silver, P Dayan
arXiv preprint arXiv:1205.3109, 2012
1312012
Learning values across many orders of magnitude
HP van Hasselt, A Guez, M Hessel, V Mnih, D Silver
Advances In Neural Information Processing Systems, 4287-4295, 2016
1252016
Increasing the action gap: New operators for reinforcement learning
MG Bellemare, G Ostrovski, A Guez, P Thomas, R Munos
Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016
1092016
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning.
A Guez, RD Vincent, M Avoli, J Pineau
AAAI, 1671-1678, 2008
962008
Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search
A Guez, D Silver, P Dayan
Journal of Artificial Intelligence Research 48, 841-883, 2013
702013
Woulda, coulda, shoulda: Counterfactually-guided policy search
L Buesing, T Weber, Y Zwols, S Racaniere, A Guez, JB Lespiau, N Heess
arXiv preprint arXiv:1811.06272, 2018
662018
Learning to search with mctsnets
A Guez, T Weber, I Antonoglou, K Simonyan, O Vinyals, D Wierstra, ...
International conference on machine learning, 1822-1831, 2018
632018
Treating epilepsy via adaptive neurostimulation: a reinforcement learning approach
J Pineau, A Guez, R Vincent, G Panuccio, M Avoli
International journal of neural systems 19 (04), 227-240, 2009
612009
An investigation of model-free planning
A Guez, M Mirza, K Gregor, R Kabra, S Racanière, T Weber, D Raposo, ...
International Conference on Machine Learning, 2464-2473, 2019
462019
Adaptive control of epileptiform excitability in an in vitro model of limbic seizures
G Panuccio, A Guez, R Vincent, M Avoli, J Pineau
Experimental neurology 241, 179-183, 2013
282013
Bayes-adaptive simulation-based search with value function approximation
A Guez, N Heess, D Silver, P Dayan
Advances in Neural Information Processing Systems, 451-459, 2014
212014
The system can't perform the operation now. Try again later.
Articles 1–20