Shimon Whiteson
Shimon Whiteson
Professor of Computer Science, University of Oxford
Verified email at cs.ox.ac.uk - Homepage
TitleCited byYear
Learning to communicate with deep multi-agent reinforcement learning
J Foerster, IA Assael, N de Freitas, S Whiteson
Advances in Neural Information Processing Systems, 2137-2145, 2016
2932016
Evolutionary Function Approximation for Reinforcement Learning
S Whiteson, P Stone
Journal of Machine Learning Research 7, 877-917, 2006
2822006
A survey of multi-objective sequential decision-making
DM Roijers, P Vamplew, S Whiteson, R Dazeley
Journal of Artificial Intelligence Research 48, 67-113, 2014
2182014
Counterfactual multi-agent policy gradients
JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2002018
Lipnet: End-to-end sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599, 2016
1522016
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1372017
Multiagent reinforcement learning for urban traffic control using coordination graphs
L Kuyer, S Whiteson, B Bakker, N Vlassis
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
1232008
Automatic feature selection in neuroevolution
S Whiteson, P Stone, KO Stanley, R Miikkulainen, N Kohl
Proceedings of the 7th annual conference on Genetic and evolutionary …, 2005
1122005
Transfer via inter-task mappings in policy search reinforcement learning
ME Taylor, S Whiteson, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
1112007
Evolving soccer keepaway players through task decomposition
S Whiteson, N Kohl, R Miikkulainen, P Stone
Machine Learning 59 (1-2), 5-30, 2005
1092005
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
ME Taylor, S Whiteson, P Stone
Proceedings of the 8th annual conference on Genetic and evolutionary …, 2006
1082006
Learning with opponent-learning awareness
J Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch
Proceedings of the 17th International Conference on Autonomous Agents and …, 2018
1072018
Exploiting locality of interaction in factored Dec-POMDPs
FA Oliehoek, MTJ Spaan, S Whiteson, N Vlassis
Proceedings of the 7th international joint conference on Autonomous agents …, 2008
1012008
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
K Hofmann, S Whiteson, M de Rijke
Information Retrieval 16 (1), 63-90, 2013
992013
A probabilistic method for inferring preferences from clicks
K Hofmann, S Whiteson, M De Rijke
Proceedings of the 20th ACM international conference on Information and …, 2011
972011
Adaptive tile coding for value function approximation
S Whiteson, ME Taylor, P Stone
Computer Science Department, University of Texas at Austin, 2007
932007
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2009
852009
Reusing historical interaction data for faster online learning to rank for IR
K Hofmann, A Schuth, S Whiteson, M de Rijke
Proceedings of the sixth ACM international conference on Web search and data …, 2013
682013
Measurement of the top-quark mass with dilepton events selected using neuroevolution at CDF
T Aaltonen, J Adelman, T Akimoto, MG Albrow, BÁ González, S Amerio, ...
Physical review letters 102 (15), 152001, 2009
672009
Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem
M Zoghi, S Whiteson, R Munos, M de Rijke
ICML 2014: Proceedings of the Thirty-First International Conference on …, 2014
662014
The system can't perform the operation now. Try again later.
Articles 1–20