Relational inductive biases, deep learning, and graph networks PW Battaglia, JB Hamrick, V Bapst, A Sanchez-Gonzalez, V Zambaldi, ... arXiv preprint arXiv:1806.01261, 2018 | 992 | 2018 |
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017 | 327 | 2017 |
A unified game-theoretic approach to multiagent reinforcement learning M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ... Advances in neural information processing systems, 4190-4203, 2017 | 256 | 2017 |
Deep reinforcement learning with relational inductive biases V Zambaldi, D Raposo, A Santoro, V Bapst, Y Li, I Babuschkin, K Tuyls, ... International Conference on Learning Representations, 2018 | 181* | 2018 |
Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward. P Sunehag, G Lever, A Gruslys, WM Czarnecki, VF Zambaldi, ... AAMAS, 2085-2087, 2018 | 137 | 2018 |
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017 | 108 | 2017 |
Dawn of the selfie era: The whos, wheres, and hows of selfies on Instagram F Souza, D de Las Casas, V Flores, SB Youn, M Cha, D Quercia, ... Proceedings of the 2015 ACM on conference on online social networks, 221-231, 2015 | 91 | 2015 |
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in Neural Information Processing Systems, 3643-3652, 2017 | 89 | 2017 |
Actor-critic policy optimization in partially observable multiagent environments S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ... Advances in neural information processing systems, 3422-3435, 2018 | 75 | 2018 |
Relational forward models for multi-agent learning A Tacchetti, HF Song, PAM Mediano, V Zambaldi, NC Rabinowitz, ... arXiv preprint arXiv:1809.11044, 2018 | 32 | 2018 |
OpenSpiel: A framework for reinforcement learning in games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019 | 31 | 2019 |
CompILE: Compositional imitation learning and execution T Kipf, Y Li, H Dai, V Zambaldi, A Sanchez-Gonzalez, E Grefenstette, ... International Conference on Machine Learning, 3418-3428, 2019 | 15 | 2019 |
Lightweight contextual ranking of city pictures: urban sociology to the rescue V Zambaldi, J Pesce, D Quercia, V Almeida Proceedings of the International AAAI Conference on Web and Social Media 8 (1), 2014 | 13 | 2014 |
Compositional imitation learning: Explaining and executing one task at a time T Kipf, Y Li, H Dai, V Zambaldi, E Grefenstette, P Kohli, P Battaglia arXiv preprint arXiv:1812.01483, 2018 | 10 | 2018 |
Memo: A deep network for flexible combination of episodic memories A Banino, AP Badia, R Köster, MJ Chadwick, V Zambaldi, D Hassabis, ... arXiv preprint arXiv:2001.10913, 2020 | 7 | 2020 |
The advantage regret-matching actor-critic A Gruslys, M Lanctot, R Munos, F Timbers, M Schmid, J Perolat, D Morrill, ... arXiv preprint arXiv:2008.12234, 2020 | 1 | 2020 |
The Spatial Memory Pipeline: a model of egocentric to allocentric understanding in mammalian brains B Uria, B Ibarz, A Banino, V Zambaldi, D Kumaran, D Hassabis, C Barry, ... bioRxiv, 2020 | | 2020 |
Deep Learning Monitor CT Page, M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, ... Nature Communications 11 (1), 1760, 2020 | | 2020 |
Reinforcement learning using a relational network for generating data encoding relationships between entities in an environment Y Li, VC Bapst, V Zambaldi, DN Raposo, AA Santoro US Patent App. 16/417,580, 2019 | | 2019 |
CompILE: Compositional Imitation Learning and Execution T Kipf, Y Li, H Dai, V Zambaldi, A Sanchez-Gonzalez, E Grefenstette, ... | | |