Sergio Gómez Colmenarejo
Research Engineer, DeepMind
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
Learning to learn by gradient descent by gradient descent
M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ...
Advances in neural information processing systems, 3981-3989, 2016
Policy distillation
AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ...
arXiv preprint arXiv:1511.06295, 2015
Learning to learn without gradient descent by gradient descent
Y Chen, MW Hoffman, SG Colmenarejo, M Denil, TP Lillicrap, M Botvinick, ...
International Conference on Machine Learning, 748-756, 2017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, D Belov, ...
arXiv preprint arXiv:1703.03664, 2017
Learned optimizers that scale and generalize
O Wichrowska, N Maheswaranathan, MW Hoffman, SG Colmenarejo, ...
arXiv preprint arXiv:1703.04813, 2017
Programmable agents
M Denil, SG Colmenarejo, S Cabi, D Saxton, N de Freitas
arXiv preprint arXiv:1706.06383, 2017
Learning awareness models
B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ...
arXiv preprint arXiv:1804.06318, 2018
The intentional unintentional agent: Learning to solve many continuous control tasks simultaneously
S Cabi, SG Colmenarejo, MW Hoffman, M Denil, Z Wang, N de Freitas
arXiv preprint arXiv:1707.03300, 2017
TF-Replicator: Distributed machine learning for researchers
P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ...
arXiv preprint arXiv:1902.00465, 2019
One-shot high-fidelity imitation: Training large-scale deep nets with RL
TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ...
arXiv preprint arXiv:1810.05017, 2018
Task-relevant adversarial imitation learning
K Zolna, S Reed, A Novikov, SG Colmenarejo, D Budden, S Cabi, M Denil, ...
arXiv preprint arXiv:1910.01077, 2019
A Framework for Data-Driven Robotics
S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...
arXiv preprint arXiv:1909.12200, 2019
Acme: A Research Framework for Distributed Reinforcement Learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, S Gómez Colmenarejo, A Novikov, K Konyushkova, S Reed, ...
arXiv preprint arXiv:1909.12200, 2019
RL Unplugged: Benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, TL Paine, SG Colmenarejo, K Zolna, ...
arXiv preprint arXiv:2006.13888, 2020
Approximate Hubel-Wiesel modules and the data structures of neural computation
JZ Leibo, J Cornebise, S Gómez, D Hassabis
arXiv preprint arXiv:1512.08457, 2015