Jordi Grau-Moya
Jordi Grau-Moya
Research Scientist at Google DeepMind
Geverifieerd e-mailadres voor deepmind.com - Homepage
Geciteerd door
Geciteerd door
Bounded rationality, abstraction and hierarchical decision-making: an information-theoretic optimality principle
T Genewein, F Leibfried, J Grau-Moya, DAB Braun
Frontiers in Robotics and AI 2, 27, 2015
Balancing Two-Player Stochastic Games with Soft Q-Learning
J Grau-Moya, F Leibfried, H Bou-Ammar
Proceedings of the 27th International Joint Conference on Artificial …, 2018
Soft Q-Learning with Mutual-Information Regularization
J Grau-Moya, F Leibfried, P Vrancx
International Conference on Learning Representations (ICLR), 2019
Signaling equilibria in sensorimotor interactions.
F Leibfried, J Grau-Moya, DA Braun
Cognition 141, 73-86, 2015
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes
J Grau-Moya, F Leibfried, T Genewein, DA Braun
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2016
An information-theoretic optimality principle for deep reinforcement learning
F Leibfried, J Grau-Moya, H Bou-Ammar
NeurIPS Workshop on Deep Reinforcement Learning, 2017
A unified bellman optimality principle combining reward maximization and empowerment
F Leibfried, S Pascual-Diaz, J Grau-Moya
Advances in Neural Information Processing Systems, 7869-7880, 2019
The effect of model uncertainty on cooperation in sensorimotor interactions
J Grau-Moya, E Hez, G Pezzulo, DA Braun
Journal of The Royal Society Interface 10 (87), 20130554, 2013
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
F Leibfried, J Grau-Moya
Conference on Robot Learning (CoRL), 2019
Disentangled Skill Embeddings for Reinforcement Learning
JC Petangoda, S Pascual-Diaz, V Adam, P Vrancx, J Grau-Moya
NeurIPS Workshop on Learning Transferable Skills, 2019
Risk-Sensitivity in Bayesian Sensorimotor Integration
J Grau-Moya, PA Ortega, DA Braun
PLOS Computational Biology 8 (9), e1002698, 2012
Shaking the foundations: delusions in sequence models for interaction and control
PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ...
arXiv preprint arXiv:2110.10819, 2021
Non-equilibrium relations for bounded rational decision-making in changing environments
J Grau-Moya, M Krüger, DA Braun
Entropy 20 (1), 1, 2017
Causal Analysis of Agent Behavior for AI Safety
G Déletang, J Grau-Moya, M Martic, T Genewein, T McGrath, V Mikulik, ...
arXiv preprint arXiv:2103.03938, 2021
Model-Free Risk-Sensitive Reinforcement Learning
G Delétang, J Grau-Moya, M Kunesch, T Genewein, R Brekelmans, ...
arXiv preprint arXiv:2111.02907, 2021
Decision-making under ambiguity is modulated by visual framing, but not by motor vs. non-motor context. experiments and an information-theoretic ambiguity model
J Grau-Moya, PA Ortega, DA Braun
PloS one 11 (4), e0153179, 2016
Noise-induced up/down dynamics in scale-free neuronal networks
J Grau-Moya, AJ Pons, J Garcia-Ojalvo
International Journal of Bifurcation and Chaos 22 (07), 1250175, 2012
Neural Networks and the Chomsky Hierarchy
G Delétang, A Ruoss, J Grau-Moya, T Genewein, LK Wenliang, E Catt, ...
arXiv preprint arXiv:2207.02098, 2022
Bounded Rational Decision-Making in Changing Environments
J Grau-Moya, DA Braun
NIPS 2013 Workshop on Planning with Information Constraints, 2013
Your Policy Regularizer is Secretly an Adversary
R Brekelmans, T Genewein, J Grau-Moya, G Delétang, M Kunesch, ...
arXiv preprint arXiv:2203.12592, 2022
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20