Volgen
Laurent Orseau
Laurent Orseau
Research Scientist at Google DeepMind
Geverifieerd e-mailadres voor google.com
Titel
Geciteerd door
Geciteerd door
Jaar
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
3202017
Safely Interruptible Agents
L Orseau, S Armstrong
Uncertainty in Artificial Intelligence, 557–566, 2016
1432016
Reinforcement learning with a corrupted reward channel
T Everitt, V Krakovna, L Orseau, M Hutter, S Legg
arXiv preprint arXiv:1705.08417, 2017
1142017
Delusion, survival, and intelligent agents
M Ring, L Orseau
Artificial General Intelligence: 4th International Conference, AGI 2011 …, 2011
902011
An investigation of model-free planning
A Guez, M Mirza, K Gregor, R Kabra, S Racanière, T Weber, D Raposo, ...
International conference on machine learning, 2464-2473, 2019
882019
Logarithmic pruning is all you need
L Orseau, M Hutter, O Rivasplata
Advances in Neural Information Processing Systems 33, 2925-2934, 2020
872020
Goal misgeneralization in deep reinforcement learning
LL Di Langosco, J Koch, LD Sharkey, J Pfau, D Krueger
International Conference on Machine Learning, 12004-12019, 2022
672022
Penalizing side effects using stepwise relative reachability
V Krakovna, L Orseau, R Kumar, M Martic, S Legg
arXiv preprint arXiv:1806.01186, 2018
532018
Space-Time Embedded Intelligence
L Orseau, M Ring
Artificial General Intelligence, 209-218, 2012
502012
Universal knowledge-seeking agents for stochastic environments
L Orseau, T Lattimore, M Hutter
Algorithmic Learning Theory: 24th International Conference, ALT 2013 …, 2013
482013
Self-modification and mortality in artificial agents
L Orseau, M Ring
Artificial General Intelligence: 4th International Conference, AGI 2011 …, 2011
442011
Thompson sampling is asymptotically optimal in general environments
J Leike, T Lattimore, L Orseau, M Hutter
arXiv preprint arXiv:1602.07905, 2016
422016
Language modeling is compression
G Delétang, A Ruoss, PA Duquenne, E Catt, T Genewein, C Mattern, ...
arXiv preprint arXiv:2309.10668, 2023
392023
Avoiding side effects by considering future tasks
V Krakovna, L Orseau, R Ngo, M Martic, S Legg
Advances in Neural Information Processing Systems 33, 19064-19074, 2020
392020
Single-agent policy tree search with guarantees
L Orseau, L Lelis, T Lattimore, T Weber
Advances in Neural Information Processing Systems 31, 2018
352018
Universal knowledge-seeking agents
L Orseau
Theoretical Computer Science 519, 127-139, 2014
272014
Soft-bayes: Prod for mixtures of experts with log-loss
L Orseau, T Lattimore, S Legg
International Conference on Algorithmic Learning Theory, 372-399, 2017
262017
Pitfalls of learning a reward function online
S Armstrong, J Leike, L Orseau, S Legg
arXiv preprint arXiv:2004.13654, 2020
232020
Optimality issues of universal greedy agents with static priors
L Orseau
Algorithmic Learning Theory: 21st International Conference, ALT 2010 …, 2010
232010
Learning to prove from synthetic theorems
E Aygün, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ...
arXiv preprint arXiv:2006.11259, 2020
202020
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20