Cosmin Paduraru
Cosmin Paduraru
DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Safe exploration in continuous action spaces
G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa
arXiv preprint arXiv:1801.08757, 2018
1092018
Off-policy learning with options and recognizers
D Precup, C Paduraru, A Koop, RS Sutton, SP Singh
Advances in Neural Information Processing Systems, 1097-1104, 2005
262005
An empirical investigation of the challenges of real-world reinforcement learning
G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ...
arXiv preprint arXiv:2003.11881, 2020
242020
Off-policy evaluation in Markov decision processes
C Paduraru
Ph. D. Dissertation. McGill University, 2012
222012
Hyperparameter selection for offline reinforcement learning
TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ...
arXiv preprint arXiv:2007.09055, 2020
152020
Rl unplugged: Benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, TL Paine, SG Colmenarejo, K Zolna, ...
arXiv preprint arXiv:2006.13888, 2020
142020
Grounding abstractions in predictive state representations
B Tanner, V Bulitko, A Koop, C Paduraru
Proceedings of the twentieth international joint conference on artificial …, 2007
122007
An empirical analysis of off-policy learning in discrete mdps
C Păduraru, D Precup, J Pineau, G Comănici
European Workshop on Reinforcement Learning, 89-102, 2013
82013
A framework for computing bounds for the return of a policy
C Păduraru, D Precup, J Pineau
European Workshop on Reinforcement Learning, 201-212, 2011
72011
Responding to new information in a mining complex: fast mechanisms using machine learning
C Paduraru, R Dimitrakopoulos
Mining Technology, 2019
62019
Adaptive policies for short-term material flow optimization in a mining complex
C Paduraru, R Dimitrakopoulos
Mining Technology 127 (1), 56-63, 2018
62018
Planning with Approximate and Learned Models of Markov Decision Processes
C Paduraru
MSc thesis, Department of Computing Science, University of Alberta, 2007
52007
Temporal abstraction
D Precup, C Paduraru, A Koop, RS Sutton, S Singh
URL: http://videolectures. net/site/normal_dl/tag 1199094, 2018
42018
A study of off-policy learning in computational sustainability
C Paduraru, D Precup, J Pineau, G Comanici
European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012
42012
Model-based reinforcement learning with state aggregation
C Paduraru, R Kaplow, D Precup, J Pineau
8th European Workshop on Reinforcement Learning, 2008
42008
Mineral Supply Chain Optimization Under Uncertainty Using Approximate Dynamic Programming
C Paduraru, RG Dimitrakopoulos
GERAD HEC Montréal, 2015
32015
RL Unplugged: Benchmarks for Offline Reinforcement Learning
Ç Gülçehre, Z Wang, A Novikov, T Le Paine, SG Colmenarejo, K Zolna, ...
22020
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ...
Machine Learning, 1-50, 2021
12021
Challenges of Real-World Reinforcement Learning: Definitions, Benchmarks & Analysis
C Paduraru, DJ Mankowitz, G Dulac-Arnold, J Li, N Levine, S Gowal, ...
12021
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
DJ Mankowitz, DA Calian, R Jeong, C Paduraru, N Heess, S Dathathri, ...
arXiv preprint arXiv:2010.10644, 2020
12020
The system can't perform the operation now. Try again later.
Articles 1–20