Follow
Ronald Ortner
Ronald Ortner
Verified email at unileoben.ac.at - Homepage
Title
Cited by
Cited by
Year
Near-optimal regret bounds for reinforcement learning
P Auer, T Jaksch, R Ortner
Advances in neural information processing systems 21, 2008
9882008
UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem
P Auer, R Ortner
Periodica Mathematica Hungarica 61 (1-2), 55-65, 2010
2932010
Improved rates for the stochastic continuum-armed bandit problem
P Auer, R Ortner, C SzepesvŠri
International Conference on Computational Learning Theory, 454-468, 2007
2222007
Logarithmic online regret bounds for undiscounted reinforcement learning
P Auer, R Ortner
Advances in neural information processing systems 19, 2006
2162006
A boosting approach to multiple instance learning
P Auer, R Ortner
European Conference on Machine Learning, 63-74, 2004
1002004
Online regret bounds for undiscounted continuous reinforcement learning
R Ortner, D Ryabko
Advances in Neural Information Processing Systems 25, 2012
782012
Adaptively tracking the best bandit arm with an unknown number of distribution changes
P Auer, P Gajane, R Ortner
Conference on Learning Theory, 138-158, 2019
76*2019
Efficient bias-span-constrained exploration-exploitation in reinforcement learning
R Fruit, M Pirotta, A Lazaric, R Ortner
International Conference on Machine Learning, 1578-1586, 2018
652018
Regret bounds for restless markov bandits
R Ortner, D Ryabko, P Auer, R Munos
International conference on algorithmic learning theory, 214-228, 2012
522012
PAC-Bayesian analysis of contextual bandits
Y Seldin, P Auer, J Shawe-taylor, R Ortner, F Laviolette
Advances in neural information processing systems 24, 2011
492011
Non-backtracking random walks and cogrowth of graphs
R Ortner, W Woess
Canadian Journal of Mathematics 59 (4), 828-844, 2007
452007
Variational regret bounds for reinforcement learning
R Ortner, P Gajane, P Auer
Uncertainty in Artificial Intelligence, 81-90, 2020
412020
Regret bounds for restless Markov bandits
R Ortner, D Ryabko, P Auer, R Munos
Theoretical Computer Science 558, 62-76, 2014
402014
Improved learning complexity in combinatorial pure exploration bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
Artificial Intelligence and Statistics, 1004-1012, 2016
352016
Pseudometrics for state aggregation in average reward Markov decision processes
R Ortner
International Conference on Algorithmic Learning Theory, 373-387, 2007
342007
Improved regret bounds for undiscounted continuous reinforcement learning
K Lakshmanan, R Ortner, D Ryabko
International Conference on Machine Learning, 524-532, 2015
332015
Adaptive aggregation for reinforcement learning in average reward Markov decision processes
R Ortner
Annals of Operations Research 208 (1), 321-336, 2013
312013
Pareto front identification from stochastic bandit feedback
P Auer, CK Chiang, R Ortner, M Drugan
Artificial intelligence and statistics, 939-947, 2016
292016
Regret bounds for reinforcement learning via markov chain concentration
R Ortner
Journal of Artificial Intelligence Research 67, 115-128, 2020
262020
A sliding-window algorithm for markov decision processes with arbitrarily changing rewards and transitions
P Gajane, R Ortner, P Auer
arXiv preprint arXiv:1805.10066, 2018
252018
The system can't perform the operation now. Try again later.
Articles 1–20