Julian Zimmert
Title
Cited by
Year
An optimal algorithm for stochastic and adversarial bandits
J Zimmert, Y Seldin
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by 78, 2019
Beating stochastic and adversarial semi-bandits optimally and simultaneously
J Zimmert, H Luo, CY Wei
International Conference on Machine Learning, 7683-7692, 2019
Cited by 49, 2019
Model selection in contextual stochastic bandit problems
A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ...
Advances in Neural Information Processing Systems 33, 10328-10337, 2020
Cited by 47, 2020
Adapting to misspecification in contextual bandits
DJ Foster, C Gentile, M Mohri, J Zimmert
Advances in Neural Information Processing Systems 33, 11478-11489, 2020
Cited by 34, 2020
Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits.
J Zimmert, Y Seldin
J. Mach. Learn. Res. 22, 28:1-28:49, 2021
Cited by 27, 2021
An optimal algorithm for adversarial bandits with arbitrary delays
J Zimmert, Y Seldin
International Conference on Artificial Intelligence and Statistics, 3285-3294, 2020
Cited by 27, 2020
Connections between mirror descent, Thompson sampling and the information ratio
J Zimmert, T Lattimore
Advances in Neural Information Processing Systems 32, 2019
Cited by 21, 2019
Safe screening for support vector machines
J Zimmert, CS de Witt, G Kerg, M Kloft
NIPS 2015 Workshop on Optimization in Machine Learning (OPT), 2015
Cited by 19, 2015
Distributed optimization of multi-class SVMs
M Alber, J Zimmert, U Dogan, M Kloft
PLoS ONE 12 (6), e0178161, 2017
Cited by 12, 2017
Factored bandits
J Zimmert, Y Seldin
Advances in Neural Information Processing Systems 31, 2018
Cited by 10, 2018
Beyond value-function gaps: Improved instance-dependent regret bounds for episodic reinforcement learning
C Dann, TV Marinov, M Mohri, J Zimmert
Advances in Neural Information Processing Systems 34, 2021
Cited by 5, 2021
A model selection approach for corruption robust reinforcement learning
CY Wei, C Dann, J Zimmert
International Conference on Algorithmic Learning Theory, 1043-1096, 2022
Cited by 3, 2022
The Pareto Frontier of model selection for general Contextual Bandits
TV Marinov, J Zimmert
Advances in Neural Information Processing Systems 34, 2021
Cited by 3, 2021
Online learning for active cache synchronization
A Kolobov, S Bubeck, J Zimmert
International Conference on Machine Learning, 5371-5380, 2020
Cited by 3, 2020
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
C Dann, M Mohri, T Zhang, J Zimmert
Advances in Neural Information Processing Systems 34, 2021
Cited by 2, 2021
Efficient Methods for Online Multiclass Logistic Regression
N Agarwal, S Kale, J Zimmert
International Conference on Algorithmic Learning Theory, 3-33, 2022
2022
Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum States
J Zimmert, N Agarwal, S Kale
arXiv preprint arXiv:2202.02765, 2022
2022
Adversarially robust stochastic multi-armed bandits
J Zimmert