Volgen
Zheng Wen
Zheng Wen
DeepMind
Geverifieerd e-mailadres voor google.com - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
A Tutorial on Thompson Sampling
D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen
arXiv, https://arxiv.org/pdf/1707.02038.pdf, 0
848*
Generalization and exploration via randomized value functions
I Osband, B Van Roy, Z Wen
International Conference on Machine Learning, 2377-2386, 2016
2972016
Cascading bandits: Learning to rank in the cascade model
B Kveton, C Szepesvári, Z Wen, A Ashkan
ICML, 2015
2792015
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits
B Kveton, Z Wen, A Ashkan, C Szepesvari
International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014
2642014
Optimal demand response using device based reinforcement learning
Z Wen, D O'Neill, HR Maei
IEEE Transactions on Smart Grid, 2014
2632014
Deep Exploration via Randomized Value Functions.
I Osband, B Van Roy, DJ Russo, Z Wen
J. Mach. Learn. Res. 20 (124), 1-62, 2019
2522019
Online influence maximization under independent cascade model with semi-bandit feedback
Z Wen, B Kveton, M Valko, S Vaswani
Advances in neural information processing systems 30, 2017
124*2017
Matroid bandits: Fast combinatorial optimization with learning
B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson
UAI 2014, 2014
1142014
Combinatorial cascading bandits
B Kveton, Z Wen, A Ashkan, C Szepesvari
Advances in Neural Information Processing Systems 28, 2015
1102015
Cascading bandits for large-scale recommendation problems
S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton
arXiv preprint arXiv:1603.05359, 2016
1082016
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit
Y Cao, Z Wen, B Kveton, Y Xie
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
103*2019
Efficient learning in large-scale combinatorial semi-bandits
Z Wen, B Kveton, A Ashkan
http://jmlr.org/proceedings/papers/v37/wen15.html, 2014
1002014
Optimal Greedy Diversity for Recommendation
A Ashkan, B Kveton, S Berkovsky, Z Wen
962015
Online learning to rank in stochastic click models
M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen
International conference on machine learning, 4199-4208, 2017
922017
DCM Bandits: Learning to Rank with Multiple Clicks
S Katariya, B Kveton, C Szepesvári, Z Wen
arXiv, 2016
812016
Efficient Exploration and Value Function Generalization in Deterministic Systems
Z Wen, B Van Roy
Advances in Neural Information Processing Systems, 3021--3029, 2013
752013
Model-independent online learning for influence maximization
S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ...
International Conference on Machine Learning, 3530-3539, 2017
67*2017
Stochastic rank-1 bandits
S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen
Artificial Intelligence and Statistics, 392-401, 2017
672017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits
B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh
International Conference on Machine Learning, 3601-3610, 2019
602019
Adaptive submodular maximization in bandit setting
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
Advances in Neural Information Processing Systems 26, 2013
582013
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20