Zheng Wen

Geciteerd door

	Alles	Sinds 2019
Citaties	5181	4364
h-index	31	30
i10-index	54	50

1000

500

250

750

2014201520162017201820192020202120222023202428 68 146 184 330 502 738 847 985 908 382

Openbare toegang

Alles bekijken

8 artikelen

0 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Branislav KvetonAmazonGeverifieerd e-mailadres voor amazon.com
Benjamin Van RoyStanford UniversityGeverifieerd e-mailadres voor stanford.edu
Ian OsbandOpenAIGeverifieerd e-mailadres voor openai.com
Csaba SzepesvariDeepMind & University of AlbertaGeverifieerd e-mailadres voor cs.ualberta.ca
Azin AshkanGoogleGeverifieerd e-mailadres voor uwaterloo.ca
Xiuyuan LuGoogle DeepMindGeverifieerd e-mailadres voor google.com
Yasin Abbasi YadkoriDeepMindGeverifieerd e-mailadres voor google.com
Vikranth DwaracherlaDeepMindGeverifieerd e-mailadres voor google.com
Morteza IbrahimiStanford UniversityGeverifieerd e-mailadres voor stanford.edu
Mohammad GhavamzadehAmazonGeverifieerd e-mailadres voor amazon.com
Sharan VaswaniSimon Fraser UniversityGeverifieerd e-mailadres voor sfu.ca
Daniel RussoColumbia UniversityGeverifieerd e-mailadres voor gsb.columbia.edu
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindGeverifieerd e-mailadres voor meta.com
Seyed Mohammad AsghariResearch Engineer, DeepMindGeverifieerd e-mailadres voor google.com
Botao HaoDeepmindGeverifieerd e-mailadres voor google.com
Brian ErikssonAdobeGeverifieerd e-mailadres voor adobe.com
S MuthukrishnanRutgers UnivGeverifieerd e-mailadres voor cs.rutgers.edu
Sumeet KatariyaAmazonGeverifieerd e-mailadres voor wisc.edu
Shlomo BerkovskyMacquarie UniversityGeverifieerd e-mailadres voor mq.edu.au
Abbas KazerouniStanford UniversityGeverifieerd e-mailadres voor stanford.edu

Volgen

Zheng Wen

Google DeepMind

Geverifieerd e-mailadres voor google.com - Homepage

Artificial Intelligence Reinforcement Learning Operations Research Large Language Models


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
A Tutorial on Thompson Sampling D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen arXiv, https://arxiv.org/pdf/1707.02038.pdf, 0	1057*
Generalization and exploration via randomized value functions I Osband, B Van Roy, Z Wen International Conference on Machine Learning, 2377-2386, 2016	328	2016
Deep exploration via randomized value functions I Osband, B Van Roy, DJ Russo, Z Wen Journal of Machine Learning Research 20 (124), 1-62, 2019	321	2019
Cascading bandits: Learning to rank in the cascade model B Kveton, C Szepesvári, Z Wen, A Ashkan ICML, 2015	307	2015
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits B Kveton, Z Wen, A Ashkan, C Szepesvari International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014	306	2014
Optimal demand response using device based reinforcement learning Z Wen, D O'Neill, HR Maei IEEE Transactions on Smart Grid, 2014	303	2014
Online influence maximization under independent cascade model with semi-bandit feedback Z Wen, B Kveton, M Valko, S Vaswani Advances in neural information processing systems 30, 2017	144*	2017
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit Y Cao, Z Wen, B Kveton, Y Xie The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	128*	2019
Cascading bandits for large-scale recommendation problems S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton arXiv preprint arXiv:1603.05359, 2016	126	2016
Combinatorial cascading bandits B Kveton, Z Wen, A Ashkan, C Szepesvari Advances in Neural Information Processing Systems 28, 2015	125	2015
Matroid bandits: Fast combinatorial optimization with learning B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson UAI 2014, 2014	125	2014
Efficient learning in large-scale combinatorial semi-bandits Z Wen, B Kveton, A Ashkan http://jmlr.org/proceedings/papers/v37/wen15.html, 2014	108	2014
Optimal Greedy Diversity for Recommendation A Ashkan, B Kveton, S Berkovsky, Z Wen	107	2015
Online learning to rank in stochastic click models M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen International conference on machine learning, 4199-4208, 2017	106	2017
DCM Bandits: Learning to Rank with Multiple Clicks S Katariya, B Kveton, C Szepesvári, Z Wen arXiv, 2016	88	2016
Efficient Exploration and Value Function Generalization in Deterministic Systems Z Wen, B Van Roy Advances in Neural Information Processing Systems, 3021--3029, 2013	86	2013
Model-independent online learning for influence maximization S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ... International conference on machine learning, 3530-3539, 2017	81*	2017
Epistemic neural networks I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Advances in Neural Information Processing Systems 36, 2024	79	2024
Stochastic rank-1 bandits S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen Artificial Intelligence and Statistics, 392-401, 2017	73	2017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	72	2019

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs