Pierre Ménard

Cited by

	All	Since 2020
Citations	1592	1482
h-index	21	21
i10-index	29	27

420

210

105

315

20162017201820192020202120222023202420258 4 25 54 110 225 275 406 386 80

Public access

View all

24 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Michal ValkoChief Models Officer @ Stealth Startup, Inria & MVA - Ex: Llama at Meta; Gemini and BYOL @ DeepmindVerified email at meta.com
Omar Darwiche DominguesCohereVerified email at cohere.com
Emilie KaufmannCNRS & Univ. Lille (CRIStAL)Verified email at inria.fr
Rémi MunosFAIR, MetaVerified email at inria.fr
Aurélien GarivierEcole Normale Supérieure de LyonVerified email at ens-lyon.fr
Edouard LeurentDeepMindVerified email at deepmind.com
Anders JonssonArtificial Intelligence and Machine Learning group, Universitat Pompeu FabraVerified email at upf.edu
Xuedong ShangINRIA (SequeL -> SCOOL)Verified email at inria.fr
Matteo PirottaResearch Scientist, Meta (FAIR)Verified email at fb.com
Eric MoulinesProfesseur, Ecole Polytechnique, Membre de l'Académie des SciencesVerified email at polytechnique.edu
Daniil TiapkinÉcole PolytechniqueVerified email at polytechnique.edu
Alexey NaumovProfessor, HSE UniversityVerified email at hse.ru
Prof. Dr. Denis BelomestnyDuisburg-Essen UniversityVerified email at uni-due.de
Tadashi KozunoOMRON SINIC XVerified email at alumni.oist.jp
Rémy DegenneInria LilleVerified email at inria.fr
Rianne de HeideAssistant professor, Department of Applied Mathematics, University of TwenteVerified email at utwente.nl
Wouter M. KoolenCentrum Wiskunde & Informatica; University of TwenteVerified email at cwi.nl
Hedi HADIJICentraleSupelecVerified email at centralesupelec.fr
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchVerified email at inria.fr
Sébastien GerchinovitzResearch scientist, IRT Saint Exupéry, ToulouseVerified email at math.univ-toulouse.fr

Pierre Ménard

OvGU Magdeburg

Verified email at inria.fr - Homepage


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Explore first, exploit next: The true shape of regret in bandit problems A Garivier, P Ménard, G Stoltz Mathematics of Operations Research 44 (2), 377-399, 2019	219	2019
Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited O Darwiche Domingues, P Ménard, E Kaufmann, M Valko arXiv e-prints, arXiv: 2010.03531, 2020	130*	2020
Fast active learning for pure exploration in reinforcement learning P Ménard, OD Domingues, A Jonsson, E Kaufmann, E Leurent, M Valko International Conference on Machine Learning, 7599-7608, 2021	117	2021
Non-asymptotic pure exploration by solving games R Degenne, WM Koolen, P Ménard Advances in Neural Information Processing Systems 32, 2019	116	2019
Gamification of pure exploration for linear bandits R Degenne, P Ménard, X Shang, M Valko International Conference on Machine Learning, 2432-2442, 2020	105	2020
Adaptive reward-free exploration E Kaufmann, P Ménard, OD Domingues, A Jonsson, E Leurent, M Valko Algorithmic Learning Theory, 865-891, 2021	100	2021
Fixed-confidence guarantees for bayesian best-arm identification X Shang, R Heide, P Menard, E Kaufmann, M Valko International Conference on Artificial Intelligence and Statistics, 1823-1832, 2020	82	2020
A minimax and asymptotically optimal algorithm for stochastic bandits P Ménard, A Garivier International Conference on Algorithmic Learning Theory, 223-237, 2017	63	2017
Kernel-based reinforcement learning: A finite-time analysis OD Domingues, P Ménard, M Pirotta, E Kaufmann, M Valko International Conference on Machine Learning, 2783-2792, 2021	55*	2021
Learning in two-player zero-sum partially observable Markov games with perfect recall T Kozuno, P Ménard, R Munos, M Valko Advances in Neural Information Processing Systems 34, 11987-11998, 2021	51	2021
Ucb momentum q-learning: Correcting the bias without forgetting P Ménard, OD Domingues, X Shang, M Valko International Conference on Machine Learning, 7609-7618, 2021	49	2021
KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints A Garivier, H Hadiji, P Menard, G Stoltz Journal of Machine Learning Research 23 (179), 1-66, 2022	48	2022
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces O Darwiche Domingues, P Ménard, M Pirotta, E Kaufmann, M Valko arXiv e-prints, arXiv: 2007.05078, 2020	48*	2020
Planning in markov decision processes with gap-dependent sample complexity A Jonsson, E Kaufmann, P Ménard, O Darwiche Domingues, E Leurent, ... Advances in Neural Information Processing Systems 33, 1253-1263, 2020	42	2020
A single algorithm for both restless and rested rotting bandits J Seznec, P Menard, A Lazaric, M Valko International Conference on Artificial Intelligence and Statistics, 3784-3794, 2020	41	2020
Fano’s inequality for random variables S Gerchinovitz, P Ménard, G Stoltz	40	2020
Thresholding bandit for dose-ranging: The impact of monotonicity A Garivier, P Ménard, L Rossi, P Menard arXiv preprint arXiv:1711.04454, 2017	31	2017
Bandits with many optimal arms R De Heide, J Cheshire, P Ménard, A Carpentier Advances in Neural Information Processing Systems 34, 22457-22469, 2021	24	2021
rlberry-A Reinforcement Learning Library for Research and Education OD Domingues, Y Flet-Berliac, E Leurent, P Ménard, X Shang, M Valko October, 2021	24	2021
Gradient ascent for active exploration in bandit problems P Ménard arXiv preprint arXiv:1905.08165, 2019	24	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors