Pierre Ménard

Cited by

	All	Since 2019
Citations	1218	1165
h-index	18	18
i10-index	24	22

Citations per year

Year	Citations
2016	8
2017	4
2018	24
2019	55
2020	110
2021	228
2022	277
2023	367
2024	128

Public access

22 articles available
0 articles not available

Based on funding mandates

Co-authors

Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	Verified email at meta.com
Omar Darwiche Domingues	Owkin	Verified email at owkin.com
Emilie Kaufmann	CNRS & Univ. Lille (CRIStAL)	Verified email at inria.fr
Aurélien Garivier	Ecole Normale Supérieure de Lyon	Verified email at ens-lyon.fr
Rémi Munos	DeepMind	Verified email at inria.fr
Edouard Leurent	DeepMind	Verified email at deepmind.com
Xuedong Shang	INRIA (SequeL -> SCOOL)	Verified email at inria.fr
Anders Jonsson	Artificial Intelligence and Machine Learning group, Universitat Pompeu Fabra	Verified email at upf.edu
Matteo Pirotta	Research Scientist, Meta (FAIR)	Verified email at fb.com
Tadashi Kozuno	Omron Sinic X	Verified email at sinicx.com
Rémy Degenne	Inria Lille	Verified email at inria.fr
Daniil Tiapkin	École Polytechnique	Verified email at polytechnique.edu
Alexey Naumov	National Research University Higher School of Economics	Verified email at hse.ru
Prof. Dr. Denis Belomestny	Duisburg-Essen University	Verified email at uni-due.de
Eric Moulines	Professor, Ecole Polytechnique, Member of the Académie des Sciences	Verified email at polytechnique.edu
Rianne de Heide	Assistant professor, Mathematics department, Vrije Universiteit Amsterdam	Verified email at vu.nl
Hedi HADIJI	CentraleSupelec	Verified email at centralesupelec.fr
Wouter M. Koolen	Centrum Wiskunde & Informatica; University of Twente	Verified email at cwi.nl
Alessandro Lazaric	Research Scientist, Facebook Artificial Intelligence Research	Verified email at inria.fr
Sébastien Gerchinovitz	Research scientist, IRT Saint Exupéry, Toulouse	Verified email at math.univ-toulouse.fr

Pierre Ménard

OvGU Magdeburg

Verified email at inria.fr


Title	Cited by	Year
Explore first, exploit next: The true shape of regret in bandit problems. A Garivier, P Ménard, G Stoltz. Mathematics of Operations Research 44 (2), 377-399, 2019	190	2019
Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited. O Darwiche Domingues, P Ménard, E Kaufmann, M Valko. arXiv e-prints, arXiv: 2010.03531, 2020	103*	2020
Fast active learning for pure exploration in reinforcement learning. P Ménard, OD Domingues, A Jonsson, E Kaufmann, E Leurent, M Valko. International Conference on Machine Learning, 7599-7608, 2021	87	2021
Non-asymptotic pure exploration by solving games. R Degenne, WM Koolen, P Ménard. Advances in Neural Information Processing Systems 32, 2019	85	2019
Adaptive reward-free exploration. E Kaufmann, P Ménard, OD Domingues, A Jonsson, E Leurent, M Valko. Algorithmic Learning Theory, 865-891, 2021	83	2021
Gamification of pure exploration for linear bandits. R Degenne, P Ménard, X Shang, M Valko. International Conference on Machine Learning, 2432-2442, 2020	79	2020
Fixed-confidence guarantees for Bayesian best-arm identification. X Shang, R Heide, P Menard, E Kaufmann, M Valko. International Conference on Artificial Intelligence and Statistics, 1823-1832, 2020	64	2020
A minimax and asymptotically optimal algorithm for stochastic bandits. P Ménard, A Garivier. International Conference on Algorithmic Learning Theory, 223-237, 2017	55	2017
Kernel-based reinforcement learning: A finite-time analysis. OD Domingues, P Ménard, M Pirotta, E Kaufmann, M Valko. International Conference on Machine Learning, 2783-2792, 2021	45*	2021
KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints. A Garivier, H Hadiji, P Menard, G Stoltz. Journal of Machine Learning Research 23 (179), 1-66, 2022	42	2022
UCB momentum Q-learning: Correcting the bias without forgetting. P Ménard, OD Domingues, X Shang, M Valko. International Conference on Machine Learning, 7609-7618, 2021	39	2021
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces. O Darwiche Domingues, P Ménard, M Pirotta, E Kaufmann, M Valko. arXiv e-prints, arXiv: 2007.05078, 2020	35*	2020
Model-free learning for two-player zero-sum partially observable Markov games with perfect recall. T Kozuno, P Ménard, R Munos, M Valko. arXiv preprint arXiv:2106.06279, 2021	33*	2021
Fano’s inequality for random variables. S Gerchinovitz, P Ménard, G Stoltz	33	2020
A single algorithm for both restless and rested rotting bandits. J Seznec, P Menard, A Lazaric, M Valko. International Conference on Artificial Intelligence and Statistics, 3784-3794, 2020	31	2020
Thresholding bandit for dose-ranging: The impact of monotonicity. A Garivier, P Ménard, L Rossi, P Menard. arXiv preprint arXiv:1711.04454, 2017	28	2017
Planning in Markov decision processes with gap-dependent sample complexity. A Jonsson, E Kaufmann, P Ménard, O Darwiche Domingues, E Leurent, ... Advances in Neural Information Processing Systems 33, 1253-1263, 2020	27	2020
Bandits with many optimal arms. R De Heide, J Cheshire, P Ménard, A Carpentier. Advances in Neural Information Processing Systems 34, 22457-22469, 2021	19	2021
Planning in entropy-regularized Markov decision processes and games. JB Grill, O Darwiche Domingues, P Ménard, R Munos, M Valko. Advances in Neural Information Processing Systems 32, 2019	18	2019
Gradient ascent for active exploration in bandit problems. P Ménard. arXiv preprint arXiv:1905.08165, 2019	17	2019
