Sebastien Bubeck
Sr Principal Researcher, Microsoft Research
Verified email at microsoft.com - Homepage
Title
Cited by
Year
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
S Bubeck, N Cesa-Bianchi
Foundations and Trends in Machine Learning 5, 1-122, 2012
Cited by: 1593 (2012)
Convex optimization: Algorithms and complexity
S Bubeck
Foundations and Trends® in Machine Learning 8 (3-4), 231-357, 2015
Cited by: 752* (2015)
Best arm identification in multi-armed bandits
JY Audibert, S Bubeck, R Munos
COLT 2010, 2010
Cited by: 415 (2010)
Pure exploration in multi-armed bandits problems
S Bubeck, R Munos, G Stoltz
Algorithmic Learning Theory, 23-37, 2009
Cited by: 296 (2009)
X-armed bandits
S Bubeck, R Munos, G Stoltz, C Szepesvári
Journal of Machine Learning Research 12, 1587-1627, 2011
Cited by: 289 (2011)
Minimax policies for adversarial and stochastic bandits
JY Audibert, S Bubeck
COLT 2009, 2009
Cited by: 284 (2009)
lil'UCB: An Optimal Exploration Algorithm for Multi-Armed Bandits
K Jamieson, M Malloy, R Nowak, S Bubeck
COLT 2014, 2013
Cited by: 202 (2013)
Online optimization in X-armed bandits
S Bubeck, R Munos, G Stoltz, C Szepesvári
NIPS 2008, 2008
Cited by: 195 (2008)
Regret bounds and minimax policies under partial monitoring
JY Audibert, S Bubeck
The Journal of Machine Learning Research 11, 2635-2686, 2010
Cited by: 174 (2010)
Pure exploration in finitely-armed and continuous-armed bandits
S Bubeck, R Munos, G Stoltz
Theoretical Computer Science 412, 1832-1852, 2010
Cited by: 153 (2010)
Regret in online combinatorial optimization
JY Audibert, S Bubeck, G Lugosi
Mathematics of Operations Research 39 (1), 31-45, 2014
Cited by: 152 (2014)
Multiple identifications in multi-armed bandits
S Bubeck, T Wang, N Viswanathan
ICML 2012, 2012
Cited by: 130 (2012)
Bandits with heavy tail
S Bubeck, N Cesa-Bianchi, G Lugosi
IEEE Transactions on Information Theory 59 (11), 7711-7717, 2013
Cited by: 118 (2013)
The best of both worlds: Stochastic and adversarial bandits
S Bubeck, A Slivkins
COLT 2012, 2012
Cited by: 109 (2012)
Optimal algorithms for smooth and strongly convex distributed optimization in networks
K Scaman, F Bach, S Bubeck, YT Lee, L Massoulié
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
Cited by: 106 (2017)
A geometric alternative to Nesterov's accelerated gradient descent
S Bubeck, YT Lee, M Singh
arXiv preprint arXiv:1506.08187, 2015
Cited by: 101 (2015)
Is Q-learning provably efficient?
C Jin, Z Allen-Zhu, S Bubeck, MI Jordan
Advances in Neural Information Processing Systems, 4863-4873, 2018
Cited by: 88 (2018)
Introduction to online optimization
S Bubeck
Lecture Notes 2, 2011
Cited by: 86 (2011)
Towards minimax policies for online linear optimization with bandit feedback
S Bubeck, N Cesa-Bianchi, SM Kakade
COLT 2012, 2012
Cited by: 78 (2012)
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
NIPS 2011, 2011
Cited by: 75 (2011)
Articles 1–20