Obserwuj
Odalric-Ambrym Maillard
Odalric-Ambrym Maillard
Inria Lille - Nord Europe
Zweryfikowany adres z inria.fr - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Kullback-Leibler upper confidence bounds for optimal sequential allocation
O Cappé, A Garivier, OA Maillard, R Munos, G Stoltz
The Annals of Statistics, 1516-1541, 2013
4132013
Concentration inequalities for sampling without replacement
R Bardenet, OA Maillard
1872015
CATS, a low pressure multiwire proportionnal chamber for secondary beam tracking at GANIL
S Ottini-Hustache, C Mazur, F Auger, A Musumarra, N Alamanos, ...
Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 1999
1671999
A finite-time analysis of multi-armed bandits problems with kullback-leibler divergences
OA Maillard, R Munos, G Stoltz
Proceedings of the 24th annual Conference On Learning Theory, 497-514, 2011
1592011
Compressed least-squares regression
OA Maillard, R Munos
Advances in Neural Information Processing Systems, 2009
1382009
Latent Bandits.
OA Maillard, S Mannor
International Conference on Machine Learning, 136-144, 2014
1002014
The non-stationary stochastic multi-armed bandit problem
R Allesiardo, R Féraud, OA Maillard
International Journal of Data Science and Analytics 3, 267-283, 2017
822017
Robust risk-averse stochastic multi-armed bandits
OA Maillard
Algorithmic Learning Theory: 24th International Conference, ALT 2013 …, 2013
752013
LSTD with random projections
M Ghavamzadeh, A Lazaric, OA Maillard, R Munos
Advances in Neural Information Processing Systems 23, 721--729, 2010
722010
Variance-aware regret bounds for undiscounted reinforcement learning in mdps
MS Talebi, OA Maillard
Algorithmic Learning Theory, 770-805, 2018
702018
Sub-sampling for multi-armed bandits
A Baransi, OA Maillard, S Mannor
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014
642014
PICOSEC: Charged particle timing at sub-25 picosecond precision with a Micromegas based detector
J Bortfeldt, F Brunbauer, C David, D Desforge, G Fanourakis, J Franchi, ...
Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2018
602018
Linear regression with random projections
O Maillard, R Munos
Journal of Machine Learning Research 13 (1), 2735-2772, 2012
602012
How hard is my MDP?" The distribution-norm to the rescue"
OA Maillard, TA Mann, S Mannor
Advances in Neural Information Processing Systems 27, 2014
582014
Online learning in adversarial lipschitz environments
OA Maillard, R Munos
Joint european conference on machine learning and knowledge discovery in …, 2010
512010
Finite-sample analysis of Bellman residual minimization
OA Maillard, R Munos, A Lazaric, M Ghavamzadeh
Proceedings of 2nd Asian Conference on Machine Learning, 299-314, 2010
462010
Selecting the state-representation in reinforcement learning
OA Maillard, D Ryabko, R Munos
Advances in Neural Information Processing Systems 24, 2011
452011
Adaptive Bandits: Towards the best history-dependent strategy
OA Maillard, R Munos
Proceedings of the Fourteenth International Conference on Artificial …, 2011
40*2011
Optimal thompson sampling strategies for support-aware cvar bandits
D Baudry, R Gautron, E Kaufmann, O Maillard
International Conference on Machine Learning, 716-726, 2021
352021
Sequential change-point detection: Laplace concentration of scan statistics and non-asymptotic delay bounds
OA Maillard
Algorithmic Learning Theory, 610-632, 2019
352019
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20