Odalric-Ambrym Maillard

Cited by

	All	Since 2019
Citations	2913	1972
h-index	27	24
i10-index	56	52

Citations per year

Year	Citations
2000	11
2001	10
2002	6
2003	8
2004	3
2005	18
2006	11
2007	8
2008	7
2009	5
2010	19
2011	46
2012	60
2013	70
2014	98
2015	107
2016	108
2017	145
2018	187
2019	246
2020	269
2021	357
2022	397
2023	428
2024	270

Public access

47 articles available
0 articles not available

Based on funding mandates

Co-authors

Rémi Munos, Google DeepMind (verified email at inria.fr)
Philippe Preux, Professor of computer science, Université de Lille, LIFL, SequeL, INRIA (verified email at univ-lille.fr)
Shie Mannor, Professor of Electrical Engineering @ Technion & Researcher @ Nvidia (verified email at technion.ac.il)
Olivier Cappé, CNRS (verified email at cnrs.fr)
Alessandro Lazaric, Research Scientist, Facebook Artificial Intelligence Research (verified email at inria.fr)
Daniil Ryabko (verified email at ryabko.net)
Rémi Bardenet, CNRS, CRIStAL, Ecole Centrale Lille, Univ. Lille, France (verified email at ec-lille.fr)
Timothy A Mann, Meta (verified email at fb.com)
Akram Baransi (verified email at tx.technion.ac.il)
Nicolas Vayatis, Full Professor, Centre Borelli, Department of Mathematics, ENS Paris-Saclay (verified email at ens-paris-saclay.fr)
Rémi Coulom, Université Lille 3 (verified email at univ-lille3.fr)


Odalric-Ambrym Maillard

Inria Lille - Nord Europe

Verified email at inria.fr

Multi-armed Bandits, Stochastic Dynamical Systems, Statistical Learning, Reinforcement Learning, Random matrices


Title	Cited by	Year
Kullback-Leibler upper confidence bounds for optimal sequential allocation. O Cappé, A Garivier, OA Maillard, R Munos, G Stoltz. The Annals of Statistics, 1516-1541, 2013	426	2013
Concentration inequalities for sampling without replacement. R Bardenet, OA Maillard	196	2015
CATS, a low pressure multiwire proportionnal chamber for secondary beam tracking at GANIL. S Ottini-Hustache, C Mazur, F Auger, A Musumarra, N Alamanos, ... Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 1999	167	1999
A finite-time analysis of multi-armed bandits problems with Kullback-Leibler divergences. OA Maillard, R Munos, G Stoltz. Proceedings of the 24th Annual Conference on Learning Theory, 497-514, 2011	161	2011
Compressed least-squares regression. OA Maillard, R Munos. Advances in Neural Information Processing Systems, 2009	135	2009
Latent Bandits. OA Maillard, S Mannor. International Conference on Machine Learning, 136-144, 2014	106	2014
The non-stationary stochastic multi-armed bandit problem. R Allesiardo, R Féraud, OA Maillard. International Journal of Data Science and Analytics 3, 267-283, 2017	87	2017
Robust risk-averse stochastic multi-armed bandits. OA Maillard. Algorithmic Learning Theory: 24th International Conference, ALT 2013 …, 2013	76	2013
Variance-aware regret bounds for undiscounted reinforcement learning in MDPs. MS Talebi, OA Maillard. Algorithmic Learning Theory, 770-805, 2018	74	2018
LSTD with random projections. M Ghavamzadeh, A Lazaric, OA Maillard, R Munos. Advances in Neural Information Processing Systems 23, 721-729, 2010	74	2010
Sub-sampling for multi-armed bandits. A Baransi, OA Maillard, S Mannor. Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	64	2014
PICOSEC: Charged particle timing at sub-25 picosecond precision with a Micromegas based detector. J Bortfeldt, F Brunbauer, C David, D Desforge, G Fanourakis, J Franchi, ... Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2018	62	2018
"How hard is my MDP?" The distribution-norm to the rescue. OA Maillard, TA Mann, S Mannor. Advances in Neural Information Processing Systems 27, 2014	61	2014
Linear regression with random projections. O Maillard, R Munos. Journal of Machine Learning Research 13 (1), 2735-2772, 2012	61	2012
Online learning in adversarial Lipschitz environments. OA Maillard, R Munos. Joint European Conference on Machine Learning and Knowledge Discovery in …, 2010	54	2010
Finite-sample analysis of Bellman residual minimization. OA Maillard, R Munos, A Lazaric, M Ghavamzadeh. Proceedings of 2nd Asian Conference on Machine Learning, 299-314, 2010	48	2010
Selecting the state-representation in reinforcement learning. OA Maillard, D Ryabko, R Munos. Advances in Neural Information Processing Systems 24, 2011	45	2011
Optimal Thompson sampling strategies for support-aware CVaR bandits. D Baudry, R Gautron, E Kaufmann, O Maillard. International Conference on Machine Learning, 716-726, 2021	39	2021
Adaptive Bandits: Towards the best history-dependent strategy. OA Maillard, R Munos. Proceedings of the Fourteenth International Conference on Artificial …, 2011	39*	2011
Tightening exploration in upper confidence reinforcement learning. H Bourel, O Maillard, MS Talebi. International Conference on Machine Learning, 1056-1066, 2020	36	2020
