Alessandro Lazaric

Cytowane przez

	Wszystkie	Od 2019
Cytowania	7109	5347
h-indeks	46	39
i10-indeks	106	94

1400

700

350

1050

2008200920102011201220132014201520162017201820192020202120222023202424 23 52 87 130 135 189 183 256 279 364 481 676 996 1176 1305 710

Dostęp publiczny

Wyświetl wszystko

19 artykułów

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Współautorzy

Matteo PirottaResearch Scientist, Meta (FAIR)Zweryfikowany adres z fb.com
Mohammad GhavamzadehAmazonZweryfikowany adres z amazon.com
Marcello RestelliAssociate Professor, Politecnico di MilanoZweryfikowany adres z polimi.it
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindZweryfikowany adres z meta.com
Rémi MunosGoogle DeepMindZweryfikowany adres z inria.fr
Andrea BonariniFull Professor, Politecnico di Milano, Dipartimento di Eletronica, Informazione e Biongegneria, AIZweryfikowany adres z polimi.it
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityZweryfikowany adres z cs.stanford.edu
Daniele CalandrielloResearch Scientist, DeepMindZweryfikowany adres z google.com
Jean TarbouriechGoogle DeepMindZweryfikowany adres z google.com
Marc AbeilleCriteoZweryfikowany adres z ens-cachan.fr
Andrea TirinzoniMetaZweryfikowany adres z fb.com
Ronan FruitPhD candidate, Inria Lille, SequeL teamZweryfikowany adres z inria.fr
Evrard GarcelonFacebook AI ResearchZweryfikowany adres z fb.com
Andrea ZanetteAssistant Professor, Carnegie Mellon UniversityZweryfikowany adres z andrew.cmu.edu
Marta SoareUniversité d'OrléansZweryfikowany adres z univ-orleans.fr
Denis YaratsCofounder and CTO, Perplexity AIZweryfikowany adres z perplexity.ai
Lerrel PintoNew York UniversityZweryfikowany adres z cs.nyu.edu
Anima AnandkumarCalifornia Institute of Technology and NVIDIAZweryfikowany adres z caltech.edu
Kamyar AzizzadenesheliNvidiaZweryfikowany adres z nvidia.com
Amir SaniTechstarsZweryfikowany adres z amirsani.com

Obserwuj

Alessandro Lazaric

Research Scientist, Facebook Artificial Intelligence Research

Zweryfikowany adres z inria.fr - Strona główna

Machine Learning


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
Transfer in reinforcement learning: a framework and a survey A Lazaric Reinforcement Learning: State-of-the-Art, 143-173, 2012	363	2012
Best arm identification: A unified approach to fixed budget and fixed confidence V Gabillon, M Ghavamzadeh, A Lazaric Advances in Neural Information Processing Systems 25, 2012	342	2012
Linear thompson sampling revisited M Abeille, A Lazaric Artificial Intelligence and Statistics, 176-184, 2017	269	2017
Mastering visual continuous control: Improved data-augmented reinforcement learning D Yarats, R Fergus, A Lazaric, L Pinto arXiv preprint arXiv:2107.09645, 2021	256	2021
Learning near optimal policies with low inherent bellman error A Zanette, A Lazaric, M Kochenderfer, E Brunskill International Conference on Machine Learning, 10978-10989, 2020	232	2020
Best-arm identification in linear bandits M Soare, A Lazaric, R Munos Advances in Neural Information Processing Systems 27, 2014	212	2014
Reinforcement learning with prototypical representations D Yarats, R Fergus, A Lazaric, L Pinto International Conference on Machine Learning, 11920-11931, 2021	210	2021
Transfer of samples in batch reinforcement learning A Lazaric, M Restelli, A Bonarini Proceedings of the 25th international conference on Machine learning, 544-551, 2008	209	2008
Reinforcement learning in continuous action spaces through sequential monte carlo methods A Lazaric, M Restelli, A Bonarini Advances in neural information processing systems 20, 2007	196	2007
Risk-aversion in multi-armed bandits A Sani, A Lazaric, R Munos Advances in neural information processing systems 25, 2012	185	2012
Frequentist regret bounds for randomized least-squares value iteration A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020	145	2020
Bayesian multi-task reinforcement learning A Lazaric, M Ghavamzadeh ICML-27th international conference on machine learning, 599-606, 2010	143	2010
Reinforcement learning of pomdps using spectral methods K Azizzadenesheli, A Lazaric, A Anandkumar Conference on Learning Theory, 193-256, 2016	139	2016
Finite-sample analysis of least-squares policy iteration A Lazaric, M Ghavamzadeh, R Munos Journal of Machine Learning Research 13, 3041-3074, 2012	132	2012
Upper-confidence-bound algorithms for active learning in multi-armed bandits A Carpentier, A Lazaric, M Ghavamzadeh, R Munos, P Auer International Conference on Algorithmic Learning Theory, 189-203, 2011	127	2011
Multi-bandit best arm identification V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck Advances in Neural Information Processing Systems 24, 2011	127	2011
Sequential transfer in multi-armed bandit with finite set of models A Lazaric, E Brunskill Advances in Neural Information Processing Systems 26, 2013	119	2013
Efficient bias-span-constrained exploration-exploitation in reinforcement learning R Fruit, M Pirotta, A Lazaric, R Ortner International Conference on Machine Learning, 1578-1586, 2018	114	2018
Improved regret bounds for thompson sampling in linear quadratic control problems M Abeille, A Lazaric International Conference on Machine Learning, 1-9, 2018	104	2018
A truthful learning mechanism for contextual multi-slot sponsored search auctions with externalities N Gatti, A Lazaric, F Trovò Proceedings of the 13th ACM Conference on Electronic Commerce, 605-622, 2012	98	2012

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez

Współautorzy