Pierre-Luc Bacon

Cytowane przez

	Wszystkie	Od 2019
Cytowania	2431	2129
h-indeks	17	17
i10-indeks	24	21

520

260

130

390

20162017201820192020202120222023202432 74 184 263 351 351 422 513 229

Dostęp publiczny

Wyświetl wszystko

7 artykułów

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Współautorzy

Doina PrecupDeepMind and McGill UniversityZweryfikowany adres z cs.mcgill.ca
Jean HarbOpenAIZweryfikowany adres z openai.com
Emmanuel BengioMcGill UniversityZweryfikowany adres z mail.mcgill.ca
Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaZweryfikowany adres z cs.mcgill.ca
Martin KlissarovMcGill University, MilaZweryfikowany adres z mail.mcgill.ca
Ahmed TouatiMeta AIZweryfikowany adres z umontreal.ca
Pascal VincentFacebook AI Research; U. Montreal (Professor, Computer Sc. & Op. Res.); MILA; CIFARZweryfikowany adres z iro.umontreal.ca
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityZweryfikowany adres z cs.stanford.edu
Yao LiuAmazonZweryfikowany adres z stanford.edu
Timothy A MannMetaZweryfikowany adres z fb.com
Daniel J. MankowitzGoogle DeepmindZweryfikowany adres z google.com
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchZweryfikowany adres z technion.ac.il
Anna HarutyunyanDeepMindZweryfikowany adres z google.com
Borja BalleDeepMindZweryfikowany adres z google.com
Anima AnandkumarCalifornia Institute of Technology and NVIDIAZweryfikowany adres z caltech.edu
David MegerAssociate Professor at McGill UniversityZweryfikowany adres z cim.mcgill.ca

Obserwuj

Pierre-Luc Bacon

University of Montreal

Zweryfikowany adres z mila.quebec - Strona główna

reinforcement learning artificial intelligence


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1193	2017
Conditional computation in neural networks for faster models E Bengio, PL Bacon, J Pineau, D Precup arXiv preprint arXiv:1511.06297, 2015	329	2015
When waiting is not an option: Learning options with a deliberation cost J Harb, PL Bacon, M Klissarov, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	154	2018
The primacy bias in deep reinforcement learning E Nikishin, M Schwarzer, P D’Oro, PL Bacon, A Courville International conference on machine learning, 16828-16847, 2022	100	2022
Learnings options end-to-end for continuous action tasks M Klissarov, PL Bacon, J Harb, D Precup arXiv preprint arXiv:1712.00004, 2017	58	2017
Convergent tree backup and retrace with function approximation A Touati, PL Bacon, D Precup, P Vincent International Conference on Machine Learning, 4955-4964, 2018	46	2018
Learning robust options D Mankowitz, T Mann, PL Bacon, D Precup, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	45	2018
Sample-efficient reinforcement learning by breaking the replay ratio barrier P D'Oro, M Schwarzer, E Nikishin, PL Bacon, MG Bellemare, A Courville Deep Reinforcement Learning Workshop NeurIPS 2022, 2022	44	2022
Options of interest: Temporal abstraction with interest functions K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020	44	2020
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling Y Liu, PL Bacon, E Brunskill International Conference on Machine Learning, 6184-6193, 2020	39	2020
Policy evaluation networks J Harb, T Schaul, D Precup, PL Bacon arXiv preprint arXiv:2002.11833, 2020	39	2020
Control-oriented model-based reinforcement learning with implicit differentiation E Nikishin, R Abachi, R Agarwal, PL Bacon Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7886-7894, 2022	29	2022
Temporal Representation Learning PL Bacon McGill University (Canada), 2018	29	2018
Direct behavior specification via constrained reinforcement learning J Roy, R Girgis, J Romoff, PL Bacon, C Pal arXiv preprint arXiv:2112.12228, 2021	24	2021
Learning with options that terminate off-policy A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	23	2018
An information-theoretic perspective on credit assignment in reinforcement learning D Arumugam, P Henderson, PL Bacon arXiv preprint arXiv:2103.06224, 2021	19	2021
Xlvin: executed latent value iteration nets A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić arXiv preprint arXiv:2010.13146, 2020	18	2020
Continuous-time meta-learning with forward mode differentiation T Deleu, D Kanaa, L Feng, G Kerg, Y Bengio, G Lajoie, PL Bacon arXiv preprint arXiv:2203.01443, 2022	17	2022
Neural algorithmic reasoners are implicit planners AI Deac, P Veličković, O Milinkovic, PL Bacon, J Tang, M Nikolic Advances in Neural Information Processing Systems 34, 15529-15542, 2021	15	2021
The barbados 2018 list of open issues in continual learning T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ... arXiv preprint arXiv:1811.07004, 2018	13	2018

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez

Współautorzy