Diana Borsa

Cited by

	All	Since 2019
Citations	1038	933
h-index	14	14
i10-index	18	18

Citations per year

Year	Citations
2014	4
2015	5
2016	17
2017	28
2018	48
2019	78
2020	134
2021	201
2022	218
2023	237
2024	65

Public access (based on funding mandates)

Available: 2 articles
Not available: 0 articles

Co-authors

Andre Barreto	Research Scientist, Google DeepMind	Verified email at google.com
Tom Schaul	Senior Staff Scientist, DeepMind	Verified email at nyu.edu
Rémi Munos	DeepMind	Verified email at inria.fr
David Silver	DeepMind, UCL	Verified email at google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com
Doina Precup	DeepMind and McGill University	Verified email at cs.mcgill.ca
Will Dabney	DeepMind	Verified email at google.com
Matteo Hessel	Research Engineer, Google DeepMind	Verified email at google.com
Daniel J. Mankowitz	Google Deepmind	Verified email at google.com
Ingemar J. Cox	Department of Computer Science, University College London / University of Copenhagen	Verified email at ucl.ac.uk
Elad Yom-Tov	Bar Ilan University	Verified email at yom-tov.info
Augustin Zidek	Research Engineer, DeepMind	Verified email at google.com
Nicolas Heess	DeepMind	Verified email at google.com
Anna Harutyunyan	DeepMind	Verified email at google.com
Thore Graepel	Global Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCL	Verified email at ucl.ac.uk
GHEORGHE COMANICI	Research Scientist, DeepMind	Verified email at deepmind.com
Bilal Piot	Google Deepmind	Verified email at google.com
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
John Shawe-Taylor	UCL	Verified email at cs.ucl.ac.uk
Mark Rowland	Research Scientist, Google DeepMind	Verified email at google.com

Diana Borsa

DeepMind

Verified email at google.com

Reinforcement Learning, Machine Learning, Artificial Intelligence, Exploration


Title	Authors	Publication	Cited by	Year
Transfer in deep reinforcement learning using successor features and generalised policy improvement	A Barreto, D Borsa, J Quan, T Schaul, D Silver, M Hessel, D Mankowitz, ...	International Conference on Machine Learning, 501-510, 2018	180	2018
Fast reinforcement learning with generalized policy updates	A Barreto, S Hou, D Borsa, D Silver, D Precup	Proceedings of the National Academy of Sciences 117 (48), 30079-30087, 2020	120	2020
Universal successor features approximators	D Borsa, A Barreto, J Quan, D Mankowitz, R Munos, H Van Hasselt, ...	arXiv preprint arXiv:1812.07626, 2018	118	2018
The option keyboard: Combining skills in reinforcement learning	A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ...	Advances in Neural Information Processing Systems 32, 2019	91	2019
Detecting disease outbreaks in mass gatherings using Internet data	E Yom-Tov, D Borsa, IJ Cox, RA McKendry	Journal of Medical Internet Research 16 (6), e154, 2014	73	2014
Observational learning by reinforcement learning	D Borsa, B Piot, R Munos, O Pietquin	arXiv preprint arXiv:1706.06617, 2017	68	2017
Ray interference: a source of plateaus in deep reinforcement learning	T Schaul, D Borsa, J Modayil, R Pascanu	arXiv preprint arXiv:1904.11455, 2019	66	2019
The termination critic	A Harutyunyan, W Dabney, D Borsa, N Heess, R Munos, D Precup	arXiv preprint arXiv:1902.09996, 2019	53	2019
Learning shared representations in multi-task reinforcement learning	D Borsa, T Graepel, J Shawe-Taylor	arXiv preprint arXiv:1603.02041, 2016	44	2016
Expected eligibility traces	H van Hasselt, S Madjiheurem, M Hessel, D Silver, A Barreto, D Borsa	Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9997 …, 2021	42	2021
Automatic identification of web-based risk markers for health events	E Yom-Tov, D Borsa, AC Hayward, RA McKendry, IJ Cox	Journal of Medical Internet Research 17 (1), e29, 2015	33	2015
Training deep neural nets to aggregate crowdsourced responses	A Gaunt, D Borsa, Y Bachrach	Proceedings of the Thirty-Second Conference on Uncertainty in Artificial …, 2016	32	2016
When should agents explore?	M Pislar, D Szepesvari, G Ostrovski, D Borsa, T Schaul	arXiv preprint arXiv:2108.11811, 2021	26	2021
Adapting behaviour for learning progress	T Schaul, D Borsa, D Ding, D Szepesvari, G Ostrovski, W Dabney, ...	arXiv preprint arXiv:1912.06910, 2019	15	2019
Temporal difference uncertainties as a signal for exploration	S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ...	arXiv preprint arXiv:2010.02255, 2020	14	2020
Return-based scaling: Yet another normalisation trick for deep RL	T Schaul, G Ostrovski, I Kemaev, D Borsa	arXiv preprint arXiv:2105.05347, 2021	13	2021
Conditional importance sampling for off-policy learning	M Rowland, A Harutyunyan, H Hasselt, D Borsa, T Schaul, R Munos, ...	International Conference on Artificial Intelligence and Statistics, 45-55, 2020	12	2020
General non-linear Bellman equations	H van Hasselt, J Quan, M Hessel, Z Xu, D Borsa, A Barreto	arXiv preprint arXiv:1907.03687, 2019	10	2019
Model-value inconsistency as a signal for epistemic uncertainty	A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ...	arXiv preprint arXiv:2112.04153, 2021	9	2021
Generalised policy improvement with geometric policy composition	S Thakoor, M Rowland, D Borsa, W Dabney, R Munos, A Barreto	International Conference on Machine Learning, 21272-21307, 2022	6	2022
