Obserwuj
Pierluca D'Oro
Pierluca D'Oro
PhD Student, Mila - Québec Artificial Intelligence Institute
Zweryfikowany adres z mila.quebec
Tytuł
Cytowane przez
Cytowane przez
Rok
The primacy bias in deep reinforcement learning
E Nikishin*, M Schwarzer*, P D’Oro*, PL Bacon, A Courville
International conference on machine learning, 16828-16847, 2022
522022
Gradient-Aware Model-based Policy Search
P D'Oro*, AM Metelli*, A Tirinzoni, M Papini, M Restelli
The Thirty-Fourth AAAI Conference on Artificial Intelligence, 3801-3808, 2020
372020
Adversarial framework for unsupervised learning of motion dynamics in videos
C Spampinato, S Palazzo, P D’Oro, D Giordano, M Shah
International Journal of Computer Vision, 1-20, 2019
25*2019
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
P D'Oro*, M Schwarzer*, E Nikishin, PL Bacon, MG Bellemare, A Courville
International Conference on Learning Representations (ICLR), 𝐍𝐨𝐭𝐚𝐛𝐥𝐞-𝐭𝐨𝐩-𝟓%, 2023
242023
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P D'Oro, W Jaśkowski
Advances in Neural Information Processing Systems 34, 2020
242020
Policy Optimization as Online Learning with Mediator Feedback
AM Metelli*, M Papini*, P D'Oro, M Restelli
The Thirty-Fifth AAAI Conference on Artificial Intelligence, 8958-8966, 2021
102021
Real-time Classification from Short Event-Camera Streams using Input-filtering Neural ODEs
G Giannone, A Anoosheh, A Quaglino, P D'Oro, M Gallieri, J Masci
NeurIPS workshop on Interpretable Inductive Biases and Physically Structured …, 2020
72020
Group Anomaly Detection via Graph Autoencoders
P D’Oro, E Nasca, J Masci, M Matteucci
NeurIPS Graph Representation Learning Workshop, 2019
72019
SMfinder: Small Molecules Finder for Metabolomics and Lipidomics analysis
G Martano, M Leone, P D'Oro, V Matafora, A Cattaneo, M Masseroli, ...
Analytical Chemistry, 2020
52020
Long-Term Credit Assignment via Model-based Temporal Shortcuts
M Ma, P D'Oro, Y Bengio, PL Bacon
Deep RL Workshop NeurIPS, 2021
22021
Motif: Intrinsic Motivation From Artificial Intelligence Feedback
M Klissarov*, P D’Oro*, S Sodhani, R Raileanu, PL Bacon, P Vincent, ...
arXiv preprint arXiv:2310.00166, 2023
12023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
N Rahn*, P D'Oro*, H Wiltzer, PL Bacon, MG Bellemare
arXiv preprint arXiv:2309.14597, 2023
12023
Meta Dynamic Programming
P D’Oro, PL Bacon
NeurIPS Workshop on Metacognition in the Age of AI: Challenges and Opportunities, 2021
12021
Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
Z Lin, P D'Oro, E Nikishin, A Courville
Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
2022
Beyond maximum likelihood model estimation in model-based policy search
P D’Oro
Politecnico di Milano Digital Archive, Italy, 2019
2019
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–15