Pablo Samuel Castro
From taxi GPS traces to social and community dynamics: A survey
PS Castro, D Zhang, C Chen, S Li, G Pan
ACM Computing Surveys (CSUR) 46 (2), 1-34, 2013
Urban traffic modelling and prediction using large scale taxi GPS traces
PS Castro, D Zhang, S Li
International Conference on Pervasive Computing, 57-72, 2012
iBOAT: Isolation-based online anomalous trajectory detection
C Chen, D Zhang, PS Castro, N Li, L Sun, S Li, Z Wang
IEEE Transactions on Intelligent Transportation Systems 14 (2), 806-818, 2013
Dopamine: A research framework for deep reinforcement learning
PS Castro, S Moitra, C Gelada, S Kumar, MG Bellemare
arXiv preprint arXiv:1812.06110, 2018
Real-time detection of anomalous taxi trajectories from GPS traces
C Chen, D Zhang, PS Castro, N Li, L Sun, S Li
International Conference on Mobile and Ubiquitous Systems: Computing …, 2011
Methods for computing state similarity in Markov decision processes
N Ferns, PS Castro, D Precup, P Panangaden
arXiv preprint arXiv:1206.6836, 2012
TF-Agents: A library for reinforcement learning in TensorFlow
S Guadarrama, A Korattikara, O Ramirez, P Castro, E Holly, S Fishman, ...
GitHub repository, 2018
Rigging the lottery: Making all tickets winners
U Evci, T Gale, J Menick, PS Castro, E Elsen
International Conference on Machine Learning, 2943-2952, 2020
A comparative analysis of expected and distributional reinforcement learning
C Lyle, MG Bellemare, PS Castro
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4504-4511, 2019
A geometric perspective on optimal representations for reinforcement learning
MG Bellemare, W Dabney, R Dadashi, AA Taiga, PS Castro, NL Roux, ...
arXiv preprint arXiv:1901.11530, 2019
An Atari model zoo for analyzing, visualizing, and comparing deep reinforcement learning agents
FP Such, V Madhavan, R Liu, R Wang, PS Castro, Y Li, J Zhi, L Schubert, ...
arXiv preprint arXiv:1812.07069, 2018
Real time anomalous trajectory detection and analysis
L Sun, D Zhang, C Chen, PS Castro, S Li, Z Wang
Mobile Networks and Applications 18 (3), 341-356, 2013
Using Linear Programming for Bayesian Exploration in Markov Decision Processes
PS Castro, D Precup
IJCAI, 2437-2442, 2007
Automatic construction of temporally extended actions for MDPs using bisimulation metrics
PS Castro, D Precup
European Workshop on Reinforcement Learning, 140-152, 2011
Using bisimulation for policy transfer in MDPs
P Castro, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 24 (1), 2010
Equivalence relations in fully and partially observable Markov decision processes
PS Castro, P Panangaden, D Precup
Twenty-First International Joint Conference on Artificial Intelligence, 2009
Smarter sampling in model-based Bayesian reinforcement learning
PS Castro, D Precup
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2010
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
Scalable methods for computing state similarity in deterministic Markov Decision Processes
PS Castro
Proceedings of the AAAI Conference on Artificial Intelligence 34 (06), 10069 …, 2020
Distributional reinforcement learning with linear function approximation
MG Bellemare, N Le Roux, PS Castro, S Moitra
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019