Obserwuj
Peter Stone
Tytuł
Cytowane przez
Cytowane przez
Rok
Transfer learning for reinforcement learning domains: A survey.
ME Taylor, P Stone
Journal of Machine Learning Research 10 (7), 2009
23872009
Deep recurrent q-learning for partially observable mdps
M Hausknecht, P Stone
2015 aaai fall symposium series, 2015
22912015
Multiagent systems: A survey from a machine learning perspective
P Stone, M Veloso
Autonomous Robots 8, 345-383, 2000
20422000
A multiagent approach to autonomous intersection management
K Dresner, P Stone
Journal of artificial intelligence research 31, 591-656, 2008
15972008
Artificial intelligence and life in 2030: the one hundred year study on artificial intelligence
P Stone, R Brooks, E Brynjolfsson, R Calo, O Etzioni, G Hager, ...
arXiv preprint arXiv:2211.06318, 2022
11942022
Layered learning in multiagent systems
PH Stone
Carnegie Mellon University, 1998
1046*1998
Multiagent traffic management: A reservation-based intersection control mechanism
K Dresner, P Stone
Autonomous Agents and Multiagent Systems, International Joint Conference on …, 2004
9102004
Policy gradient reinforcement learning for fast quadrupedal locomotion
N Kohl, P Stone
IEEE International Conference on Robotics and Automation, 2004. Proceedings …, 2004
8292004
Behavioral cloning from observation
F Torabi, G Warnell, P Stone
arXiv preprint arXiv:1805.01954, 2018
7822018
Scalable training of artificial neural networks with adaptive sparse connectivity inspired by network science
DC Mocanu, E Mocanu, P Stone, PH Nguyen, M Gibescu, A Liotta
Nature communications 9 (1), 2383, 2018
6932018
Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork
P Stone, M Veloso
Artificial Intelligence 110 (2), 241-273, 1999
6511999
Interactively shaping agents via human reinforcement: The TAMER framework
WB Knox, P Stone
Proceedings of the fifth international conference on Knowledge capture, 9-16, 2009
6252009
Reinforcement learning for robocup soccer keepaway
P Stone, RS Sutton, G Kuhlmann
Adaptive Behavior 13 (3), 165-188, 2005
6152005
Autonomous agents modelling other agents: A comprehensive survey and open problems
SV Albrecht, P Stone
Artificial Intelligence 258, 66-95, 2018
5862018
Curriculum learning for reinforcement learning domains: A framework and survey
S Narvekar, B Peng, M Leonetti, J Sinapov, ME Taylor, P Stone
Journal of Machine Learning Research 21 (181), 1-50, 2020
5582020
The RoboCup synthetic agent challenge 97
H Kitano, M Tambe, P Stone, M Veloso, S Coradeschi, E Osawa, ...
RoboCup-97: Robot Soccer World Cup I 1, 62-73, 1998
5001998
Ad hoc autonomous agent teams: Collaboration without pre-coordination
P Stone, G Kaminka, S Kraus, J Rosenschein
Proceedings of the AAAI Conference on Artificial Intelligence 24 (1), 1504-1509, 2010
4702010
PAC subset selection in stochastic multi-armed bandits.
S Kalyanakrishnan, A Tewari, P Auer, P Stone
ICML 12, 655-662, 2012
4372012
Outracing champion Gran Turismo drivers with deep reinforcement learning
PR Wurman, S Barrett, K Kawamoto, J MacGlashan, K Subramanian, ...
Nature 602 (7896), 223-228, 2022
4192022
Deep reinforcement learning in parameterized action space
M Hausknecht, P Stone
arXiv preprint arXiv:1511.04143, 2015
4042015
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20