Ryan Carey
Ryan Carey
Zweryfikowany adres z philosophy.ox.ac.uk - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Incorrigibility in the CIRL Framework
R Carey
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 30-35, 2018
182018
The Incentives that Shape Behaviour
R Carey, E Langlois, T Everitt, S Legg
Safe AI AAAI Workshop, 2020
82020
Predicting human deliberative judgments with machine learning
O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ...
Technical report, University of Oxford, 2018
82018
Interpreting AI Compute Trends
R Carey
AI Impacts Blog, 2018
82018
Agent Incentives: A Causal Perspective
T Everitt, R Carey, E Langlois, PA Ortega, S Legg
AAAI, 2021
62021
The Effective Altruism Handbook
R Carey
The Centre for Effective Altruism, 2015
62015
PyCID: A Python Library for Causal Influence Diagrams
J Fox, T Everitt, R Carey, E Langlois, A Abate, M Wooldridge
2021
(When) Is Truth-telling Favored in AI Debate?
V Kovařík, R Carey
SafeAI AAAI Workshop, 2019
2019
How useful is quantilization for mitigating specification-gaming?
R Carey
SafeML Workshop at International Conference on Learning Representations, 2019
2019
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–9