Ryan Carey
Ryan Carey
Zweryfikowany adres z philosophy.ox.ac.uk - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Incorrigibility in the CIRL Framework
R Carey
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 30-35, 2018
102018
Predicting human deliberative judgments with machine learning
O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ...
Technical report, Technical report, University of Oxford, 2018
72018
Interpreting AI Compute Trends
R Carey
AI Impacts Blog, 2018
72018
The Effective Altruism Handbook
R Carey
The Centre for Effective Altruism, 2015
42015
The incentives that shape behaviour
R Carey, E Langlois, T Everitt, S Legg
arXiv preprint arXiv:2001.07118, 2020
32020
(When) Is Truth-telling Favored in AI Debate?
V Kovařík, R Carey
SafeAI AAAI Workshop, 2019
2019
How useful is quantilization for mitigating specification-gaming?
R Carey
SafeML Workshop at International Conference on Learning Representations, 2019
2019
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–7