Ryan Carey

Cited by

	All	Since 2019
Citations	160	151
h-index	8	8
i10-index	7	7

Citations per year
Year	Citations
2017	4
2018	4
2019	9
2020	21
2021	21
2022	26
2023	49
2024	25

Public access (based on funding mandates)

4 articles available
0 articles not available

Co-authors

Tom Everitt, Staff Research Scientist at Google DeepMind, verified email at google.com

Ryan Carey

University of Oxford

Verified email at philosophy.ox.ac.uk

AI Safety, Causality, Incentives


Title	Authors	Venue	Cited by	Year
Agent Incentives: A Causal Perspective	T Everitt, R Carey, E Langlois, PA Ortega, S Legg	AAAI, 2021	38	2021
Incorrigibility in the CIRL Framework	R Carey	Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 30-35, 2018	22*	2018
Path-Specific Objectives for Safer Agent Incentives	S Farquhar, R Carey, T Everitt	AAAI, 2022	15	2022
Predicting human deliberative judgments with machine learning	O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ...	Technical report, University of Oxford, 2018	15	2018
Why Fair Labels Can Yield Unfair Predictions: Graphical Conditions for Introduced Unfairness	C Ashurst, R Carey, S Chiappa, T Everitt	AAAI, 2022	14	2022
The Incentives that Shape Behaviour	R Carey, E Langlois, T Everitt, S Legg	Safe AI AAAI Workshop, 2020	14	2020
Interpreting AI Compute Trends	R Carey	AI Impacts Blog, 2018	10	2018
Reasoning about Causality in Games	L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge	AI Journal, 2023	9	2023
The Effective Altruism Handbook	R Carey	The Centre for Effective Altruism, 2015	8	2015
PyCID: A Python Library for Causal Influence Diagrams	J Fox, T Everitt, R Carey, E Langlois, A Abate, M Wooldridge	SciPy, 2021	5	2021
Human Control: Definitions and Algorithms	R Carey, T Everitt	UAI, 2023	4	2023
A Complete Criterion for Value of Information in Soluble Influence Diagrams	C van Merwijk, R Carey, T Everitt	AAAI, 2022	3	2022
DE-HNN: An effective neural model for Circuit Netlist representation	Z Luo, TS Hy, P Tabaghi, D Koh, M Defferrard, E Rezaei, R Carey, ...	arXiv preprint arXiv:2404.00477, 2024	2	2024
Context aware system with multiple power consumption modes	T Jarosinski, S Sadasivam, RM Carey, J Lee, B Dhingra, A Bisain, ...	US Patent 9,622,177, 2017	1	2017
Reasoning about Causality in Games (Abstract Reprint)	L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge	Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22697 …, 2024		2024
(When) Is Truth-telling Favored in AI Debate?	V Kovařík, R Carey	SafeAI AAAI Workshop, 2019		2019
How useful is quantilization for mitigating specification-gaming?	R Carey	SafeML Workshop at International Conference on Learning Representations, 2019		2019
