Laura Weidinger

Cytowane przez

	Wszystkie	Od 2019
Cytowania	2288	2286
h-indeks	11	11
i10-indeks	13	13

1300

650

325

975

202020212022202320248 28 379 1252 601

Dostęp publiczny

Wyświetl wszystko

2 artykuły

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Obserwuj

Laura Weidinger

Senior Research Scientist at DeepMind

Zweryfikowany adres z google.com


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021	752	2021
Ethical and social risks of harm from language models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359, 2021	603	2021
Taxonomy of risks posed by language models L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	312	2022
Improving alignment of dialogue agents via targeted human judgements A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ... arXiv preprint arXiv:2209.14375, 2022	294	2022
Alignment of language agents Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving arXiv preprint arXiv:2103.14659, 2021	113	2021
Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents R Köster, D Hadfield-Menell, R Everett, L Weidinger, GK Hadfield, ... Proceedings of the National Academy of Sciences 119 (3), e2106028118, 2022	34	2022
Sociotechnical safety evaluation of generative ai systems L Weidinger, M Rauh, N Marchal, A Manzini, LA Hendricks, ... arXiv preprint arXiv:2310.11986, 2023	33	2023
Characteristics of harmful text: Towards rigorous benchmarking of language models M Rauh, J Mellor, J Uesato, PS Huang, J Welbl, L Weidinger, S Dathathri, ... Advances in Neural Information Processing Systems 35, 24720-24739, 2022	25	2022
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences R Köster, KR McKee, R Everett, L Weidinger, WS Isaac, E Hughes, ... arXiv preprint arXiv:2010.09054, 2020	24	2020
Social conformity in autism SC Lazzaro, L Weidinger, RA Cooper, S Baron-Cohen, C Moutsiana, ... Journal of Autism and Developmental Disorders 49, 1304-1315, 2019	24	2019
Using the Veil of Ignorance to align AI systems with principles of justice L Weidinger, KR McKee, R Everett, S Huang, TO Zhu, MJ Chadwick, ... Proceedings of the National Academy of Sciences 120 (18), e2213709120, 2023	19	2023
Scaling language models: Methods, analysis & insights from training gopher. arXiv 2021 JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021	10	2021
Test-retest reliability of canonical reinforcement learning models L Weidinger, A Gradassi, L Molleman, W van den Bos	10*
Improving alignment of dialogue agents via targeted human judgements, 2022 A Glaese, N McAleese, M Trebacz, J Aslanides, V Firoiu, T Ewalds, ... URL https://storage. googleapis. com/deepmind-media/DeepMind. com/Authors …, 2022	9	2022
Language modelling at scale: Gopher, ethical considerations, and retrieval J Rae, G Irving, L Weidinger DeepMind Blog, 2021	8	2021
Accounting for offensive speech as a practice of resistance M Díaz, R Amironesei, L Weidinger, I Gabriel Proceedings of the sixth workshop on online abuse and harms (woah), 192-202, 2022	7	2022
Test–retest reliability of reinforcement learning parameters JV Schaaf, L Weidinger, L Molleman, W van den Bos Behavior Research Methods, 1-18, 2023	4	2023
Artificial moral cognition: Learning from developmental psychology L Weidinger, MG Reinecke, J Haas PsyArXiv, 2022	4	2022
Modelling Cooperation in Network Games with Spatio-Temporal Complexity MA Bakker, R Everett, L Weidinger, I Gabriel, WS Isaac, JZ Leibo, ... arXiv preprint arXiv:2102.06911, 2021	3	2021
Holistic Safety and Responsibility Evaluations of Advanced AI Models L Weidinger, J Barnhart, J Brennan, C Butterfield, S Young, W Hawkins, ... arXiv preprint arXiv:2404.14068, 2024		2024

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez