Obserwuj
Esin Durmus
Tytuł
Cytowane przez
Cytowane przez
Rok
On the opportunities and risks of foundation models
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
26782021
Holistic evaluation of language models
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
arXiv preprint arXiv:2211.09110, 2022
5672022
FEQA: A question answering evaluation framework for faithfulness assessment in abstractive summarization
E Durmus, H He, M Diab
ACL, 2020
3412020
WikiLingua: A new benchmark dataset for cross-lingual abstractive summarization
F Ladhak, E Durmus, C Cardie, K McKeown
arXiv preprint arXiv:2010.03093, 2020
1602020
Benchmarking large language models for news summarization
T Zhang, F Ladhak, E Durmus, P Liang, K McKeown, TB Hashimoto
Transactions of the Association for Computational Linguistics 12, 39-57, 2024
1502024
The gem benchmark: Natural language generation, its evaluation and metrics
S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ...
arXiv preprint arXiv:2102.01672, 2021
1342021
Whose opinions do language models reflect?
S Santurkar, E Durmus, F Ladhak, C Lee, P Liang, T Hashimoto
International Conference on Machine Learning, 29971-30004, 2023
1292023
Easily accessible text-to-image generation amplifies demographic stereotypes at large scale
F Bianchi, P Kalluri, E Durmus, F Ladhak, M Cheng, D Nozza, ...
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023
1142023
Exploring the role of prior beliefs for argument persuasion
E Durmus, C Cardie
NAACL, 2018
692018
Faithful or extractive? on mitigating the faithfulness-abstractiveness trade-off in abstractive summarization
F Ladhak, E Durmus, H He, C Cardie, K McKeown
arXiv preprint arXiv:2108.13684, 2021
572021
Towards measuring the representation of subjective global opinions in language models
E Durmus, K Nyugen, TI Liao, N Schiefer, A Askell, A Bakhtin, C Chen, ...
arXiv preprint arXiv:2306.16388, 2023
542023
Evaluating human-language model interaction
M Lee, M Srivastava, A Hardy, J Thickstun, E Durmus, A Paranjape, ...
arXiv preprint arXiv:2212.09746, 2022
542022
Marked personas: Using natural language prompts to measure stereotypes in language models
M Cheng, E Durmus, D Jurafsky
arXiv preprint arXiv:2305.18189, 2023
462023
Studying large language model generalization with influence functions
R Grosse, J Bae, C Anil, N Elhage, A Tamkin, A Tajdini, B Steiner, D Li, ...
arXiv preprint arXiv:2308.03296, 2023
452023
Exploring the Role of Argument Structure in Online Debate Persuasion
J Li, E Durmus, C Cardie
EMNLP, 2020
362020
Persuasion of the Undecided: Language vs. the Listener.
L Longpre, E Durmus, C Cardie
Proceedings of the 6th Workshop on Argument Mining, 2019
342019
Towards understanding sycophancy in language models
M Sharma, M Tong, T Korbak, D Duvenaud, A Askell, SR Bowman, ...
arXiv preprint arXiv:2310.13548, 2023
332023
Measuring faithfulness in chain-of-thought reasoning
T Lanham, A Chen, A Radhakrishnan, B Steiner, C Denison, ...
arXiv preprint arXiv:2307.13702, 2023
322023
A corpus for modeling user and language effects in argumentation on online debating
E Durmus, C Cardie
ACL, 2019
282019
Language modeling via stochastic processes
RE Wang, E Durmus, N Goodman, T Hashimoto
arXiv preprint arXiv:2203.11370, 2022
272022
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20