Evaluating commonsense in pre-trained language models X Zhou, Y Zhang, L Cui, D Huang Proceedings of the AAAI conference on artificial intelligence 34 (05), 9733-9740, 2020 | 181 | 2020 |
Annotators with attitudes: How annotator beliefs and identities bias toxic language detection M Sap, S Swayamdipta, L Vianna, X Zhou, Y Choi, NA Smith Proceedings of the 2022 Conference of the North American Chapter of the …, 2021 | 170 | 2021 |
Challenges in automated debiasing for toxic language detection X Zhou Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 125 | 2021 |
Webarena: A realistic web environment for building autonomous agents S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, Y Bisk, D Fried, ... arXiv preprint arXiv:2307.13854, 2023 | 80 | 2023 |
Clever hans or neural theory of mind? stress testing social reasoning in large language models N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ... arXiv preprint arXiv:2305.14763, 2023 | 51 | 2023 |
Linguistically-informed transformations (LIT): A method for automatically generating contrast sets C Li, L Shengshuo, LZ Liu, X Wu, X Zhou, S Steinert-Threlkeld Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting …, 2020 | 30 | 2020 |
Sotopia: Interactive evaluation for social intelligence in language agents X Zhou, H Zhu, L Mathur, R Zhang, H Yu, Z Qi, LP Morency, Y Bisk, ... arXiv preprint arXiv:2310.11667, 2023 | 24 | 2023 |
Multilevel text alignment with cross-document attention X Zhou, N Pappas, NA Smith Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 15 | 2020 |
Can llms keep a secret? testing privacy implications of language models via contextual integrity theory N Mireshghallah, H Kim, X Zhou, Y Tsvetkov, M Sap, R Shokri, Y Choi arXiv preprint arXiv:2310.17884, 2023 | 13 | 2023 |
Extracting and inferring personal attributes from dialogue Z Wang Proceedings of the 4th Workshop on NLP for Conversational AI, 2021 | 13 | 2021 |
FANToM: A benchmark for stress-testing machine theory of mind in interactions H Kim, M Sclar, X Zhou, RL Bras, G Kim, Y Choi, M Sap arXiv preprint arXiv:2310.15421, 2023 | 12 | 2023 |
Cobra frames: Contextual reasoning about effects and harms of offensive statements X Zhou, H Zhu, A Yerukola, T Davidson, JD Hwang, S Swayamdipta, ... Proceedings of the Association for Computational Linguistics (ACL), 2023 | 9 | 2023 |
Emergent Communication Fine-tuning (EC-FT) for Pretrained Language Models S Steinert-Threlkeld, X Zhou, Z Liu, CM Downey Emergent Communication Workshop at ICLR 2022, 2022 | 6 | 2022 |
Is this the real life? is this just fantasy? the misleading success of simulating social interactions with llms X Zhou, Z Su, T Eisape, H Kim, M Sap arXiv preprint arXiv:2403.05020, 2024 | 4 | 2024 |
RPD: a distance function between word embeddings X Zhou, Z Zheng, S Huang Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 3 | 2020 |
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting A Yerukola, X Zhou, M Sap arXiv preprint arXiv:2305.14755, 2023 | 1 | 2023 |
Learning to translate by learning to communicate CM Downey*, X Zhou*, LZ Liu, S Steinert-Threlkeld EMNLP 2023 MRL, 2022 | | 2022 |