CREPE: Can vision-language foundation models reason compositionally? | Z Ma, J Hong, MO Gul, M Gandhi, I Gao, R Krishna | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 61 | 2023 |
Measuring compositional consistency for video question answering | M Gandhi, MO Gul, E Prakash, M Grunde-McLaughlin, R Krishna, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 11 | 2022 |