Pali-x: On scaling up a multilingual vision and language model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... arXiv preprint arXiv:2305.18565, 2023 | 85 | 2023 |
Imagen editor and editbench: Advancing and evaluating text-guided image inpainting S Wang, C Saharia, C Montgomery, J Pont-Tuset, S Noy, S Pellegrini, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 85 | 2023 |
Less is more: Generating grounded navigation instructions from landmarks S Wang, C Montgomery, J Orbay, V Birodkar, A Faust, I Gur, N Jaques, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 43* | 2022 |