Yonatan Bitton
Research Scientist, Google
Verified email at google.com - Homepage
Title
Cited by
Year
OpenFlamingo: An open-source framework for training large autoregressive vision-language models
A Awadalla, I Gao, J Gardner, J Hessel, Y Hanafy, W Zhu, K Marathe, ...
arXiv preprint arXiv:2308.01390, 2023
157
2023
DataComp: In search of the next generation of multimodal datasets
SY Gadre, G Ilharco, A Fang, J Hayase, G Smyrnis, T Nguyen, R Marten, ...
Advances in Neural Information Processing Systems 36, 2024
108
2024
OpenFlamingo
A Awadalla, I Gao, J Gardner, J Hessel, Y Hanafy, W Zhu, K Marathe, ...
Zenodo, March, 2023
36*
2023
Automatic generation of contrast sets from scene graphs: Probing the compositional consistency of GQA
Y Bitton, G Stanovsky, R Schwartz, M Elhadad
NAACL 2021, 2021
29
2021
Breaking common sense: WHOOPS! A vision-and-language benchmark of synthetic and compositional images
N Bitton-Guetta, Y Bitton, J Hessel, L Schmidt, Y Elovici, G Stanovsky, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
26
2023
Data efficient masked language modeling for vision and language
Y Bitton, G Stanovsky, M Elhadad, R Schwartz
EMNLP 2021, Findings, 2021
22
2021
VisIT-Bench: A benchmark for vision-language instruction following inspired by real-world use
Y Bitton, H Bansal, J Hessel, R Shao, W Zhu, A Awadalla, J Gardner, ...
arXiv preprint arXiv:2308.06595, 2023
20
2023
What you see is what you read? Improving text-image alignment evaluation
M Yarom, Y Bitton, S Changpinyo, R Aharoni, J Herzig, O Lang, E Ofek, ...
Advances in Neural Information Processing Systems 36, 2024
19
2024
WinoGAViL: Gamified association benchmark to challenge vision-and-language models
Y Bitton, NB Guetta, R Yosef, Y Elovici, M Bansal, G Stanovsky, ...
NeurIPS 2022, Oral, Datasets and Benchmarks, 2022
13
2022
Cross-lingual Unified Medical Language System entity linking in online health communities
Y Bitton, R Cohen, T Schifter, E Bachmat, M Elhadad, N Elhadad
Journal of the American Medical Informatics Association 27 (10), 1585-1592, 2020
9
2020
VASR: Visual Analogies of Situation Recognition
Y Bitton, R Yosef, E Strugo, D Shahaf, R Schwartz, G Stanovsky
AAAI 2023 (Oral), 2022
7
2022
IRFL: Image recognition of figurative language
R Yosef, Y Bitton, D Shahaf
arXiv preprint arXiv:2303.15445, 2023
6
2023
q2d: Turning questions into dialogs to teach models how to search
Y Bitton, S Cohen-Ganor, I Hakimi, Y Lewenberg, R Aharoni, E Weinreb
arXiv preprint arXiv:2304.14318, 2023
3
2023
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
B Gordon, Y Bitton, Y Shafir, R Garg, X Chen, D Lischinski, D Cohen-Or, ...
arXiv preprint arXiv:2312.03766, 2023
1
2023
VideoCon: Robust video-language alignment via contrast captions
H Bansal, Y Bitton, I Szpektor, KW Chang, A Grover
arXiv preprint arXiv:2311.10111, 2023
1
2023
ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies
O Sultan, Y Bitton, R Yosef, D Shahaf
arXiv preprint arXiv:2403.01139, 2024
2024
VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models
Y Bitton, H Bansal, J Hessel, R Shao, W Zhu, A Awadalla, J Gardner, ...
Advances in Neural Information Processing Systems 36, 2024
2024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
A Jacovi, Y Bitton, B Bohnet, J Herzig, O Honovich, M Tseng, M Collins, ...
arXiv preprint arXiv:2402.00559, 2024
2024
Read, Look or Listen? What's Needed for Solving a Multimodal Dataset
N Madvil, Y Bitton, R Schwartz
arXiv preprint arXiv:2307.04532, 2023
2023
Transferring Visual Attributes from Natural Language to Verified Image Generation
R Valerio, J Bordalo, M Yarom, Y Bitton, I Szpektor, J Magalhaes
arXiv preprint arXiv:2305.15026, 2023
2023