Obserwuj
Jie Lei 雷杰
Jie Lei 雷杰
Research Scientist, Meta AI
Zweryfikowany adres z fb.com - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
TVQA: Localized, compositional video question answering
J Lei, L Yu, M Bansal, TL Berg
EMNLP 2018, 2018
5852018
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
J Lei*, L Li*, L Zhou, Z Gan, TL Berg, M Bansal, J Liu
CVPR 2021, Best Student Paper Honorable Mention, 2021
5642021
Unifying vision-and-language tasks via text generation
J Cho, J Lei, H Tan, M Bansal
ICML 2021, 2021
4302021
Tvr: A large-scale dataset for video-subtitle moment retrieval
J Lei, L Yu, TL Berg, M Bansal
ECCV 2020, 2020
2192020
TVQA+: Spatio-temporal grounding for video question answering
J Lei, L Yu, TL Berg, M Bansal
ACL 2020, 2020
2162020
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
J Lei, L Wang, Y Shen, D Yu, TL Berg, M Bansal
ACL 2020, 2020
1692020
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
J Lei, TL Berg, M Bansal
NeurIPS 2021, 2021
114*2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
L Li*, J Lei*, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ...
NeurIPS 2021 Datasets and Benchmarks Track, 2021
942021
Revealing single frame bias for video-and-language learning
J Lei, TL Berg, M Bansal
ACL 2023, 2022
822022
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Z Wang, M Li, R Xu, L Zhou, J Lei, X Lin, S Wang, Z Yang, C Zhu, ...
NeurIPS 2022, 2022
812022
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
H Tan*, J Lei*, T Wolf, M Bansal
CVPR 2022 workshop on Transformers for Vision, 2021
582021
DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization
Z Tang*, J Lei*, M Bansal
NAACL 2021, 2021
582021
What is More Likely to Happen Next? Video-and-Language Future Event Prediction
J Lei, L Yu, TL Berg, M Bansal
EMNLP 2020, 2020
572020
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models
L Li, J Lei, Z Gan, J Liu
ICCV 2021, 2021
542021
VindLU: A Recipe for Effective Video-and-Language Pretraining
F Cheng, X Wang, J Lei, D Crandall, M Bansal, G Bertasius
CVPR 2023, 2022
442022
RESIN-11: Schema-guided event prediction for 11 newsworthy scenarios
X Du, Z Zhang, S Li, P Yu, H Wang, T Lai, X Lin, Z Wang, I Liu, B Zhou, ...
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
272022
Vision Transformers are Parameter-Efficient Audio-Visual Learners
YB Lin, YL Sung, J Lei, M Bansal, G Bertasius
CVPR 2023, 2022
262022
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
YB Lin, J Lei, M Bansal, G Bertasius
ECCV 2022 Oral, 2022
262022
Weakly supervised image classification with coarse and fine labels
J Lei, Z Guo, Y Wang
2017 14th conference on computer and robot vision (crv), 240-247, 2017
222017
mtvr: Multilingual moment retrieval in videos
J Lei, TL Berg, M Bansal
ACL 2021, 2021
112021
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20