DINOv2: Learning Robust Visual Features without Supervision M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ... arXiv preprint arXiv:2304.07193, 2023 | 841* | 2023 |
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference B Graham, A El-Nouby, H Touvron, P Stock, A Joulin, H Jégou, M Douze International Conference on Computer Vision 2021, 2021 | 648* | 2021 |
Resmlp: Feedforward networks for image classification with data-efficient training H Touvron, P Bojanowski, M Caron, M Cord, A El-Nouby, E Grave, ... IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 5314-5321, 2022 | 616* | 2022 |
XCiT: Cross-Covariance Image Transformers A El-Nouby, H Touvron, M Caron, P Bojanowski, M Douze, A Joulin, ... 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 | 412* | 2021 |
Imagebind: One embedding space to bind them all R Girdhar, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 363* | 2023 |
Training vision transformers for image retrieval A El-Nouby, N Neverova, I Laptev, H Jégou arXiv preprint arXiv:2102.05644, 2021 | 160 | 2021 |
Tell, draw, and repeat: Generating and modifying images based on continual linguistic instruction A El-Nouby, S Sharma, H Schulz, D Hjelm, LE Asri, SE Kahou, Y Bengio, ... Proceedings of the IEEE International Conference on Computer Vision, 10304-10312, 2019 | 141* | 2019 |
Are large-scale datasets necessary for self-supervised pre-training? A El-Nouby, G Izacard, H Touvron, I Laptev, H Jegou, E Grave arXiv preprint arXiv:2112.10740, 2021 | 122 | 2021 |
Three things everyone should know about vision transformers H Touvron, M Cord, A El-Nouby, J Verbeek, H Jégou European Conference on Computer Vision, 497-515, 2022 | 81 | 2022 |
Omnimae: Single model masked pretraining on images and videos R Girdhar, A El-Nouby, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 66* | 2023 |
Augmenting convolutional networks with attention-based aggregation H Touvron, M Cord, A El-Nouby, P Bojanowski, A Joulin, G Synnaeve, ... arXiv preprint arXiv:2112.13692, 2021 | 53* | 2021 |
Real-Time End-to-End Action Detection with Two-Stream Networks A Ali, GW Taylor 2018 15th Conference on Computer and Robot Vision (CRV), 31-38, 2018 | 32* | 2018 |
Image compression with product quantized masked image modeling A El-Nouby, MJ Muckley, K Ullrich, I Laptev, J Verbeek, H Jégou arXiv preprint arXiv:2212.07372, 2022 | 19 | 2022 |
Skip-Clip: Self-Supervised Spatiotemporal Representation Learning by Future Clip Order Ranking A El-Nouby, S Zhai, GW Taylor, JM Susskind Holistic Video Understanding Workshop ICCV2019, 2019 | 17 | 2019 |
Improving statistical fidelity for neural image compression with implicit local likelihood models MJ Muckley, A El-Nouby, K Ullrich, H Jégou, J Verbeek International Conference on Machine Learning, 25426-25443, 2023 | 13* | 2023 |
Scalable Pre-training of Large Autoregressive Image Models A El-Nouby, M Klein, S Zhai, MA Bautista, A Toshev, V Shankar, ... arXiv preprint arXiv:2401.08541, 2024 | 9 | 2024 |
Variable Rate Allocation for Vector-Quantized Autoencoders F Baldassarre, A El-Nouby, H Jégou ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Are Visual Recognition Models Robust to Image Compression? JM Janeiro, S Frolov, A El-Nouby, J Verbeek arXiv preprint arXiv:2304.04518, 2023 | | 2023 |
Spatiotemporal Representation Learning For Human Action Recognition And Localization A Ali University of Guelph, 2019 | | 2019 |