Follow
Sergi Caelles
Sergi Caelles
Research Scientist at Google
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
14902023
The 2017 DAVIS Challenge on Video Object Segmentation
J Pont-Tuset, F Perazzi, S Caelles, P Arbeláez, L Van Gool
arXiv preprint arXiv:1704.00675, 2017
12412017
One-Shot Video Object Segmentation
S Caelles, KK Maninis, J Pont-Tuset, L Leal-Taixé, D Cremers, LV Gool
CVPR: Computer Vision and Pattern Recognition, 2017
11122017
Deep extreme cut: From extreme points to object segmentation
KK Maninis, S Caelles, J Pont-Tuset, L Van Gool
CVPR: Computer Vision and Pattern Recognition, 2018
4982018
Video object segmentation without temporal information
KK Maninis, S Caelles, Y Chen, J Pont-Tuset, L Leal-Taixé, D Cremers, ...
TPAMI: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018
4102018
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
3582024
The 2019 davis challenge on vos: Unsupervised multi-object segmentation
S Caelles, J Pont-Tuset, F Perazzi, A Montes, KK Maninis, L Van Gool
arXiv preprint arXiv:1905.00737, 2019
1582019
The 2018 DAVIS challenge on video object segmentation
S Caelles, A Montes, KK Maninis, Y Chen, L Van Gool, F Perazzi, ...
arXiv preprint arXiv:1803.00557 1 (2), 2018
1502018
Vct: A video compression transformer
F Mentzer, G Toderici, D Minnen, SJ Hwang, S Caelles, M Lucic, ...
arXiv preprint arXiv:2206.07307, 2022
912022
First real-time coherent MIMO-DSP for six coupled mode transmission
S Randel, S Corteselli, D Badini, D Pilori, S Caelles, S Chandrasekhar, ...
2015 IEEE Photonics Conference (IPC), 1-2, 2015
562015
Iterative Deep Learning for Road Topology Extraction
C Ventura, J Pont-Tuset, S Caelles, KK Maninis, L Van Gool
BMVC 2018, 2018
492018
Towards truly zero-shot compositional visual reasoning with llms as programmers
A Staniæ, S Caelles, M Tschannen
arXiv preprint arXiv:2401.01974, 2024
82024
Fast video object segmentation with Spatio-Temporal GANs
S Caelles, A Pumarola, F Moreno-Noguer, A Sanfeliu, L Van Gool
82019
Iterative Deep Retinal Topology Extraction
C Ventura, J Pont-Tuset, S Caelles, KK Maninis, L Van Gool
Patch-Based Techniques in Medical Imaging: 4th International Workshop, Patch …, 2018
12018
Video Object Segmentation by Tracking Structured Key Points and Contours
S Caelles Prat
Universitat Politècnica de Catalunya, 2016
2016
Implementation of DSP algorithms in VHDL for high-speed optical communications
S Caelles Prat
Universitat Politècnica de Catalunya, 2014
2014
The system can't perform the operation now. Try again later.
Articles 1–16