Obserwuj
Leigang Qu
Tytuł
Cytowane przez
Cytowane przez
Rok
Next-gpt: Any-to-any multimodal llm
S Wu, H Fei, L Qu, W Ji, TS Chua
arXiv preprint arXiv:2309.05519, 2023
3972023
Dynamic modality interaction modeling for image-text retrieval
L Qu, M Liu, J Wu, Z Gao, L Nie
Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021
1502021
Context-aware multi-view summarization network for image-text matching
L Qu, M Liu, D Cao, L Nie, Q Tian
Proceedings of the 28th ACM international conference on multimedia, 1047-1055, 2020
1392020
Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation
L Qu*, S Wu*, H Fei, L Nie, TS Chua
Proceedings of the 31st ACM International Conference on Multimedia, 643-654, 2023
812023
Search-oriented micro-video captioning
L Nie, L Qu, D Meng, M Zhang, Q Tian, AD Bimbo
Proceedings of the 30th ACM international conference on multimedia, 3234-3243, 2022
412022
Self-supervised correlation learning for cross-modal retrieval
Y Liu, J Wu, L Qu, T Gan, J Yin, L Nie
IEEE Transactions on Multimedia 25, 2851-2863, 2022
412022
Composed image retrieval with text feedback via multi-grained uncertainty regularization
Y Chen, Z Zheng, W Ji, L Qu, TS Chua
International Conference on Learning Representations (ICLR), 2022
352022
Temporal anomaly detection on IIoT-enabled manufacturing
P Zhan, S Wang, J Wang, L Qu, K Wang, Y Hu, X Li
Journal of Intelligent Manufacturing 32, 1669-1678, 2021
252021
Iterative local-global collaboration learning towards one-shot video person re-identification
M Liu, L Qu, L Nie, M Liu, L Duan, B Chen
IEEE Transactions on Image Processing 29, 9360-9372, 2020
252020
Generative cross-modal retrieval: Memorizing images in multimodal language models for retrieval and beyond
Y Li, W Wang, L Qu, L Nie, W Li, TS Chua
arXiv preprint arXiv:2402.10805, 2024
102024
Learnable Pillar-based Re-ranking for Image-Text Retrieval
L Qu, M Liu, W Wang, Z Zheng, L Nie, TS Chua
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
102023
Discriminative probing and tuning for text-to-image generation
L Qu, W Wang, Y Li, H Zhang, L Nie, TS Chua
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
92024
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
T Nguyen, Y Bin, J Xiao, L Qu, Y Li, JZ Wu, CD Nguyen, SK Ng, LA Tuan
arXiv preprint arXiv:2406.05615, 2024
42024
Popularity-aware Distributionally Robust Optimization for Recommendation System
J Zhao, W Wang, X Lin, L Qu, J Zhang, TS Chua
Proceedings of the 32nd ACM International Conference on Information and …, 2023
42023
Unified Text-to-Image Generation and Retrieval
L Qu, H Li, T Wang, W Wang, Y Li, L Nie, TS Chua
arXiv preprint arXiv:2406.05814, 2024
22024
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation
Y Li, H Cai, W Wang, L Qu, Y Wei, W Li, L Nie, TS Chua
arXiv preprint arXiv:2407.17274, 2024
2024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–16