Next-gpt: Any-to-any multimodal llm S Wu, H Fei, L Qu, W Ji, TS Chua arXiv preprint arXiv:2309.05519, 2023 | 397 | 2023 |
Dynamic modality interaction modeling for image-text retrieval L Qu, M Liu, J Wu, Z Gao, L Nie Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 150 | 2021 |
Context-aware multi-view summarization network for image-text matching L Qu, M Liu, D Cao, L Nie, Q Tian Proceedings of the 28th ACM international conference on multimedia, 1047-1055, 2020 | 139 | 2020 |
Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation L Qu*, S Wu*, H Fei, L Nie, TS Chua Proceedings of the 31st ACM International Conference on Multimedia, 643-654, 2023 | 81 | 2023 |
Search-oriented micro-video captioning L Nie, L Qu, D Meng, M Zhang, Q Tian, AD Bimbo Proceedings of the 30th ACM international conference on multimedia, 3234-3243, 2022 | 41 | 2022 |
Self-supervised correlation learning for cross-modal retrieval Y Liu, J Wu, L Qu, T Gan, J Yin, L Nie IEEE Transactions on Multimedia 25, 2851-2863, 2022 | 41 | 2022 |
Composed image retrieval with text feedback via multi-grained uncertainty regularization Y Chen, Z Zheng, W Ji, L Qu, TS Chua International Conference on Learning Representations (ICLR), 2022 | 35 | 2022 |
Temporal anomaly detection on IIoT-enabled manufacturing P Zhan, S Wang, J Wang, L Qu, K Wang, Y Hu, X Li Journal of Intelligent Manufacturing 32, 1669-1678, 2021 | 25 | 2021 |
Iterative local-global collaboration learning towards one-shot video person re-identification M Liu, L Qu, L Nie, M Liu, L Duan, B Chen IEEE Transactions on Image Processing 29, 9360-9372, 2020 | 25 | 2020 |
Generative cross-modal retrieval: Memorizing images in multimodal language models for retrieval and beyond Y Li, W Wang, L Qu, L Nie, W Li, TS Chua arXiv preprint arXiv:2402.10805, 2024 | 10 | 2024 |
Learnable Pillar-based Re-ranking for Image-Text Retrieval L Qu, M Liu, W Wang, Z Zheng, L Nie, TS Chua Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023 | 10 | 2023 |
Discriminative probing and tuning for text-to-image generation L Qu, W Wang, Y Li, H Zhang, L Nie, TS Chua Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 9 | 2024 |
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives T Nguyen, Y Bin, J Xiao, L Qu, Y Li, JZ Wu, CD Nguyen, SK Ng, LA Tuan arXiv preprint arXiv:2406.05615, 2024 | 4 | 2024 |
Popularity-aware Distributionally Robust Optimization for Recommendation System J Zhao, W Wang, X Lin, L Qu, J Zhang, TS Chua Proceedings of the 32nd ACM International Conference on Information and …, 2023 | 4 | 2023 |
Unified Text-to-Image Generation and Retrieval L Qu, H Li, T Wang, W Wang, Y Li, L Nie, TS Chua arXiv preprint arXiv:2406.05814, 2024 | 2 | 2024 |
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Y Li, H Cai, W Wang, L Qu, Y Wei, W Li, L Nie, TS Chua arXiv preprint arXiv:2407.17274, 2024 | | 2024 |