Improving video-text retrieval by multi-stream corpus alignment and dual softmax loss X Cheng, H Lin, X Wu, F Yang, D Shen arXiv preprint arXiv:2109.04290, 2021 | 125 | 2021 |
Cat: Cross attention in vision transformer H Lin, X Cheng, X Wu, D Shen 2022 IEEE international conference on multimedia and expo (ICME), 1-6, 2022 | 94 | 2022 |
Mltr: Multi-label classification with transformer X Cheng, H Lin, X Wu, D Shen, F Yang, H Liu, N Shi 2022 IEEE international conference on multimedia and expo (ICME), 1-6, 2022 | 47 | 2022 |
A unified model for video understanding and knowledge embedding with heterogeneous knowledge graph dataset J Deng, D Shen, H Pan, X Wu, X Liu, G Meng, F Yang, T Gao, R Fu, ... Proceedings of the 2023 ACM International Conference on Multimedia Retrieval …, 2023 | 1 | 2023 |
Generation-Guided Multi-Level Unified Network for Video Grounding X Cheng, X Wu, D Shen, H Lin, F Yang arXiv preprint arXiv:2303.07748, 2023 | | 2023 |