MMMU: A massive multi-discipline multimodal understanding and reasoning benchmark for expert AGI X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ... arXiv preprint arXiv:2311.16502, 2023 | 68 | 2023 |
HiCu: Leveraging hierarchy for curriculum learning in automated ICD coding W Ren, R Zeng, T Wu, T Zhu, RG Krishnan Machine Learning for Healthcare Conference, 198-223, 2022 | 5 | 2022 |
AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks M Ku, C Wei, W Ren, H Yang, W Chen arXiv preprint arXiv:2403.14468, 2024 | 1 | 2024 |
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation W Ren, H Yang, G Zhang, C Wei, X Du, S Huang, W Chen arXiv preprint arXiv:2402.04324, 2024 | 1 | 2024 |
Towards transformer-based automated ICD coding: Challenges, pitfalls and solutions W Ren, T Zhu, R Zeng, T Wu | 1 | 2021 |
Video Diffusion Models: A Survey A Melnik, M Ljubljanac, C Lu, Q Yan, W Ren, H Ritter arXiv preprint arXiv:2405.03150, 2024 | | 2024 |
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding A Zhuang, G Zhang, T Zheng, X Du, J Wang, W Ren, SW Huang, J Fu, ... arXiv preprint arXiv:2402.16671, 2024 | | 2024 |