Efficient Low-rank Multimodal Fusion with Modality-Specific Factors Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 707 | 2018 |
Words can shift: Dynamically adjusting word representations using nonverbal behaviors Y Wang, Y Shen, Z Liu, PP Liang, A Zadeh, LP Morency Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 7216-7223, 2019 | 368 | 2019 |
Multiinstruct: Improving multi-modal zero-shot learning via instruction tuning Z Xu, Y Shen, L Huang arXiv preprint arXiv:2212.10773, 2022 | 67 | 2022 |
Efficient low-rank multimodal fusion with modality-specific factors. arXiv 2018 Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency arXiv preprint arXiv:1806.00064, 0 | 18 | |
The art of SOCRATIC QUESTIONING: Recursive thinking with large language models J Qi, Z Xu, Y Shen, M Liu, D Jin, Q Wang, L Huang Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 10 | 2023 |
The art of socratic questioning: Zero-shot multimodal reasoning with recursive thinking and self-questioning J Qi, Z Xu, Y Shen, M Liu, D Jin, Q Wang, L Huang arXiv preprint arXiv:2305.14999, 2023 | 5 | 2023 |
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning Z Xu, C Feng, R Shao, T Ashby, Y Shen, D Jin, Y Cheng, Q Wang, ... arXiv preprint arXiv:2402.11690, 2024 | 3 | 2024 |
X-eval: Generalizable multi-aspect text evaluation via augmented instruction tuning with auxiliary evaluation aspects M Liu, Y Shen, Z Xu, Y Cao, E Cho, V Kumar, R Ghanadan, L Huang arXiv preprint arXiv:2311.08788, 2023 | 3 | 2023 |
Multimodal Instruction Tuning with Conditional Mixture of LoRA Y Shen, Z Xu, Q Wang, Y Cheng, W Yin, L Huang arXiv preprint arXiv:2402.15896, 2024 | 1 | 2024 |
Learning by Asking for Embodied Visual Navigation and Task Completion Y Shen, I Lourentzou arXiv preprint arXiv:2302.04865, 2023 | 1 | 2023 |
Many-to-many Image Generation with Auto-regressive Diffusion Models Y Shen, Y Zhang, S Zhai, L Huang, JM Susskind, J Gu arXiv preprint arXiv:2404.03109, 2024 | | 2024 |
MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks J Qi, M Liu, Y Shen, Z Xu, L Huang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18888 …, 2024 | | 2024 |
KnowledgeBot: Improving Assistive Robot for Task Completion and Live Interaction via Neuro-Symbolic Reasoning M Liu, Y Shen, BMYS Wang, JQZ Xu, L Huang | | |