Ensemble of feature sets and classification methods for stance detection J Xu, S Zheng, J Shi, Y Yao, B Xu Natural Language Understanding and Intelligent Applications: 5th CCF …, 2016 | 18 | 2016 |
Hierarchical memory networks for answer selection on unknown words J Xu, J Shi, Y Yao, S Zheng, B Xu arXiv preprint arXiv:1609.08843, 2016 | 17 | 2016 |
Cascaded mutual modulation for visual reasoning Y Yao, J Xu, F Wang, B Xu arXiv preprint arXiv:1809.01943, 2018 | 15 | 2018 |
Muser: Multimodal stress detection using emotion recognition as an auxiliary task Y Yao, M Papakostas, M Burzo, M Abouelenien, R Mihalcea arXiv preprint arXiv:2105.08146, 2021 | 11 | 2021 |
Flm-101b: An open llm and how to train it with $100 k budget X Li, Y Yao, X Jiang, X Fang, X Meng, S Fan, P Han, J Li, L Du, B Qin, ... arXiv preprint arXiv:2309.03852, 2023 | 10 | 2023 |
MORSE: MultimOdal sentiment analysis for Real-life SEttings Y Yao, V Pérez-Rosas, M Abouelenien, M Burzo Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020 | 7 | 2020 |
2x faster language model pre-training via masked structural growth Y Yao, Z Zhang, J Li, Y Wang arXiv preprint arXiv:2305.02869, 2023 | 6 | 2023 |
Learning to activate logic rules for textual reasoning Y Yao, J Xu, J Shi, B Xu Neural Networks 106, 42-49, 2018 | 6 | 2018 |
Research without re-search: Maximal update parametrization yields accurate loss prediction across scales Y Yao, Y Wang arXiv preprint arXiv:2304.06875, 2023 | 2 | 2023 |
The world in my mind: Visual dialog with adversarial multi-modal feature encoding Y Yao, J Xu, B Xu Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 | 1 | 2019 |
Tele-FLM Technical Report X Li, Y Yao, X Jiang, X Fang, C Wang, X Liu, Z Wang, Y Zhao, X Wang, ... arXiv preprint arXiv:2404.16645, 2024 | | 2024 |
CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text Z Lin, Y Yao, Y Yuan arXiv preprint arXiv:2403.01784, 2024 | | 2024 |
NanoLM: An Affordable LLM Study Benchmark via Accurate Loss Prediction Across Scales X Huang, X Fang, Y Yao, X Li, Z Ni, X Jiang, X Meng, P Han, S Shang, ... | | 2023 |
Masked Structural Growth for 2x Faster Language Model Pre-training Y Yao, Z Zhang, J Li, Y Wang The Twelfth International Conference on Learning Representations, 2023 | | 2023 |