Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs
IEEE Transactions on Circuits and Systems for Video Technology, 2025.
Xu, Xiao and Qin, Libo and Che, Wanxiang and Kan, Min-Yen
Kan Min-Yen
IEEE Transactions on Circuits and Systems for Video Technology, 2025.
Xu, Xiao and Qin, Libo and Che, Wanxiang and Kan, Min-Yen
IEEE Transactions on Circuits and Systems for Video Technology, 2025.
Xu, Xiao and Qin, Libo and Che, Wanxiang and Kan, Min-Yen
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2677--2686, 2022
Qin, Libo and Chen, Qiguang and Xie, Tianbao and Li, Qixin and Lou, Jian-Guang and Che, Wanxiang and Kan, Min-Yen
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2677--2686, 2022
Qin, Libo and Chen, Qiguang and Xie, Tianbao and Li, Qixin and Lou, Jian-Guang and Che, Wanxiang and Kan, Min-Yen