Cheng Zihui

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

Proceedings of the AAAI Conference on Artificial Intelligence, 2025

Cheng, Zihui and Chen, Qiguang and Zhang, Jin and Fei, Hao and Feng, Xiaocheng and Che, Wanxiang and Li, Min and Qin, Libo

Visual thoughts: A unified perspective of understanding multimodal chain-of-thought

NeurIPS 2025

Cheng, Zihui and Chen, Qiguang and Xu, Xiao and Wang, Jiaqi and Wang, Weiyun and Fei, Hao and Wang, Yidong and Wang, Alex Jinpeng and Chen, Zhi and Che, Wanxiang and others
