Cheng Zihui

Visual thoughts: A unified perspective of understanding multimodal chain-of-thought

arXiv preprint arXiv:2505.15510, 2025.

Cheng, Zihui and Chen, Qiguang and Xu, Xiao and Wang, Jiaqi and Wang, Weiyun and Fei, Hao and Wang, Yidong and Wang, Alex Jinpeng and Chen, Zhi and Che, Wanxiang and others

Visual thoughts: A unified perspective of understanding multimodal chain-of-thought

arXiv preprint arXiv:2505.15510, 2025.

Cheng, Zihui and Chen, Qiguang and Xu, Xiao and Wang, Jiaqi and Wang, Weiyun and Fei, Hao and Wang, Yidong and Wang, Alex Jinpeng and Chen, Zhi and Che, Wanxiang and others

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

arXiv preprint arXiv:2412.12932, 2024.

Cheng, Zihui and Chen, Qiguang and Zhang, Jin and Fei, Hao and Feng, Xiaocheng and Che, Wanxiang and Li, Min and Qin, Libo

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

arXiv preprint arXiv:2412.12932, 2024.

Cheng, Zihui and Chen, Qiguang and Zhang, Jin and Fei, Hao and Feng, Xiaocheng and Che, Wanxiang and Li, Min and Qin, Libo