CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models
Proceedings of the AAAI Conference on Artificial Intelligence, 2025
Cheng, Zihui and Chen, Qiguang and Zhang, Jin and Fei, Hao and Feng, Xiaocheng and Che, Wanxiang and Li, Min and Qin, Libo