
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models
Papers citing "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"
40 / 40 papers shown
Title |
---|
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling. Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, ... Jie Fu, Tao Gui, Tianxiang Sun, Yugang Jiang, Xipeng Qiu |