Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.04236
Cited By
v1
v2
v3 (latest)
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
6 February 2024
Ji Qi
Ming Ding
Weihan Wang
Yushi Bai
Qingsong Lv
Wenyi Hong
Bin Xu
Lei Hou
Juanzi Li
Yuxiao Dong
Jie Tang
VLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning"
4 / 4 papers shown
Title
FaceInsight: A Multimodal Large Language Model for Face Perception
Jingzhi Li
Changjiang Luo
Ruoyu Chen
Hua Zhang
Wenqi Ren
Jianhou Gan
Xiaochun Cao
CVBM
LRM
138
0
0
22 Apr 2025
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration
X. J. Yang
Jing Liu
Peng Wang
Guoqing Wang
Yue Yang
Jikang Cheng
ObjD
196
0
0
27 Feb 2025
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
Qianqi Yan
Yue Fan
Hongquan Li
Shan Jiang
Yang Zhao
Xinze Guan
Ching-Chen Kuo
Xinze Wang
VLM
LRM
227
2
0
22 Feb 2025
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia
Rui Wang
Xu Liu
Mingyan Li
Tong Yu
Xiang Chen
Julian McAuley
Shuai Li
LRM
128
22
0
24 Apr 2024
1