Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.06749
Cited By
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
9 March 2025
Wenxuan Huang
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU
OffRL
LRM
MLLM
ReLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models"
2 / 102 papers shown
Title
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
292
2,375
0
20 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
330
3,238
0
02 Dec 2016
Previous
1
2
3