Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.05452
Cited By
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding
10 January 2025
Xingyu Fu
Minqian Liu
Zhengyuan Yang
John Corring
Yijuan Lu
Jianwei Yang
Dan Roth
D. Florêncio
Cha Zhang
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding"
5 / 5 papers shown
Title
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning
Minheng Ni
Zhengyuan Yang
Linjie Li
Chung-Ching Lin
Kevin Qinghong Lin
W. Zuo
Lijuan Wang
ReLM
LRM
46
0
0
26 May 2025
VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use
Mingyuan Wu
Jingcheng Yang
Jize Jiang
Meitang Li
Kaizhuo Yan
Hanchao Yu
Minjia Zhang
Chengxiang Zhai
Klara Nahrstedt
LRM
90
0
0
25 May 2025
Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning
Fanrui Zhang
Dian Li
Qiang Zhang
Chenjun
sinbadliu
Junxiong Lin
Jiahong Yan
Jiawei Liu
Zheng-Jun Zha
OffRL
31
0
0
22 May 2025
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Zhaochen Su
Linjie Li
Mingyang Song
Yunzhuo Hao
Zhengyuan Yang
...
Guanjie Chen
Jiawei Gu
Juntao Li
Xiaoye Qu
Yu Cheng
OffRL
LRM
51
6
0
13 May 2025
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Yansen Wang
Shengqiong Wu
Yize Zhang
William Yang Wang
Ziwei Liu
Jiebo Luo
Hao Fei
LRM
126
23
0
16 Mar 2025
1