
VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use
Mingyuan Wu
Jingcheng Yang
Jize Jiang
Meitang Li
Kaizhuo Yan
Hanchao Yu
Minjia Zhang
Chengxiang Zhai
Klara Nahrstedt
Papers citing "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"
25 / 25 papers shown
Title |
---|
![]() MuirBench: A Comprehensive Benchmark for Robust Multi-image
Understanding Fei Wang Xingyu Fu James Y. Huang Zekun Li Qin Liu ...Kai-Wei Chang Dan Roth Sheng Zhang Hoifung Poon Muhao Chen |