Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.01773
Cited By
v1
v2 (latest)
Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
3 March 2025
Shiqi Chen
Tongyao Zhu
Ruochen Zhou
Jinghan Zhang
Siyang Gao
Juan Carlos Niebles
Mor Geva
Junxian He
Jiajun Wu
Manling Li
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas"
2 / 2 papers shown
Title
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
Mengdi Jia
Zekun Qi
Shaochen Zhang
Wenyao Zhang
Xinqiang Yu
Jiawei He
He Wang
L. Yi
LRM
VLM
59
0
0
03 Jun 2025
Caption This, Reason That: VLMs Caught in the Middle
Zihan Weng
Lucas Gomez
Taylor Whittington Webb
P. Bashivan
VLM
LRM
50
0
0
24 May 2025
1