Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

3 March 2025

Papers citing "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas"

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
Mengdi Jia, Zekun Qi, Shaochen Zhang, Wenyao Zhang, Xinqiang Yu, Jiawei He, He Wang, L. Yi
LRM, VLM | 59 0 0 | 03 Jun 2025

Caption This, Reason That: VLMs Caught in the Middle
Zihan Weng, Lucas Gomez, Taylor Whittington Webb, P. Bashivan
VLM, LRM | 50 0 0 | 24 May 2025