ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.01773
  4. Cited By
v1v2 (latest)

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

3 March 2025
Shiqi Chen
Tongyao Zhu
Ruochen Zhou
Jinghan Zhang
Siyang Gao
Juan Carlos Niebles
Mor Geva
Junxian He
Jiajun Wu
Manling Li
    LRM
ArXiv (abs)PDFHTML

Papers citing "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas"

2 / 2 papers shown
Title
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
Mengdi Jia
Zekun Qi
Shaochen Zhang
Wenyao Zhang
Xinqiang Yu
Jiawei He
He Wang
L. Yi
LRMVLM
59
0
0
03 Jun 2025
Caption This, Reason That: VLMs Caught in the Middle
Caption This, Reason That: VLMs Caught in the Middle
Zihan Weng
Lucas Gomez
Taylor Whittington Webb
P. Bashivan
VLMLRM
50
0
0
24 May 2025
1