Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.05298
Cited By
Improving Vision-and-Language Reasoning via Spatial Relations Modeling
9 November 2023
Cheng Yang
Rui Xu
Ye Guo
Peixiang Huang
Yiru Chen
Wenkui Ding
Zhongyuan Wang
Hong Zhou
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improving Vision-and-Language Reasoning via Spatial Relations Modeling"
3 / 3 papers shown
Title
Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks
Keanu Nichols
Nazia Tasnim
Yuting Yan
Nicholas Ikechukwu
Elva Zou
Deepti Ghadiyaram
Bryan A. Plummer
80
0
0
27 May 2025
VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework
Chunbai Zhang
Chao Wang
Yang Zhou
Yan Peng
LRM
ReLM
158
0
0
02 Feb 2025
Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning?
Bowen Zhao
Leo Parker Dirac
Paulina Varshavskaya
VLM
LRM
104
0
0
25 Sep 2024
1