Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.02669
Cited By
v1
v2 (latest)
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
5 January 2025
Simon Park
A. Panigrahi
Yun Cheng
Dingli Yu
Anirudh Goyal
Sanjeev Arora
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?"
4 / 4 papers shown
Title
COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning
Xindi Wu
Hee Seung Hwang
Polina Kirichenko
Olga Russakovsky
VLM
CoGe
125
1
0
30 Apr 2025
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Kexian Tang
Junyao Gao
Yanhong Zeng
Haodong Duan
Yanan Sun
Zhening Xing
Wenran Liu
Kaifeng Lyu
Kai-xiang Chen
ELM
LRM
138
9
0
25 Mar 2025
A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1
Zhaoyi Li
Xiaohan Zhao
Dong-Dong Wu
Jiacheng Cui
Zhiqiang Shen
AAML
VLM
138
3
0
13 Mar 2025
Textual-to-Visual Iterative Self-Verification for Slide Generation
Yunqing Xu
Xinbei Ma
Jiyang Qiu
Hai Zhao
108
0
0
24 Feb 2025
1