Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.18108
Cited By
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
24 December 2024
Jing Bi
Junjia Guo
Yunlong Tang
Lianggong Wen
Zhang Liu
Chenliang Xu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach"
3 / 3 papers shown
Title
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
Yunlong Tang
Pinxin Liu
Mingqian Feng
Zhangyun Tan
Rui Mao
...
Hang Hua
Ali Vosoughi
Luchuan Song
Zeliang Zhang
Chenliang Xu
LRM
82
1
0
26 May 2025
Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models
Rui Hu
Delai Qiu
Shuyu Wei
J.N. Zhang
Yining Wang
Shengping Liu
Jitao Sang
AuLLM
VLM
134
0
0
27 Feb 2025
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Chenliang Xu
Jiebo Luo
Chenliang Xu
VLM
222
100
0
29 Dec 2023
1