Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.18325
Cited By
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models
23 October 2024
Kim Sung-Bin
Oh Hyun-Bin
JungMok Lee
Arda Senocak
Joon Son Chung
Tae-Hyun Oh
MLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models"
2 / 2 papers shown
Title
Bridging Ears and Eyes: Analyzing Audio and Visual Large Language Models to Humans in Visible Sound Recognition and Reducing Their Sensory Gap via Cross-Modal Distillation
Xilin Jiang
Junkai Wu
Vishal B. Choudhari
N. Mesgarani
VLM
30
0
0
11 May 2025
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Y. Wang
Shengqiong Wu
Y. Zhang
William Yang Wang
Ziwei Liu
Jiebo Luo
Hao Fei
LRM
92
8
0
16 Mar 2025
1