
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
Papers citing "AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
10 / 10 papers shown
Title |
---|
![]() OmniBench: Towards The Future of Universal Omni-Language Models Yizhi Li Ge Zhang Yinghao Ma Ruibin Yuan Kang Zhu ...Zhaoxiang Zhang Zachary Liu Emmanouil Benetos Wenhao Huang Chenghua Lin |