
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs
Papers citing "IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs"
17 / 17 papers shown
Title |
---|
![]() Aria: An Open Multimodal Native Mixture-of-Experts Model Dongxu Li Yudong Liu Haoning Wu Yue Wang Zhiqi Shen ...Lihuan Zhang Hanshu Yan Guoyin Wang Bei Chen Junnan Li |
![]() SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval Siwei Wu Yizhi Li Kang Zhu Ge Zhang Yiming Liang ...Wenhu Chen Wenhao Huang Noura Al Moubayed Jie Fu Chenghua Lin |