Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.19875
Cited By
InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding
28 June 2024
Kirolos Ataallah
Chenhui Gou
Eslam Abdelrahman
Khushbu Pahwa
Jian Ding
Mohamed Elhoseiny
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding"
4 / 4 papers shown
Title
MINERVA: Evaluating Complex Video Reasoning
Arsha Nagrani
Sachit Menon
Ahmet Iscen
Shyamal Buch
Ramin Mehran
...
Yukun Zhu
Carl Vondrick
Mikhail Sirotenko
Cordelia Schmid
Tobias Weyand
58
0
0
01 May 2025
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark
Hanlei Zhang
Zhuohang Li
Yeshuang Zhu
Hua Xu
Peiwu Wang
Haige Zhu
Jie Zhou
Jinchao Zhang
39
0
0
23 Apr 2025
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Yilun Zhao
Lujing Xie
Haowei Zhang
Guo Gan
Yitao Long
...
Xiangru Tang
Zhenwen Liang
Y. Liu
Chen Zhao
Arman Cohan
53
5
0
21 Jan 2025
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin
Yang Ye
Bin Zhu
Jiaxi Cui
Munan Ning
Peng Jin
Li-ming Yuan
VLM
MLLM
197
595
0
16 Nov 2023
1