Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.10068
Cited By
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
14 April 2025
Yang Shi
Jiaheng Liu
Yushuo Guan
Zhikai Wu
Yize Zhang
Ziyi Wang
Weihong Lin
Jingyun Hua
Ziyi Wang
Xinlong Chen
Bohan Zeng
Wei Zhang
Fuzheng Zhang
Wenjing Yang
Di Zhang
VGen
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Mavors: Multi-granularity Video Representation for Multimodal Large Language Model"
4 / 54 papers shown
Title
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)
Chee-Kheng Chng
Yuliang Liu
Yipeng Sun
Chun Chet Ng
Canjie Luo
...
Errui Ding
Jingtuo Liu
Dimosthenis Karatzas
Chee Seng Chan
Lianwen Jin
3DV
92
215
0
16 Sep 2019
A Short Note on the Kinetics-700 Human Action Dataset
João Carreira
Eric Noland
Chloe Hillier
Andrew Zisserman
76
455
0
15 Jul 2019
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
98
1,542
0
13 Jun 2017
Towards Automatic Learning of Procedures from Web Instructional Videos
Luowei Zhou
Chenliang Xu
Jason J. Corso
EgoV
75
830
0
28 Mar 2017
Previous
1
2