Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.23782
Cited By
Video Token Merging for Long-form Video Understanding
31 October 2024
Seon-Ho Lee
Jue Wang
Zhikang Zhang
D. Fan
Xinyu Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video Token Merging for Long-form Video Understanding"
2 / 2 papers shown
Title
VideoSAVi: Self-Aligned Video Language Models without Human Supervision
Yogesh Kulkarni
Pooyan Fazli
VLM
103
2
0
01 Dec 2024
WorldSimBench: Towards Video Generation Models as World Simulators
Yiran Qin
Zhelun Shi
Jiwen Yu
Xijun Wang
Enshen Zhou
...
Lu Sheng
Jing Shao
Junlin Wu
Wanli Ouyang
Ruimao Zhang
EGVM
VGen
126
381
0
23 Oct 2024
1