Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.08771
Cited By
LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information
11 December 2024
Ke Wang
Hong Xuan
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information"
2 / 2 papers shown
Title
Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark
Enxin Song
Wenhao Chai
Weili Xu
Jianwen Xie
Yuxuan Liu
Gaoang Wang
62
0
0
20 Apr 2025
Investigating Inference-time Scaling for Chain of Multi-modal Thought: A Preliminary Study
Yujie Lin
Ante Wang
Moye Chen
Jingyao Liu
Hao Liu
Jinsong Su
Xinyan Xiao
LRM
50
2
0
17 Feb 2025
1