Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.12753
Cited By
LookupViT: Compressing visual information to a limited number of tokens
17 July 2024
Rajat Koner
Gagan Jain
Prateek Jain
Volker Tresp
Sujoy Paul
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LookupViT: Compressing visual information to a limited number of tokens"
4 / 4 papers shown
Title
Top-Down Compression: Revisit Efficient Vision Token Projection for Visual Instruction Tuning
Bonan li
Zicheng Zhang
Songhua Liu
Weihao Yu
Xinchao Wang
VLM
74
0
0
17 May 2025
AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Zhao Jin
Dacheng Tao
VGen
149
1
0
16 Dec 2024
Principles of Visual Tokens for Efficient Video Understanding
Xinyue Hao
Gen Li
Shreyank N. Gowda
Robert B Fisher
Jonathan Huang
Anurag Arnab
Laura Sevilla-Lara
130
0
0
20 Nov 2024
Token Turing Machines are Efficient Vision Models
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiravathukal
James C. Davis
Yung-Hsiang Lu
117
0
0
11 Sep 2024
1