Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.00287
Cited By
v1
v2 (latest)
Vision Transformers with Mixed-Resolution Tokenization
1 April 2023
Tomer Ronen
Omer Levy
A. Golbert
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (50★)
Papers citing
"Vision Transformers with Mixed-Resolution Tokenization"
13 / 13 papers shown
Title
Componential Prompt-Knowledge Alignment for Domain Incremental Learning
Kunlun Xu
Xu Zou
Gang Hua
Jiahuan Zhou
CLL
139
0
0
07 May 2025
Position: Foundation Models Need Digital Twin Representations
Yiqing Shen
Hao Ding
Lalithkumar Seenivasan
Tianmin Shu
Mathias Unberath
AI4CE
159
2
0
01 May 2025
Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment
Fatemeh Behrad
Tinne Tuytelaars
Johan Wagemans
ViT
120
0
0
03 Apr 2025
CAT: Content-Adaptive Image Tokenization
Junhong Shen
Kushal Tirumala
Michihiro Yasunaga
Ishan Misra
Luke Zettlemoyer
Lili Yu
Chunting Zhou
83
1
0
06 Jan 2025
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
77
1
0
11 Jul 2024
Multiple-Resolution Tokenization for Time Series Forecasting with an Application to Pricing
Egon Persak
Miguel F. Anjos
Sebastian Lautz
Aleksandar Kolev
AI4TS
102
0
0
03 Jul 2024
Wavelet-Based Image Tokenizer for Vision Transformers
Zhenhai Zhu
Radu Soricut
ViT
109
5
0
28 May 2024
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Run Shao
Zhaoyang Zhang
Chao Tao
Yunsheng Zhang
Chengli Peng
Haifeng Li
VLM
83
6
0
27 Mar 2024
Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance
Omer Goldman
Avi Caciularu
Matan Eyal
Kris Cao
Idan Szpektor
Reut Tsarfaty
118
31
0
10 Mar 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
293
9
0
22 Feb 2024
Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations
Bhishma Dedhia
N. Jha
OCL
151
1
0
02 Feb 2024
Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers
Adam Pardyl
Grzegorz Kurzejamski
Jan Olszewski
Tomasz Trzciñski
Bartosz Zieliñski
63
1
0
23 Sep 2023
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers
Jakob Drachmann Havtorn
Amelie Royer
Tijmen Blankevoort
B. Bejnordi
83
8
0
05 Jul 2023
1