2103.00112
Transformer in Transformer
27 February 2021
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
Github (4228★)
Papers citing "Transformer in Transformer"
50 / 558 papers shown
Title
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers
Zhuo Huang
Zhiyou Zhao
Banghuai Li
Jungong Han
3DPC
ViT
102
58
0
23 Oct 2022
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention
Chi Zhang
Lu Zhou
Lei Wang
Zaiyan Dai
Jun Yang
ViT
135
27
0
22 Oct 2022
LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling
Dongsheng Chen
Chaofan Tao
Lu Hou
Lifeng Shang
Xin Jiang
Qun Liu
VLM
100
19
0
21 Oct 2022
Boosting vision transformers for image retrieval
Chull Hwan Song
Jooyoung Yoon
Shunghyun Choi
Yannis Avrithis
ViT
117
33
0
21 Oct 2022
Similarity of Neural Architectures using Adversarial Attack Transferability
Ian Ryu
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
136
2
0
20 Oct 2022
Rethinking Bias Mitigation: Fairer Architectures Make for Fairer Face Recognition
Samuel Dooley
R. Sukthanker
John P. Dickerson
Colin White
Frank Hutter
Micah Goldblum
CVBM
132
23
0
18 Oct 2022
TokenMixup: Efficient Attention-guided Token-level Data Augmentation for Transformers
Hyeong Kyu Choi
Joonmyung Choi
Hyunwoo J. Kim
ViT
95
37
0
14 Oct 2022
Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
Zhiying Lu
Hongtao Xie
Chuanbin Liu
Yongdong Zhang
ViT
107
62
0
12 Oct 2022
Coded Residual Transform for Generalizable Deep Metric Learning
Shichao Kan
Yixiong Liang
Min Li
Yigang Cen
Jianxin Wang
Z. He
72
3
0
09 Oct 2022
The Lie Derivative for Measuring Learned Equivariance
Nate Gruver
Marc Finzi
Micah Goldblum
A. Wilson
99
40
0
06 Oct 2022
Towards Flexible Inductive Bias via Progressive Reparameterization Scheduling
Yunsung Lee
Gyuseong Lee
Kwang-seok Ryoo
Hyojun Go
Jihye Park
Seung Wook Kim
74
5
0
04 Oct 2022
Effective Vision Transformer Training: A Data-Centric Perspective
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
80
5
0
29 Sep 2022
UNesT: Local Spatial Representation Learning with Hierarchical Transformer for Efficient Medical Segmentation
Xin Yu
Qi Yang
Yinchi Zhou
L. Cai
Riqiang Gao
...
R. Abramson
Zizhao Zhang
Yuankai Huo
Bennett A. Landman
Yucheng Tang
ViT
MedIm
89
0
0
28 Sep 2022
Hierarchical MixUp Multi-label Classification with Imbalanced Interdisciplinary Research Proposals
Meng Xiao
Minjie Wu
Ziyue Qiao
Zhiyuan Ning
Yi Du
Yanjie Fu
Yuanchun Zhou
89
2
0
28 Sep 2022
Dense-TNT: Efficient Vehicle Type Classification Neural Network Using Satellite Imagery
Ruikang Luo
Yaofeng Song
Haiying Zhao
Yicheng Zhang
Yi Zhang
Nanbin Zhao
Liping Huang
Rong Su
ViT
54
12
0
27 Sep 2022
Estimating Brain Age with Global and Local Dependencies
Yanwu Yang
Xutao Guo
Zhikai Chang
Hanyang Peng
Yang Xiang
Haiyan Lv
Ting Ma
112
0
0
19 Sep 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
76
13
0
18 Sep 2022
Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification
Meng Xiao
Ziyue Qiao
Yanjie Fu
Hao Dong
Yi Du
Pengyang Wang
Hui Xiong
Yuanchun Zhou
120
10
0
16 Sep 2022
SQ-Swin: a Pretrained Siamese Quadratic Swin Transformer for Lettuce Browning Prediction
Dayang Wang
Boce Zhang
Yongshun Xu
Yaguang Luo
Hengyong Yu
ViT
99
1
0
16 Sep 2022
PSAQ-ViT V2: Towards Accurate and General Data-Free Quantization for Vision Transformers
Zhikai Li
Mengjuan Chen
Junrui Xiao
Qingyi Gu
ViT
MQ
127
35
0
13 Sep 2022
Not All Instances Contribute Equally: Instance-adaptive Class Representation Learning for Few-Shot Visual Recognition
M. Han
Yibing Zhan
Yong Luo
Bo Du
Han Hu
Yonggang Wen
Dacheng Tao
76
6
0
07 Sep 2022
ViTKD: Practical Guidelines for ViT feature knowledge distillation
Zhendong Yang
Zhe Li
Ailing Zeng
Zexian Li
Chun Yuan
Yu Li
145
42
0
06 Sep 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
66
9
0
24 Aug 2022
FocusFormer: Focusing on What We Need via Architecture Sampler
Jing Liu
Jianfei Cai
Bohan Zhuang
65
8
0
23 Aug 2022
Exploring Adversarial Robustness of Vision Transformers in the Spectral Perspective
Gihyun Kim
Juyeop Kim
Jong-Seok Lee
AAML
ViT
54
6
0
20 Aug 2022
Multiple Instance Neuroimage Transformer
Ayush Singla
Qingyu Zhao
Daniel K. Do
Yuyin Zhou
K. Pohl
Ehsan Adeli
ViT
MedIm
69
11
0
19 Aug 2022
Improved Image Classification with Token Fusion
Keong-Hun Choi
Jin-Woo Kim
Yaolong Wang
J. Ha
ViT
46
0
0
19 Aug 2022
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
É. Mathian
H. Liu
L. Fernandez-Cuesta
Dimitris Samaras
M. Foll
L. Chen
ViT
92
12
0
06 Aug 2022
TransMatting: Enhancing Transparent Objects Matting with Transformers
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
72
22
0
05 Aug 2022
DropKey
Bonan Li
Yinhan Hu
Xuecheng Nie
Congying Han
Xiangjian Jiang
Tiande Guo
Luoqi Liu
46
12
0
04 Aug 2022
Computer Vision Methods for the Microstructural Analysis of Materials: The State-of-the-art and Future Perspectives
Khaled Alrfou
Amir Kordijazi
Tian Zhao
3DV
63
6
0
29 Jul 2022
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Cong Wang
Hongmin Xu
Xiong Zhang
Li Wang
Zhitong Zheng
Haifeng Liu
ViT
59
23
0
27 Jul 2022
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Xiangpei Hua
Lei Zhang
ViT
53
30
0
27 Jul 2022
TransCL: Transformer Makes Strong and Flexible Compressive Learning
Chong Mou
Jian Zhang
66
25
0
25 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
101
8
0
19 Jul 2022
Multi-manifold Attention for Vision Transformers
D. Konstantinidis
Ilias Papastratis
K. Dimitropoulos
P. Daras
ViT
103
16
0
18 Jul 2022
Earthformer: Exploring Space-Time Transformers for Earth System Forecasting
Zhihan Gao
Xingjian Shi
Hao Wang
Yi Zhu
Yuyang Wang
Mu Li
Dit-Yan Yeung
AI4TS
100
159
0
12 Jul 2022
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Ting Yao
Yingwei Pan
Yehao Li
Chong-Wah Ngo
Tao Mei
ViT
227
142
0
11 Jul 2022
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
233
82
0
11 Jul 2022
MaiT: Leverage Attention Masks for More Efficient Image Transformers
Ling Li
Ali Shafiee Ardestani
Joseph Hassoun
43
1
0
06 Jul 2022
Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks
Yongming Rao
Zuyan Liu
Wenliang Zhao
Jie Zhou
Jiwen Lu
ViT
86
38
0
04 Jul 2022
I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
Zhikai Li
Qingyi Gu
MQ
139
106
0
04 Jul 2022
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
Wenhao Wu
Zhun Sun
Wanli Ouyang
VLM
190
99
0
04 Jul 2022
A Survey on Label-efficient Deep Image Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction
Wei Shen
Zelin Peng
Xuehui Wang
Huayu Wang
Jiazhong Cen
Dongsheng Jiang
Lingxi Xie
Xiaokang Yang
Qi Tian
VLM
114
84
0
04 Jul 2022
Rethinking Query-Key Pairwise Interactions in Vision Transformers
Cheng-rong Li
Yangxin Liu
68
0
0
01 Jul 2022
PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
Lilang Zheng
Jiaxuan Fang
Xiaorun Tang
Hanzhang Li
Jiaxin Fan
Tianyi Wang
Rui Zhou
Zhaoyan Yan
ViT
MedIm
96
2
0
30 Jun 2022
Dynamic-Group-Aware Networks for Multi-Agent Trajectory Prediction with Relational Reasoning
Chenxin Xu
Yuxin Wei
Bohan Tang
Sheng Yin
Ya Zhang
Siheng Chen
AI4TS
AI4CE
120
37
0
27 Jun 2022
CV 3315 Is All You Need : Semantic Segmentation Competition
Akide Liu
Zihan Wang
48
4
0
25 Jun 2022
Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space
Jinghuan Shang
Srijan Das
Michael S. Ryoo
99
13
0
23 Jun 2022
Vicinity Vision Transformer
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
73
34
0
21 Jun 2022