2103.00112
Cited By
v1
v2
v3 (latest)
Transformer in Transformer
27 February 2021
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
Github (4228★)
Papers citing "Transformer in Transformer"
50 / 558 papers shown
Title
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
Rishov Sarkar
Hanxue Liang
Zhiwen Fan
Zhangyang Wang
Cong Hao
MoE
113
19
0
30 May 2023
ProcessGPT: Transforming Business Process Management with Generative Artificial Intelligence
Amin Beheshti
Jian Yang
Quan Z. Sheng
B. Benatallah
Fabio Casati
Schahram Dustdar
H. M. Nezhad
Xuyun Zhang
Shan Xue
AI4CE
91
39
0
29 May 2023
TranSFormer: Slow-Fast Transformer for Machine Translation
Bei Li
Yi Jing
Xu Tan
Zhen Xing
Tong Xiao
Jingbo Zhu
82
7
0
26 May 2023
FIT: Far-reaching Interleaved Transformers
Ting-Li Chen
Lala Li
108
13
0
22 May 2023
Transfer Learning for Fine-grained Classification Using Semi-supervised Learning and Visual Transformers
Manuel Lagunas
Brayan Impata
Victor Martinez
Virginia Fernandez
Christos Georgakis
Sofia Braun
Felipe Bertrand
ViT
69
8
0
17 May 2023
MaxViT-UNet: Multi-Axis Attention for Medical Image Segmentation
Abdul Rehman Khan
Asifullah Khan
ViT
MedIm
107
14
0
15 May 2023
GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
T. Gao
Chengzhong Xu
Le Zhang
Hui Kong
121
4
0
13 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
93
5
0
13 May 2023
Meta-Polyp: a baseline for efficient Polyp segmentation
Quoc-Huy Trinh
MedIm
52
18
0
13 May 2023
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
174
41
0
10 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
82
82
0
09 May 2023
Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXt
Han Zhou
Weida Dong
Yangyi Liu
Jun Chen
96
19
0
08 May 2023
Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts
Ruoyu Feng
Jinming Liu
Xin Jin
Xiaohan Pan
Heming Sun
Zhibo Chen
VLM
112
13
0
04 May 2023
Semantically Structured Image Compression via Irregular Group-Based Decoupling
V. Sheoran
Yixin Gao
Shreyansh Joshi
Tanisha R. Bhayani
Zhibo Chen
210
14
0
04 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
119
35
0
29 Apr 2023
TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potential
Dongjingdin Liu
Pengpeng Chen
Miao Yao
Yijingxiu Lu
Zijie Cai
Yuxin Tian
49
7
0
23 Apr 2023
Self-supervised Learning by View Synthesis
Shaoteng Liu
Xiangyu Zhang
T. Hu
Jiaya Jia
3DV
ViT
115
1
0
22 Apr 2023
Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers
Siyuan Wei
Tianzhu Ye
Shen Zhang
Yao Tang
Jiajun Liang
ViT
75
72
0
21 Apr 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi
Jianan Wang
Yihao Chen
Yukai Shi
Lei Zhang
98
21
0
19 Apr 2023
EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation
Ilwi Yun
Chanyong Shin
Hyunku Lee
Hyuk-Jae Lee
Chae-Eun Rhee
ViT
MDE
76
19
0
16 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
74
81
0
13 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
94
49
0
13 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
56
2
0
13 Apr 2023
DUFormer: Solving Power Line Detection Task in Aerial Images using Semantic Segmentation
Deyu An
Qian Zhang
Jianshu Chao
Tingli Li
Feng Qiao
Yong Deng
Zhen-Peng Bian
ViT
40
6
0
12 Apr 2023
Feature Representation Learning with Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition
Zhijun Zhai
Jianhui Zhao
Chengjiang Long
Wenju Xu
Shuangjian He
Huijuan Zhao
61
30
0
10 Apr 2023
A Cross-Scale Hierarchical Transformer with Correspondence-Augmented Attention for inferring Bird's-Eye-View Semantic Segmentation
N. Fang
Le-miao Qiu
Shuyou Zhang
Zili Wang
Kerui Hu
Kang Wang
109
7
0
07 Apr 2023
ElegansNet: a brief scientific report and initial experiments
Francesco Bardozzo
Andrea Terlizzi
Pietro Lio
R. Tagliaferri
66
1
0
06 Apr 2023
SMPConv: Self-moving Point Representations for Continuous Convolution
Sanghyeon Kim
Eunbyung Park
3DPC
75
13
0
05 Apr 2023
DIR-AS: Decoupling Individual Identification and Temporal Reasoning for Action Segmentation
Peiyao Wang
Haibin Ling
43
2
0
04 Apr 2023
Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection with Single Point Supervision
Xinyi Ying
Li Liu
Yingqian Wang
Ruojing Li
Nuo Chen
Zaiping Lin
Weidong Sheng
Shilin Zhou
92
60
0
04 Apr 2023
Rethinking Local Perception in Lightweight Vision Transformer
Qi Fan
Huaibo Huang
Jiyang Guan
Ran He
ViT
79
31
0
31 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
191
142
0
29 Mar 2023
Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization
Jianping Zhang
Yizhan Huang
Weibin Wu
Michael R. Lyu
AAML
ViT
87
55
0
28 Mar 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
80
41
0
27 Mar 2023
SEM-POS: Grammatically and Semantically Correct Video Captioning
Asmar Nadeem
A. Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
73
8
0
26 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
114
141
0
21 Mar 2023
Resolution Enhancement Processing on Low Quality Images Using Swin Transformer Based on Interval Dense Connection Strategy
Ruikang Ju
Chih-Chia Chen
Jen-Shiun Chiang
Yu-Shian Lin
Wei-Han Chen
Chun-Tse Chien
67
17
0
16 Mar 2023
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Hengyuan Zhao
Hao Luo
Yuyang Zhao
Pichao Wang
F. Wang
Mike Zheng Shou
68
5
0
14 Mar 2023
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention
Wenxiao Wang
Wei Chen
Qibo Qiu
Long Chen
Boxi Wu
Binbin Lin
Xiaofei He
Wei Liu
98
49
0
13 Mar 2023
PointPatchMix: Point Cloud Mixing with Patch Scoring
Yi Wang
Jiaze Wang
Jinpeng Li
Zixu Zhao
Guangyong Chen
Anfeng Liu
Pheng-Ann Heng
3DPC
56
8
0
12 Mar 2023
TransMatting: Tri-token Equipped Transformer Model for Image Matting
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
71
3
0
11 Mar 2023
DETA: Denoised Task Adaptation for Few-Shot Learning
Ji Zhang
Lianli Gao
Xu Luo
Hengtao Shen
Jingkuan Song
VLM
109
21
0
11 Mar 2023
Boosting Adversarial Attacks by Leveraging Decision Boundary Information
Boheng Zeng
LianLi Gao
Qilong Zhang
Chaoqun Li
JingKuan Song
Shuaiqi Jing
AAML
119
2
0
10 Mar 2023
Masked Image Modeling with Local Multi-Scale Reconstruction
Haoqing Wang
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhiwei Deng
Kai Han
90
53
0
09 Mar 2023
Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space
Yahui Liu
Bin Wang
Yisheng Lv
Lingxi Li
Feiyue Wang
ViT
3DPC
114
48
0
08 Mar 2023
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
Xuan Shen
Yaohua Wang
Ming Lin
Yi-Li Huang
Hao Tang
Xiuyu Sun
Yanzhi Wang
146
34
0
05 Mar 2023
Transformers in Single Object Tracking: An Experimental Survey
Janani Kugarajeevan
T. Kokul
A. Ramanan
Subha Fernando
121
38
0
23 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
ObjD
213
9
0
21 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
78
4
0
18 Feb 2023
ViTA: A Vision Transformer Inference Accelerator for Edge Applications
Shashank Nag
Gourav Datta
Souvik Kundu
N. Chandrachoodan
Peter A. Beerel
ViT
43
28
0
17 Feb 2023