2103.00112
Transformer in Transformer
27 February 2021
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
Papers citing
"Transformer in Transformer"
50 / 553 papers shown
Title
GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
T. Gao
Chengzhong Xu
Le Zhang
Hui Kong
38
4
0
13 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
39
5
0
13 May 2023
Meta-Polyp: a baseline for efficient Polyp segmentation
Quoc-Huy Trinh
MedIm
21
18
0
13 May 2023
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
54
38
0
10 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
24
71
0
09 May 2023
Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXt
Han Zhou
Weida Dong
Yangyi Liu
Jun Chen
43
18
0
08 May 2023
Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts
Ruoyu Feng
Jinming Liu
Xin Jin
Xiaohan Pan
Heming Sun
Zhibo Chen
VLM
60
11
0
04 May 2023
Semantically Structured Image Compression via Irregular Group-Based Decoupling
V. Sheoran
Yixin Gao
Shreyansh Joshi
Tanisha R. Bhayani
Zhibo Chen
36
13
0
04 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
51
33
0
29 Apr 2023
TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potential
Dongjingdin Liu
Pengpeng Chen
Miao Yao
Yijingxiu Lu
Zijie Cai
Yuxin Tian
27
7
0
23 Apr 2023
Self-supervised Learning by View Synthesis
Shaoteng Liu
Xiangyu Zhang
T. Hu
Jiaya Jia
3DV
ViT
40
1
0
22 Apr 2023
Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers
Siyuan Wei
Tianzhu Ye
Shen Zhang
Yao Tang
Jiajun Liang
ViT
11
65
0
21 Apr 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi
Jianan Wang
Yihao Chen
Yukai Shi
Lei Zhang
46
16
0
19 Apr 2023
EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation
Ilwi Yun
Chanyong Shin
Hyunku Lee
Hyuk-Jae Lee
Chae-Eun Rhee
ViT
MDE
32
17
0
16 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
34
76
0
13 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
35
47
0
13 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
27
2
0
13 Apr 2023
DUFormer: Solving Power Line Detection Task in Aerial Images using Semantic Segmentation
Deyu An
Qian Zhang
Jianshu Chao
Tingli Li
Feng Qiao
Yong Deng
Zhen-Peng Bian
ViT
30
6
0
12 Apr 2023
Feature Representation Learning with Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition
Zhijun Zhai
Jianhui Zhao
Chengjiang Long
Wenju Xu
Shuangjian He
Huijuan Zhao
28
24
0
10 Apr 2023
A Cross-Scale Hierarchical Transformer with Correspondence-Augmented Attention for inferring Bird's-Eye-View Semantic Segmentation
N. Fang
Le-miao Qiu
Shuyou Zhang
Zili Wang
Kerui Hu
Kang Wang
29
5
0
07 Apr 2023
ElegansNet: a brief scientific report and initial experiments
Francesco Bardozzo
Andrea Terlizzi
Pietro Lio
R. Tagliaferri
29
1
0
06 Apr 2023
SMPConv: Self-moving Point Representations for Continuous Convolution
Sanghyeon Kim
Eunbyung Park
3DPC
39
13
0
05 Apr 2023
DIR-AS: Decoupling Individual Identification and Temporal Reasoning for Action Segmentation
Peiyao Wang
Haibin Ling
15
2
0
04 Apr 2023
Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection with Single Point Supervision
Xinyi Ying
Li Liu
Yingqian Wang
Ruojing Li
Nuo Chen
Zaiping Lin
Weidong Sheng
Shilin Zhou
34
57
0
04 Apr 2023
Rethinking Local Perception in Lightweight Vision Transformer
Qi Fan
Huaibo Huang
Jiyang Guan
Ran He
ViT
31
30
0
31 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
48
119
0
29 Mar 2023
Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization
Jianping Zhang
Yizhan Huang
Weibin Wu
Michael R. Lyu
AAML
ViT
18
50
0
28 Mar 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
24
38
0
27 Mar 2023
SEM-POS: Grammatically and Semantically Correct Video Captioning
Asmar Nadeem
A. Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
27
8
0
26 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
42
127
0
21 Mar 2023
Resolution Enhancement Processing on Low Quality Images Using Swin Transformer Based on Interval Dense Connection Strategy
Ruikang Ju
Chih-Chia Chen
Jen-Shiun Chiang
Yu-Shian Lin
Wei-Han Chen
Chun-Tse Chien
27
17
0
16 Mar 2023
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Hengyuan Zhao
Hao Luo
Yuyang Zhao
Pichao Wang
F. Wang
Mike Zheng Shou
29
5
0
14 Mar 2023
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention
Wenxiao Wang
Wei Chen
Qibo Qiu
Long Chen
Boxi Wu
Binbin Lin
Xiaofei He
Wei Liu
32
38
0
13 Mar 2023
PointPatchMix: Point Cloud Mixing with Patch Scoring
Yi Wang
Jiaze Wang
Jinpeng Li
Zixu Zhao
Guangyong Chen
Anfeng Liu
Pheng-Ann Heng
3DPC
29
7
0
12 Mar 2023
TransMatting: Tri-token Equipped Transformer Model for Image Matting
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
20
3
0
11 Mar 2023
DETA: Denoised Task Adaptation for Few-Shot Learning
Ji Zhang
Lianli Gao
Xu Luo
Hengtao Shen
Jingkuan Song
VLM
44
19
0
11 Mar 2023
Boosting Adversarial Attacks by Leveraging Decision Boundary Information
Boheng Zeng
LianLi Gao
Qilong Zhang
Chaoqun Li
JingKuan Song
Shuaiqi Jing
AAML
19
2
0
10 Mar 2023
Masked Image Modeling with Local Multi-Scale Reconstruction
Haoqing Wang
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhiwei Deng
Kai Han
61
46
0
09 Mar 2023
Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space
Yahui Liu
Bin Wang
Yisheng Lv
Lingxi Li
Feiyue Wang
ViT
3DPC
20
43
0
08 Mar 2023
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
Xuan Shen
Yaohua Wang
Ming Lin
Yi-Li Huang
Hao Tang
Xiuyu Sun
Yanzhi Wang
70
33
0
05 Mar 2023
Transformers in Single Object Tracking: An Experimental Survey
Janani Kugarajeevan
T. Kokul
A. Ramanan
Subha Fernando
38
35
0
23 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
ObjD
89
9
0
21 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
31
3
0
18 Feb 2023
ViTA: A Vision Transformer Inference Accelerator for Edge Applications
Shashank Nag
Gourav Datta
Souvik Kundu
N. Chandrachoodan
P. Beerel
ViT
26
25
0
17 Feb 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
26
6
0
16 Feb 2023
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
Hongyu Hè
Marko Kabić
25
2
0
13 Feb 2023
IH-ViT: Vision Transformer-based Integrated Circuit Appearance Defect Detection
Xiaoibin Wang
Shuang Gao
Yuntao Zou
Jia Guo
Chu Wang
13
5
0
09 Feb 2023
PhysFormer++: Facial Video-based Physiological Measurement with SlowFast Temporal Difference Transformer
Zitong Yu
Yuming Shen
Jingang Shi
Hengshuang Zhao
Yawen Cui
Jiehua Zhang
Philip Torr
Guoying Zhao
ViT
MedIm
29
80
0
07 Feb 2023
CECT: Controllable Ensemble CNN and Transformer for COVID-19 Image Classification
Zhao Liu
Leizhao Shen
ViT
29
8
0
05 Feb 2023
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
Jiayu Jiao
Yuyao Tang
Kun-Li Channing Lin
Yipeng Gao
Jinhua Ma
Yaowei Wang
Wei-Shi Zheng
MedIm
ViT
29
136
0
03 Feb 2023