2203.01536
Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work
3 March 2022
Khawar Islam
ViT
Papers citing "Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work"
41 / 41 papers shown
Title
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Tobias Christian Nauen
Sebastián M. Palacio
Federico Raue
Andreas Dengel
118
4
0
18 Aug 2023
Face Pyramid Vision Transformer
Khawar Islam
M. Zaheer
Arif Mahmood
ViT
CVBM
60
4
0
21 Oct 2022
VRT: A Video Restoration Transformer
Christos Sakaridis
Jingyun Liang
Yuchen Fan
Peng Sun
Rakesh Ranjan
Yawei Li
Radu Timofte
Luc Van Gool
ViT
107
270
0
28 Jan 2022
Multiview Transformers for Video Recognition
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
78
221
0
12 Jan 2022
Towards End-to-End Image Compression and Analysis with Transformers
Yuanchao Bai
Xu Yang
Xianming Liu
Junjun Jiang
Yaowei Wang
Xiangyang Ji
Wen Gao
ViT
82
51
0
17 Dec 2021
Fast Point Transformer
Chunghyun Park
Yoonwoo Jeong
Minsu Cho
Jaesik Park
3DPC
ViT
72
171
0
09 Dec 2021
SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal
Zhaoyang Sun
Yaxiong Chen
Shengwu Xiong
ViT
57
38
0
07 Dec 2021
CCTrans: Simplifying and Improving Crowd Counting with Transformer
Ye Tian
Xiangxiang Chu
Hongpeng Wang
ViT
62
78
0
29 Sep 2021
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
88
338
0
29 Jul 2021
HAT: Hierarchical Aggregation Transformers for Person Re-identification
Guowen Zhang
Pingping Zhang
Jinqing Qi
Huchuan Lu
ViT
111
119
0
13 Jul 2021
Combining EfficientNet and Vision Transformers for Video Deepfake Detection
D. Coccomini
Nicola Messina
Claudio Gennaro
Fabrizio Falchi
ViT
86
174
0
06 Jul 2021
UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
Yunhe Gao
Mu Zhou
Dimitris N. Metaxas
MedIm
ViT
76
430
0
02 Jul 2021
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
116
327
0
24 Jun 2021
Fully Transformer Networks for Semantic Image Segmentation
Sitong Wu
Tianyi Wu
Fangjian Lin
Sheng Tian
Guodong Guo
ViT
60
39
0
08 Jun 2021
Diverse Part Discovery: Occluded Person Re-identification with Part-Aware Transformer
Yulin Li
Jianfeng He
Tianzhu Zhang
Xiang Liu
Yongdong Zhang
Feng Wu
ViT
79
302
0
08 Jun 2021
Uformer: A General U-Shaped Transformer for Image Restoration
Zhendong Wang
Xiaodong Cun
Jianmin Bao
Wengang Zhou
Jianzhuang Liu
Houqiang Li
ViT
117
1,413
0
06 Jun 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
318
5,072
0
31 May 2021
Medical Image Segmentation Using Squeeze-and-Expansion Transformers
Shaohua Li
Xiuchao Sui
Xiangde Luo
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
ViT
MedIm
64
167
0
20 May 2021
Segmenter: Transformer for Semantic Segmentation
Robin Strudel
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
ViT
215
1,470
0
12 May 2021
Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation
Hu Cao
Yueyue Wang
Jieneng Chen
Dongsheng Jiang
Xiaopeng Zhang
Qi Tian
Manning Wang
ViT
MedIm
141
2,922
0
12 May 2021
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
Bing Li
Cheng Zheng
Silvio Giancola
Guohao Li
ViT
3DPC
83
44
0
10 May 2021
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition
Zhaoxin Fan
Zhenbo Song
Hongyan Liu
Zhiwu Lu
Jun He
Xiaoyong Du
3DPC
ViT
141
74
0
01 May 2021
Point Cloud Learning with Transformer
Xian-Feng Han
Yuming Kuang
ViT
80
34
0
28 Apr 2021
Dual Transformer for Point Cloud Analysis
Xian-Feng Han
Yi-Fei Jin
Hui Cheng
Guoqiang Xiao
ViT
84
75
0
27 Apr 2021
A Video Is Worth Three Views: Trigeminal Transformers for Video-based Person Re-identification
Xuehu Liu
Pingping Zhang
Chenyang Yu
Huchuan Lu
Xuesheng Qian
Xiaoyun Yang
ViT
75
48
0
05 Apr 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
160
1,021
0
31 Mar 2021
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu Zhang
Longhui Wei
Lingxi Xie
Zijie Zhuang
Yongfei Zhang
Yue Liu
Qi Tian
ViT
93
32
0
30 Mar 2021
Transformer Tracking
Xin Chen
Bin Yan
Jiawen Zhu
Dong Wang
Xiaoyun Yang
Huchuan Lu
ViT
69
961
0
29 Mar 2021
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Chun-Fu Chen
Quanfu Fan
Yikang Shen
ViT
71
1,484
0
27 Mar 2021
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou
Bingyi Kang
Xiaojie Jin
Linjie Yang
Xiaochen Lian
Zihang Jiang
Qibin Hou
Jiashi Feng
ViT
104
523
0
22 Mar 2021
UNETR: Transformers for 3D Medical Image Segmentation
Ali Hatamizadeh
Yucheng Tang
Vishwesh Nath
Dong Yang
Andriy Myronenko
Bennett Landman
H. Roth
Daguang Xu
ViT
MedIm
182
1,614
0
18 Mar 2021
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer
Buyu Li
Yongchi Zhao
Zhelun Shi
Lu Sheng
47
134
0
18 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
89
390
0
14 Mar 2021
Generative Adversarial Transformers
Drew A. Hudson
C. L. Zitnick
ViT
76
181
0
01 Mar 2021
Medical Transformer: Gated Axial-Attention for Medical Image Segmentation
Jeya Maria Jose Valanarasu
Poojan Oza
Ilker Hacihaliloglu
Vishal M. Patel
ViT
MedIm
108
993
0
21 Feb 2021
TransReID: Transformer-based Object Re-Identification
Shuting He
Haowen Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
273
819
0
08 Feb 2021
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViT
MedIm
98
3,499
0
08 Feb 2021
Trear: Transformer-based RGB-D Egocentric Action Recognition
Xiangyu Li
Yonghong Hou
Pichao Wang
Zhimin Gao
Mingliang Xu
Wanqing Li
ViT
215
87
0
05 Jan 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
194
2,911
0
31 Dec 2020
Point Transformer
Nico Engel
Vasileios Belagiannis
Klaus C. J. Dietmayer
3DPC
183
2,004
0
02 Nov 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
437
13,108
0
26 May 2020