ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.01536
  4. Cited By
Recent Advances in Vision Transformer: A Survey and Outlook of Recent
  Work
v1v2v3v4v5 (latest)

Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work

3 March 2022
Khawar Islam
    ViT
ArXiv (abs)PDFHTMLGithub (2★)

Papers citing "Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work"

41 / 41 papers shown
Title
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Tobias Christian Nauen
Sebastián M. Palacio
Federico Raue
Andreas Dengel
118
4
0
18 Aug 2023
Face Pyramid Vision Transformer
Face Pyramid Vision Transformer
Khawar Islam
M. Zaheer
Arif Mahmood
ViTCVBM
60
4
0
21 Oct 2022
VRT: A Video Restoration Transformer
VRT: A Video Restoration Transformer
Christos Sakaridis
Jingyun Liang
Yuchen Fan
Peng Sun
Rakesh Ranjan
Yawei Li
Radu Timofte
Luc Van Gool
ViT
107
270
0
28 Jan 2022
Multiview Transformers for Video Recognition
Multiview Transformers for Video Recognition
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
78
221
0
12 Jan 2022
Towards End-to-End Image Compression and Analysis with Transformers
Towards End-to-End Image Compression and Analysis with Transformers
Yuanchao Bai
Xu Yang
Xianming Liu
Junjun Jiang
Yaowei Wang
Xiangyang Ji
Wen Gao
ViT
82
51
0
17 Dec 2021
Fast Point Transformer
Fast Point Transformer
Chunghyun Park
Yoonwoo Jeong
Minsu Cho
Jaesik Park
3DPCViT
72
171
0
09 Dec 2021
SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer
  and Removal
SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal
Zhaoyang Sun
Yaxiong Chen
Shengwu Xiong
ViT
57
38
0
07 Dec 2021
CCTrans: Simplifying and Improving Crowd Counting with Transformer
CCTrans: Simplifying and Improving Crowd Counting with Transformer
Ye Tian
Xiangxiang Chu
Hongpeng Wang
ViT
62
78
0
29 Sep 2021
Rethinking and Improving Relative Position Encoding for Vision
  Transformer
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
88
338
0
29 Jul 2021
HAT: Hierarchical Aggregation Transformers for Person Re-identification
HAT: Hierarchical Aggregation Transformers for Person Re-identification
Guowen Zhang
Pingping Zhang
Jinqing Qi
Huchuan Lu
ViT
111
119
0
13 Jul 2021
Combining EfficientNet and Vision Transformers for Video Deepfake
  Detection
Combining EfficientNet and Vision Transformers for Video Deepfake Detection
D. Coccomini
Nicola Messina
Claudio Gennaro
Fabrizio Falchi
ViT
86
174
0
06 Jul 2021
UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
Yunhe Gao
Mu Zhou
Dimitris N. Metaxas
MedImViT
76
430
0
02 Jul 2021
VOLO: Vision Outlooker for Visual Recognition
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
116
327
0
24 Jun 2021
Fully Transformer Networks for Semantic Image Segmentation
Fully Transformer Networks for Semantic Image Segmentation
Sitong Wu
Tianyi Wu
Fangjian Lin
Sheng Tian
Guodong Guo
ViT
60
39
0
08 Jun 2021
Diverse Part Discovery: Occluded Person Re-identification with
  Part-Aware Transformer
Diverse Part Discovery: Occluded Person Re-identification with Part-Aware Transformer
Yulin Li
Jianfeng He
Tianzhu Zhang
Xiang Liu
Yongdong Zhang
Feng Wu
ViT
79
302
0
08 Jun 2021
Uformer: A General U-Shaped Transformer for Image Restoration
Uformer: A General U-Shaped Transformer for Image Restoration
Zhendong Wang
Xiaodong Cun
Jianmin Bao
Wengang Zhou
Jianzhuang Liu
Houqiang Li
ViT
117
1,413
0
06 Jun 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with
  Transformers
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
318
5,072
0
31 May 2021
Medical Image Segmentation Using Squeeze-and-Expansion Transformers
Medical Image Segmentation Using Squeeze-and-Expansion Transformers
Shaohua Li
Xiuchao Sui
Xiangde Luo
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
ViTMedIm
64
167
0
20 May 2021
Segmenter: Transformer for Semantic Segmentation
Segmenter: Transformer for Semantic Segmentation
Robin Strudel
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
ViT
215
1,470
0
12 May 2021
Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation
Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation
Hu Cao
Yueyue Wang
Jieneng Chen
Dongsheng Jiang
Xiaopeng Zhang
Qi Tian
Manning Wang
ViTMedIm
141
2,922
0
12 May 2021
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
Bing Li
Cheng Zheng
Silvio Giancola
Guohao Li
ViT3DPC
83
44
0
10 May 2021
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale
  Place Recognition
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition
Zhaoxin Fan
Zhenbo Song
Hongyan Liu
Zhiwu Lu
Jun He
Xiaoyong Du
3DPCViT
141
74
0
01 May 2021
Point Cloud Learning with Transformer
Point Cloud Learning with Transformer
Xian-Feng Han
Yuming Kuang
ViT
80
34
0
28 Apr 2021
Dual Transformer for Point Cloud Analysis
Dual Transformer for Point Cloud Analysis
Xian-Feng Han
Yi-Fei Jin
Hui Cheng
Guoqiang Xiao
ViT
84
75
0
27 Apr 2021
A Video Is Worth Three Views: Trigeminal Transformers for Video-based
  Person Re-identification
A Video Is Worth Three Views: Trigeminal Transformers for Video-based Person Re-identification
Xuehu Liu
Pingping Zhang
Chenyang Yu
Huchuan Lu
Xuesheng Qian
Xiaoyun Yang
ViT
75
48
0
05 Apr 2021
Going deeper with Image Transformers
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
160
1,021
0
31 Mar 2021
Spatiotemporal Transformer for Video-based Person Re-identification
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu Zhang
Longhui Wei
Lingxi Xie
Zijie Zhuang
Yongfei Zhang
Yue Liu
Qi Tian
ViT
93
32
0
30 Mar 2021
Transformer Tracking
Transformer Tracking
Xin Chen
Bin Yan
Jiawen Zhu
Dong Wang
Xiaoyun Yang
Huchuan Lu
ViT
69
961
0
29 Mar 2021
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image
  Classification
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Chun-Fu Chen
Quanfu Fan
Yikang Shen
ViT
71
1,484
0
27 Mar 2021
DeepViT: Towards Deeper Vision Transformer
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou
Bingyi Kang
Xiaojie Jin
Linjie Yang
Xiaochen Lian
Zihang Jiang
Qibin Hou
Jiashi Feng
ViT
104
523
0
22 Mar 2021
UNETR: Transformers for 3D Medical Image Segmentation
UNETR: Transformers for 3D Medical Image Segmentation
Ali Hatamizadeh
Yucheng Tang
Vishwesh Nath
Dong Yang
Andriy Myronenko
Bennett Landman
H. Roth
Daguang Xu
ViTMedIm
182
1,614
0
18 Mar 2021
DanceFormer: Music Conditioned 3D Dance Generation with Parametric
  Motion Transformer
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer
Buyu Li
Yongchi Zhao
Zhelun Shi
Lu Sheng
47
134
0
18 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
89
390
0
14 Mar 2021
Generative Adversarial Transformers
Generative Adversarial Transformers
Drew A. Hudson
C. L. Zitnick
ViT
76
181
0
01 Mar 2021
Medical Transformer: Gated Axial-Attention for Medical Image
  Segmentation
Medical Transformer: Gated Axial-Attention for Medical Image Segmentation
Jeya Maria Jose Valanarasu
Poojan Oza
Ilker Hacihaliloglu
Vishal M. Patel
ViTMedIm
108
993
0
21 Feb 2021
TransReID: Transformer-based Object Re-Identification
TransReID: Transformer-based Object Re-Identification
Shuting He
Haowen Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
273
819
0
08 Feb 2021
TransUNet: Transformers Make Strong Encoders for Medical Image
  Segmentation
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViTMedIm
98
3,499
0
08 Feb 2021
Trear: Transformer-based RGB-D Egocentric Action Recognition
Trear: Transformer-based RGB-D Egocentric Action Recognition
Xiangyu Li
Yonghong Hou
Pichao Wang
Zhimin Gao
Mingliang Xu
Wanqing Li
ViT
215
87
0
05 Jan 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective
  with Transformers
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
194
2,911
0
31 Dec 2020
Point Transformer
Point Transformer
Nico Engel
Vasileios Belagiannis
Klaus C. J. Dietmayer
3DPC
183
2,004
0
02 Nov 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT3DVPINN
437
13,108
0
26 May 2020
1