CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
1 July 2021
Xiaoyi Dong
Jianmin Bao
Dongdong Chen
Weiming Zhang
Nenghai Yu
Lu Yuan
Dong Chen
B. Guo
ViT
Papers citing "CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows"
50 / 440 papers shown
Content-aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers
Chenyang Lu
Daan de Geus
Gijs Dubbelman
ViT
30
20
0
03 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
45
0
0
02 Jun 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
45
160
0
01 Jun 2023
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners
Sarthak Yadav
Sergios Theodoridis
Lars Kai Hansen
Zheng-Hua Tan
28
7
0
01 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
52
28
0
01 Jun 2023
Are Large Kernels Better Teachers than Transformers for ConvNets?
Tianjin Huang
Lu Yin
Zhenyu Zhang
Lijuan Shen
Meng Fang
Mykola Pechenizkiy
Zhangyang Wang
Shiwei Liu
38
13
0
30 May 2023
AMatFormer: Efficient Feature Matching via Anchor Matching Transformer
Bo Jiang
S. Luo
Tianlin Li
Chuanfu Li
Jin Tang
38
8
0
30 May 2023
Predicting Token Impact Towards Efficient Vision Transformer
Hong Wang
Su Yang
Xiaoke Huang
Weishan Zhang
25
0
0
24 May 2023
Dual Path Transformer with Partition Attention
Zhengkai Jiang
Liang Liu
Jiangning Zhang
Yabiao Wang
Mingang Chen
Chengjie Wang
ViT
36
2
0
24 May 2023
Efficient Large-Scale Visual Representation Learning And Evaluation
Eden Dolev
A. Awad
Denisa Roberts
Zahra Ebrahimzadeh
Marcin Mejran
Vaibhav Malpani
Mahir Yavuz
45
0
0
22 May 2023
GELU Activation Function in Deep Learning: A Comprehensive Mathematical Analysis and Performance
Minhyeok Lee
26
30
0
20 May 2023
Reciprocal Attention Mixing Transformer for Lightweight Image Restoration
Haram Choi
Cheolwoong Na
Jihyeon Oh
Seungjae Lee
Jinseop S. Kim
Subeen Choe
Jeongmin Lee
Taehoon Kim
Jihoon Yang
51
5
0
19 May 2023
Dual flow fusion model for concrete surface crack segmentation
Yuwei Duan
14
1
0
09 May 2023
OctFormer: Octree-based Transformers for 3D Point Clouds
Peng-Shuai Wang
ViT
3DPC
34
82
0
04 May 2023
AxWin Transformer: A Context-Aware Vision Transformer Backbone with Axial Windows
Fangjian Lin
Yizhe Ma
Sitong Wu
Long Yu
Sheng Tian
ViT
21
5
0
02 May 2023
PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation
Yizhe Ma
Fangjian Lin
Sitong Wu
Sheng Tian
Long Yu
37
12
0
01 May 2023
Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI
Yuheng Li
Jacob F. Wynne
Jing Wang
Richard L. J. Qiu
J. Roper
...
A. Jani
Tian Liu
P. Patel
H. Mao
Xiaofeng Yang
OOD
ViT
MedIm
30
10
0
30 Apr 2023
UniNeXt: Exploring A Unified Architecture for Vision Recognition
Fangjian Lin
Jianlong Yuan
Sitong Wu
Fan Wang
Zhibin Wang
ViT
32
14
0
26 Apr 2023
ScatterFormer: Locally-Invariant Scattering Transformer for Patient-Independent Multispectral Detection of Epileptiform Discharges
Rui-Hua Zheng
Jun Yu Li
Yi Wang
Tian Luo
Yuguo Yu
MedIm
40
4
0
26 Apr 2023
NTIRE 2023 Challenge on Light Field Image Super-Resolution: Dataset, Methods and Results
Yingqian Wang
Longguang Wang
Zhengyu Liang
Jung-Mo Yang
Radu Timofte
Y. Guo
44
39
0
20 Apr 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi
Jianan Wang
Yihao Chen
Yukai Shi
Lei Zhang
46
16
0
19 Apr 2023
SViTT: Temporal Learning of Sparse Video-Text Transformers
Yi Li
Kyle Min
Subarna Tripathi
Nuno Vasconcelos
31
12
0
18 Apr 2023
AutoTaskFormer: Searching Vision Transformers for Multi-task Learning
Yang Liu
Shen Yan
Yuge Zhang
Kan Ren
Quan Zhang
Zebin Ren
Deng Cai
Mi Zhang
ViT
32
0
0
18 Apr 2023
EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation
Ilwi Yun
Chanyong Shin
Hyunku Lee
Hyuk-Jae Lee
Chae-Eun Rhee
ViT
MDE
32
17
0
16 Apr 2023
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding
Yu-Qi Yang
Yu-Xiao Guo
Jiangfeng Xiong
Yang Liu
Hao Pan
Peng-Shuai Wang
Xin Tong
B. Guo
ViT
35
77
0
14 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
37
47
0
13 Apr 2023
RSIR Transformer: Hierarchical Vision Transformer using Random Sampling Windows and Important Region Windows
Zhemin Zhang
Xun Gong
ViT
21
1
0
13 Apr 2023
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Xuran Pan
Tianzhu Ye
Zhuofan Xia
S. Song
Gao Huang
ViT
36
53
0
09 Apr 2023
MC-MLP:Multiple Coordinate Frames in all-MLP Architecture for Vision
Zhimin Zhu
Jianguo Zhao
Tong Mu
Yuliang Yang
Mengyu Zhu
40
0
0
08 Apr 2023
PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and Progressive Shift
Gaojie Wu
Weishi Zheng
Yutong Lu
Q. Tian
ViT
48
15
0
07 Apr 2023
Towards an Effective and Efficient Transformer for Rain-by-snow Weather Removal
Tao Gao
Yuanbo Wen
Kaihao Zhang
Peng Cheng
Ting Chen
ViT
53
5
0
06 Apr 2023
MULLER: Multilayer Laplacian Resizer for Vision
Zhengzhong Tu
P. Milanfar
Hossein Talebi
45
3
0
06 Apr 2023
SMPConv: Self-moving Point Representations for Continuous Convolution
Sanghyeon Kim
Eunbyung Park
3DPC
42
13
0
05 Apr 2023
Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising
Miaoyu Li
Ji Liu
Ying Fu
Yulun Zhang
Dejing Dou
ViT
13
58
0
03 Apr 2023
SVT: Supertoken Video Transformer for Efficient Video Understanding
Chen-Ming Pan
Rui Hou
Hanchao Yu
Qifan Wang
Senem Velipasalar
Madian Khabsa
ViT
29
0
0
01 Apr 2023
Rethinking Local Perception in Lightweight Vision Transformer
Qi Fan
Huaibo Huang
Jiyang Guan
Ran He
ViT
31
30
0
31 Mar 2023
Dual Cross-Attention for Medical Image Segmentation
Gorkem Can Ates
P. Mohan
Emrah Çelik
17
75
0
30 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
48
119
0
29 Mar 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
29
38
0
27 Mar 2023
Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution
Gang Wu
Junjun Jiang
Yuanchao Bai
Xianming Liu
SupR
ViT
25
6
0
25 Mar 2023
Spherical Transformer for LiDAR-based 3D Recognition
Xin Lai
Yukang Chen
Fanbin Lu
Jianhui Liu
Jiaya Jia
3DPC
37
126
0
22 Mar 2023
OcTr: Octree-based Transformer for 3D Object Detection
Chao Zhou
Yanan Zhang
Jiaxin Chen
Di Huang
3DPC
ViT
27
42
0
22 Mar 2023
Robustifying Token Attention for Vision Transformers
Yong Guo
David Stutz
Bernt Schiele
ViT
23
24
0
20 Mar 2023
Dual-path Adaptation from Image to Video Transformers
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
21
37
0
17 Mar 2023
BiFormer: Vision Transformer with Bi-Level Routing Attention
Lei Zhu
Xinjiang Wang
Zhanghan Ke
Wayne Zhang
Rynson W. H. Lau
134
487
0
15 Mar 2023
Making Vision Transformers Efficient from A Token Sparsification View
Shuning Chang
Pichao Wang
Ming Lin
Fan Wang
David Junhao Zhang
Rong Jin
Mike Zheng Shou
ViT
45
24
0
15 Mar 2023
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Hengyuan Zhao
Hao Luo
Yuyang Zhao
Pichao Wang
F. Wang
Mike Zheng Shou
29
5
0
14 Mar 2023
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention
Wenxiao Wang
Wei Chen
Qibo Qiu
Long Chen
Boxi Wu
Binbin Lin
Xiaofei He
Wei Liu
35
38
0
13 Mar 2023
Recursive Generalization Transformer for Image Super-Resolution
Zheng Chen
Yulun Zhang
Jinjin Gu
L. Kong
Xiaokang Yang
ViT
21
27
0
11 Mar 2023
Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space
Yahui Liu
Bin Wang
Yisheng Lv
Lingxi Li
Feiyue Wang
ViT
3DPC
25
43
0
08 Mar 2023