Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.00652
Cited By
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
1 July 2021
Xiaoyi Dong
Jianmin Bao
Dongdong Chen
Weiming Zhang
Nenghai Yu
Lu Yuan
Dong Chen
B. Guo
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows"
50 / 440 papers shown
Title
Gramian Attention Heads are Strong yet Efficient Vision Learners
Jongbin Ryu
Dongyoon Han
J. Lim
38
1
0
25 Oct 2023
Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning
Zhiming Qian
VLM
SSL
22
0
0
11 Oct 2023
Plug n' Play: Channel Shuffle Module for Enhancing Tiny Vision Transformers
Xuwei Xu
Sen Wang
Yudong Chen
Jiajun Liu
ViT
26
1
0
09 Oct 2023
Hierarchical Side-Tuning for Vision Transformers
Weifeng Lin
Ziheng Wu
Wentao Yang
Mingxin Huang
Jun Huang
Lianwen Jin
31
8
0
09 Oct 2023
Low-Resolution Self-Attention for Semantic Segmentation
Yu-Huan Wu
Shi-Chen Zhang
Yun-Hai Liu
Le Zhang
Xin Zhan
Daquan Zhou
Jiashi Feng
Ming-Ming Cheng
Liangli Zhen
ViT
45
3
0
08 Oct 2023
TiC: Exploring Vision Transformer in Convolution
Song Zhang
Qingzhong Wang
Jiang Bian
Haoyi Xiong
ViT
37
1
0
06 Oct 2023
A Complementary Global and Local Knowledge Network for Ultrasound denoising with Fine-grained Refinement
Zhenyu Bu
Kaini Wang
Fuxing Zhao
Shengxiao Li
Guangquan Zhou
22
0
0
05 Oct 2023
Multiple Physics Pretraining for Physical Surrogate Models
Michael McCabe
Bruno Régaldo-Saint Blancard
Liam Parker
Ruben Ohana
M. Cranmer
...
Francois Lanusse
Mariel Pettee
Tiberiu Teşileanu
Kyunghyun Cho
Shirley Ho
PINN
AI4CE
40
53
0
04 Oct 2023
TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation
Yahia Dalbah
Jean Lahoud
Hisham Cholakkal
27
7
0
03 Oct 2023
When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo
Tianqi Liu
Xinyi Ye
Weiyue Zhao
Zhiyu Pan
Min Shi
Zhiguo Cao
29
12
0
29 Sep 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
J. Han
Guiguang Ding
ViT
34
6
0
27 Sep 2023
UniHead: Unifying Multi-Perception for Detection Heads
Hantao Zhou
Rui Yang
Yachao Zhang
Haoran Duan
Yawen Huang
R. Hu
Xiu Li
Yefeng Zheng
31
12
0
23 Sep 2023
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
Zhenzhen Chu
Jiayu Chen
Cen Chen
Chengyu Wang
Ziheng Wu
Jun Huang
Weining Qian
ViT
13
2
0
21 Sep 2023
RMT: Retentive Networks Meet Vision Transformers
Qihang Fan
Huaibo Huang
Mingrui Chen
Hongmin Liu
Ran He
ViT
43
75
0
20 Sep 2023
Multi-Context Dual Hyper-Prior Neural Image Compression
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Mohammad Akyash
Hossein Kashiani
Nasser M. Nasrabadi
32
6
0
19 Sep 2023
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Henry Hengyuan Zhao
Pichao Wang
Yuyang Zhao
Hao Luo
F. Wang
Mike Zheng Shou
ViT
37
14
0
15 Sep 2023
Dataset Condensation via Generative Model
David Junhao Zhang
Heng Wang
Chuhui Xue
Rui Yan
Wenqing Zhang
Song Bai
Mike Zheng Shou
DD
26
11
0
14 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
32
16
0
13 Sep 2023
HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen
Xintao Wang
Wenlong Zhang
Xiangtao Kong
Yu Qiao
Jiantao Zhou
Chao Dong
32
44
0
11 Sep 2023
Mask-Attention-Free Transformer for 3D Instance Segmentation
Xin Lai
Yuhui Yuan
Ruihang Chu
Yukang Chen
Han Hu
Jiaya Jia
MedIm
ISeg
3DPC
50
30
0
04 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
32
25
0
04 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Qi Han
Yuxuan Cai
Xiangyu Zhang
41
7
0
02 Sep 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Wenhan Luo
Hongdong Li
Zhi Jin
ViT
39
85
0
27 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
36
20
0
27 Aug 2023
Vision Transformer Adapters for Generalizable Multitask Learning
Deblina Bhattacharjee
Sabine Süsstrunk
Mathieu Salzmann
ViT
21
8
0
23 Aug 2023
SG-Former: Self-guided Transformer with Evolving Token Reallocation
Sucheng Ren
Xingyi Yang
Songhua Liu
Xinchao Wang
ViT
27
41
0
23 Aug 2023
Long-Range Grouping Transformer for Multi-View 3D Reconstruction
Liying Yang
Zhenwei Zhu
Xuxin Lin
Jian Nong
Yanyan Liang
ViT
27
6
0
17 Aug 2023
SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers
Xijun Wang
Xiaojie Chu
Chunrui Han
Xiangyu Zhang
ViT
23
1
0
14 Aug 2023
Revisiting Vision Transformer from the View of Path Ensemble
Shuning Chang
Pichao Wang
Haowen Luo
Fan Wang
Mike Zheng Shou
ViT
40
3
0
12 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
39
2
0
10 Aug 2023
Deformable Mixer Transformer with Gating for Multi-Task Learning of Dense Prediction
Yangyang Xu
Yibo Yang
Bernard Ghanemm
Lefei Zhang
Du Bo
Dacheng Tao
21
1
0
10 Aug 2023
PVG: Progressive Vision Graph for Vision Recognition
Jiafu Wu
Jian Li
Jiangning Zhang
Boshen Zhang
M. Chi
Yabiao Wang
Chengjie Wang
ViT
30
13
0
01 Aug 2023
FLatten Transformer: Vision Transformer using Focused Linear Attention
Dongchen Han
Xuran Pan
Yizeng Han
Shiji Song
Gao Huang
23
158
0
01 Aug 2023
A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and beyond
Junyu Chen
Yihao Liu
Shuwen Wei
Zhangxing Bian
Shalini Subramanian
A. Carass
Jerry L. Prince
Yong Du
OOD
45
36
0
28 Jul 2023
Adaptive Segmentation Network for Scene Text Detection
Gui-yan Zhao
SSeg
35
1
0
27 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
Fahad Shahbaz Khan
VLM
38
119
0
25 Jul 2023
A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation
Jinjing Zhu
Yuan Luo
Xueye Zheng
Hao Wang
Lin Wang
25
33
0
24 Jul 2023
As large as it gets: Learning infinitely large Filters via Neural Implicit Functions in the Fourier Domain
Julia Grabinski
J. Keuper
M. Keuper
23
6
0
19 Jul 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
34
178
0
18 Jul 2023
Vision Language Transformers: A Survey
Clayton Fields
C. Kennington
VLM
28
5
0
06 Jul 2023
Art Authentication with Vision Transformers
Ludovica Schaerf
Carina Popovici
Eric Postma
ViT
14
9
0
06 Jul 2023
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers
Jakob Drachmann Havtorn
Amelie Royer
Tijmen Blankevoort
B. Bejnordi
30
8
0
05 Jul 2023
X-MLP: A Patch Embedding-Free MLP Architecture for Vision
Xinyue Wang
Zhicheng Cai
Chenglei Peng
ViT
24
5
0
02 Jul 2023
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter
Binjie Zhang
Yixiao Ge
Xuyuan Xu
Ying Shan
Mike Zheng Shou
52
7
0
22 Jun 2023
Reviving Shift Equivariance in Vision Transformers
Peijian Ding
Davit Soselia
Thomas Armstrong
Jiahao Su
Furong Huang
25
7
0
13 Jun 2023
2-D SSM: A General Spatial Layer for Visual Transformers
Ethan Baron
Itamar Zimerman
Lior Wolf
32
14
0
11 Jun 2023
FalconNet: Factorization for the Light-weight ConvNets
Zhicheng Cai
Qiu Shen
32
11
0
10 Jun 2023
SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers
Bowen Zhang
Liyang Liu
Minh Hieu Phan
Zhi Tian
Chunhua Shen
Yifan Liu
ViT
34
28
0
09 Jun 2023
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
Greg Heinrich
Hongxu Yin
Andrew Tao
J. Álvarez
Jan Kautz
Pavlo Molchanov
ViT
28
68
0
09 Jun 2023
Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding
Jie Gui
Xiaofeng Cong
Lei He
Yuan Yan Tang
James T. Kwok
20
11
0
09 Jun 2023
Previous
1
2
3
4
5
6
7
8
9
Next