Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.09681
Cited By
XCiT: Cross-Covariance Image Transformers
17 June 2021
Alaaeldin El-Nouby
Hugo Touvron
Mathilde Caron
Piotr Bojanowski
Matthijs Douze
Armand Joulin
Ivan Laptev
Natalia Neverova
Gabriel Synnaeve
Jakob Verbeek
Hervé Jégou
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XCiT: Cross-Covariance Image Transformers"
50 / 283 papers shown
Title
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Ning Zhang
F. Nex
G. Vosselman
N. Kerle
MDE
41
153
0
23 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
35
31
0
18 Nov 2022
What Images are More Memorable to Machines?
Junlin Han
Huangying Zhan
Jie Hong
Pengfei Fang
Hongdong Li
L. Petersson
Ian Reid
30
3
0
14 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
36
657
0
10 Nov 2022
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Luke Boegner
Garrett M. Vanhoy
Phillip Vallance
Manbir Gulati
Dresden Feitzinger
B. Comar
Rob Miller
AI4TS
10
6
0
04 Nov 2022
Attention-based Neural Cellular Automata
Mattie Tesfaldet
Derek Nowrouzezahrai
C. Pal
ViT
29
17
0
02 Nov 2022
Similarity of Neural Architectures using Adversarial Attack Transferability
Jaehui Hwang
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
29
1
0
20 Oct 2022
Scratching Visual Transformer's Back with Uniform Attention
Nam Hyeon-Woo
Kim Yu-Ji
Byeongho Heo
Doonyoon Han
Seong Joon Oh
Tae-Hyun Oh
353
23
0
16 Oct 2022
Prediction Calibration for Generalized Few-shot Semantic Segmentation
Zhihe Lu
Sen He
Da Li
Yi-Zhe Song
Tao Xiang
ViT
27
22
0
15 Oct 2022
CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models
Denis Kuznedelev
Eldar Kurtic
Elias Frantar
Dan Alistarh
VLM
ViT
11
11
0
14 Oct 2022
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling
Jinchao Zhang
Shuyang Jiang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
3DV
43
9
0
14 Oct 2022
Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
Zhiying Lu
Hongtao Xie
Chuanbin Liu
Yongdong Zhang
ViT
25
57
0
12 Oct 2022
The Lie Derivative for Measuring Learned Equivariance
Nate Gruver
Marc Finzi
Micah Goldblum
A. Wilson
18
34
0
06 Oct 2022
Natural Color Fool: Towards Boosting Black-box Unrestricted Attacks
Shengming Yuan
Qilong Zhang
Lianli Gao
Yaya Cheng
Jingkuan Song
AAML
24
42
0
05 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
27
25
0
03 Oct 2022
MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features
S. Wadekar
Abhishek Chaurasia
ViT
98
87
0
30 Sep 2022
Effective Vision Transformer Training: A Data-Centric Perspective
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
26
5
0
29 Sep 2022
A Light Recipe to Train Robust Vision Transformers
Edoardo Debenedetti
Vikash Sehwag
Prateek Mittal
ViT
29
68
0
15 Sep 2022
On the interplay of adversarial robustness and architecture components: patches, convolution and attention
Francesco Croce
Matthias Hein
41
6
0
14 Sep 2022
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Rui Wang
Zuxuan Wu
Dongdong Chen
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Luowei Zhou
Lu Yuan
Yu-Gang Jiang
ViT
37
4
0
25 Aug 2022
Exploring Adversarial Robustness of Vision Transformers in the Spectral Perspective
Gihyun Kim
Juyeop Kim
Jong-Seok Lee
AAML
ViT
21
4
0
20 Aug 2022
SO(3)-Pose: SO(3)-Equivariance Learning for 6D Object Pose Estimation
Haoran Pan
Jun Zhou
Yuanpeng Liu
Xuequan Lu
Weiming Wang
Xu Yan
Mingqiang Wei
24
5
0
17 Aug 2022
D3Former: Debiased Dual Distilled Transformer for Incremental Learning
Abdel-rahman Mohamed
Rushali Grandhe
KJ Joseph
Salman Khan
F. Khan
CLL
19
9
0
25 Jul 2022
Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness
Chaoning Zhang
Kang Zhang
Chenshuang Zhang
Axi Niu
Jiu Feng
Chang D. Yoo
In So Kweon
SSL
35
24
0
22 Jul 2022
Magic ELF: Image Deraining Meets Association Learning and Transformer
Kui Jiang
Zhongyuan Wang
Chen Chen
Zheng Wang
Laizhong Cui
Chia-Wen Lin
ViT
14
63
0
21 Jul 2022
Large Scale Radio Frequency Signal Classification
Luke Boegner
Manbir Gulati
Garrett M. Vanhoy
Phillip Vallance
B. Comar
S. Kokalj-Filipovic
Craig T. Lennon
Rob Miller
14
15
0
20 Jul 2022
Lightweight Vision Transformer with Cross Feature Attention
Youpeng Zhao
Huadong Tang
Yingying Jiang
A. Yong
Qiang Wu
ViT
22
10
0
15 Jul 2022
Imaging through the Atmosphere using Turbulence Mitigation Transformer
Xingguang Zhang
Zhiyuan Mao
Nicholas Chimitt
Stanley H. Chan
ViT
24
22
0
13 Jul 2022
LightViT: Towards Light-Weight Convolution-Free Vision Transformers
Tao Huang
Lang Huang
Shan You
Fei Wang
Chao Qian
Chang Xu
ViT
17
55
0
12 Jul 2022
Vision Transformers: State of the Art and Research Challenges
Bo-Kai Ruan
Hong-Han Shuai
Wen-Huang Cheng
ViT
24
17
0
07 Jul 2022
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
Runsheng Xu
Zhengzhong Tu
Hao Xiang
Wei Shao
Bolei Zhou
Jiaqi Ma
56
218
0
05 Jul 2022
Self-supervised Learning in Remote Sensing: A Review
Yi Wang
C. Albrecht
Nassim Ait Ali Braham
Lichao Mou
Xiao Xiang Zhu
24
218
0
27 Jun 2022
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Muhammad Maaz
Abdelrahman M. Shaker
Hisham Cholakkal
Salman Khan
Syed Waqas Zamir
Rao Muhammad Anwer
F. Khan
ViT
27
184
0
21 Jun 2022
Vicinity Vision Transformer
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
34
31
0
21 Jun 2022
Global Context Vision Transformers
Ali Hatamizadeh
Hongxu Yin
Greg Heinrich
Jan Kautz
Pavlo Molchanov
ViT
17
120
0
20 Jun 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Yibo Yang
Yong Liu
Dacheng Tao
ViT
34
32
0
19 Jun 2022
SimA: Simple Softmax-free Attention for Vision Transformers
Soroush Abbasi Koohpayegani
Hamed Pirsiavash
16
25
0
17 Jun 2022
Patch-level Representation Learning for Self-supervised Vision Transformers
Sukmin Yun
Hankook Lee
Jaehyung Kim
Jinwoo Shin
ViT
22
64
0
16 Jun 2022
Efficient Decoder-free Object Detection with Transformers
Peixian Chen
Mengdan Zhang
Yunhang Shen
Kekai Sheng
Yuting Gao
Xing Sun
Ke Li
Chunhua Shen
ViT
41
16
0
14 Jun 2022
Peripheral Vision Transformer
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViT
MDE
29
30
0
14 Jun 2022
Fair Comparison between Efficient Attentions
Jiuk Hong
Chaehyeon Lee
Soyoun Bang
Heechul Jung
19
1
0
01 Jun 2022
Learning Instance-Specific Augmentations by Capturing Local Invariances
Ning Miao
Tom Rainforth
Emile Mathieu
Yann Dubois
Yee Whye Teh
Adam Foster
Hyunjik Kim
34
10
0
31 May 2022
Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small Datasets
Leandro M. de Lima
R. Krohling
ViT
MedIm
28
10
0
30 May 2022
A Closer Look at Self-Supervised Lightweight Vision Transformers
Shaoru Wang
Jin Gao
Zeming Li
Jian Sun
Weiming Hu
ViT
67
41
0
28 May 2022
Object-wise Masked Autoencoders for Fast Pre-training
Jiantao Wu
Shentong Mo
ViT
OCL
19
15
0
28 May 2022
X-ViT: High Performance Linear Vision Transformer without Softmax
Jeonggeun Song
Heung-Chang Lee
ViT
17
2
0
27 May 2022
Fast Vision Transformers with HiLo Attention
Zizheng Pan
Jianfei Cai
Bohan Zhuang
28
152
0
26 May 2022
Scalable and Efficient Training of Large Convolutional Neural Networks with Differential Privacy
Zhiqi Bu
J. Mao
Shiyun Xu
131
47
0
21 May 2022
Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging
Yuanhao Cai
Jing Lin
Haoqian Wang
Xin Yuan
Henghui Ding
Yulun Zhang
Radu Timofte
Luc Van Gool
80
116
0
20 May 2022
TRT-ViT: TensorRT-oriented Vision Transformer
Xin Xia
Jiashi Li
Jie Wu
Xing Wang
Xuefeng Xiao
Min Zheng
Rui Wang
ViT
21
27
0
19 May 2022
Previous
1
2
3
4
5
6
Next