Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.09681
Cited By
XCiT: Cross-Covariance Image Transformers
17 June 2021
Alaaeldin El-Nouby
Hugo Touvron
Mathilde Caron
Piotr Bojanowski
Matthijs Douze
Armand Joulin
Ivan Laptev
Natalia Neverova
Gabriel Synnaeve
Jakob Verbeek
Hervé Jégou
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XCiT: Cross-Covariance Image Transformers"
50 / 283 papers shown
Title
UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images
Zhen Chen
Qing Xu
Xinyu Liu
Yixuan Yuan
MedIm
31
15
0
26 Feb 2024
Key Design Choices in Source-Free Unsupervised Domain Adaptation: An In-depth Empirical Analysis
Andrea Maracani
Raffaello Camoriano
Elisa Maiettini
Davide Talon
Lorenzo Rosasco
Lorenzo Natale
39
1
0
25 Feb 2024
Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers
Markus Hiller
Krista A. Ehinger
Tom Drummond
38
1
0
19 Feb 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
41
29
0
29 Jan 2024
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness
Rongyu Zhang
Zefan Cai
Huanrui Yang
Zidong Liu
Denis A. Gudovskiy
...
Kurt Keutzer
Baobao Chang
Yuan Du
Li Du
Shanghang Zhang
VLM
27
1
0
15 Jan 2024
Transformer-CNN Fused Architecture for Enhanced Skin Lesion Segmentation
Siddharth Tiwari
MedIm
ViT
42
0
0
10 Jan 2024
FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring
Geunhyuk Youk
Jihyong Oh
Munchurl Kim
30
9
0
08 Jan 2024
MetaISP -- Exploiting Global Scene Structure for Accurate Multi-Device Color Rendition
Matheus Souza
Wolfgang Heidrich
16
3
0
06 Jan 2024
SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization
Xixu Hu
Runkai Zheng
Jindong Wang
Cheuk Hang Leung
Qi Wu
Xing Xie
32
1
0
02 Jan 2024
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
Chenhang He
Ruihuang Li
Guowen Zhang
Lei Zhang
27
4
0
01 Jan 2024
Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence
Hongyuan Wang
Lizhi Wang
Jiang Xu
Chang Chen
Xue Hu
Fenglong Song
Youliang Yan
19
0
0
20 Dec 2023
Linear Attention via Orthogonal Memory
Jun Zhang
Shuyang Jiang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
40
3
0
18 Dec 2023
Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In Context
Xiang Cheng
Yuxin Chen
S. Sra
18
35
0
11 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
35
0
0
08 Dec 2023
Assessing Neural Network Representations During Training Using Noise-Resilient Diffusion Spectral Entropy
Danqi Liao
Chen Liu
Benjamin W. Christensen
Alexander Tong
Guillaume Huguet
Guy Wolf
Maximilian Nickel
Ian M. Adelstein
Smita Krishnaswamy
DiffM
47
5
0
04 Dec 2023
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
44
0
0
01 Dec 2023
Efficient Baseline for Quantitative Precipitation Forecasting in Weather4cast 2023
Akshay Punjabi
Pablo Izquierdo-Ayala
20
0
0
30 Nov 2023
Correlated Attention in Transformers for Multivariate Time Series
Quang Minh Nguyen
Lam M. Nguyen
Subhro Das
AI4TS
21
0
0
20 Nov 2023
Dynamic Association Learning of Self-Attention and Convolution in Image Restoration
Kui Jiang
Xuemei Jia
Wenxin Huang
Wenbin Wang
Zheng Wang
Junjun Jiang
20
1
0
09 Nov 2023
CCMR: High Resolution Optical Flow Estimation via Coarse-to-Fine Context-Guided Motion Reasoning
Azin Jahedi
Maximilian Luz
Marc Rivinius
Andrés Bruhn
22
2
0
05 Nov 2023
ViR: Towards Efficient Vision Retention Backbones
Ali Hatamizadeh
Michael Ranzinger
Shiyi Lan
Jose M. Alvarez
Sanja Fidler
Jan Kautz
GNN
22
1
0
30 Oct 2023
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
Karsten Roth
Lukas Thede
Almut Sophia Koepke
Oriol Vinyals
Olivier J. Hénaff
Zeynep Akata
AAML
22
11
0
26 Oct 2023
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems
David T. Hoffmann
Simon Schrodi
Jelena Bratulić
Nadine Behrmann
Volker Fischer
Thomas Brox
30
5
0
19 Oct 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
29
8
0
16 Oct 2023
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Shuyang Jiang
Jinchao Zhang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
54
0
0
14 Oct 2023
Entropic Score metric: Decoupling Topology and Size in Training-free NAS
Niccolò Cavagnero
Luc Robbiano
Francesca Pistilli
Barbara Caputo
Giuseppe Averta
18
2
0
06 Oct 2023
SlowFormer: Universal Adversarial Patch for Attack on Compute and Energy Efficiency of Inference Efficient Vision Transformers
K. Navaneet
Soroush Abbasi Koohpayegani
Essam Sleiman
Hamed Pirsiavash
AAML
ViT
13
1
0
04 Oct 2023
Understanding Masked Autoencoders From a Local Contrastive Perspective
Xiaoyu Yue
Lei Bai
Meng Wei
Jiangmiao Pang
Xihui Liu
Luping Zhou
Wanli Ouyang
SSL
64
4
0
03 Oct 2023
Win-Win: Training High-Resolution Vision Transformers from Two Windows
Vincent Leroy
Jérôme Revaud
Thomas Lucas
Philippe Weinzaepfel
ViT
34
2
0
01 Oct 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
J. Han
Guiguang Ding
ViT
31
6
0
27 Sep 2023
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
Zhenzhen Chu
Jiayu Chen
Cen Chen
Chengyu Wang
Ziheng Wu
Jun Huang
Weining Qian
ViT
13
2
0
21 Sep 2023
DimCL: Dimensional Contrastive Learning For Improving Self-Supervised Learning
Thanh Nguyen
T. Pham
Chaoning Zhang
T. Luu
Thang Vu
Chang-Dong Yoo
27
9
0
21 Sep 2023
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Henry Hengyuan Zhao
Pichao Wang
Yuyang Zhao
Hao Luo
F. Wang
Mike Zheng Shou
ViT
34
14
0
15 Sep 2023
Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation
Yixing Lu
Zhaoxin Fan
Min Xu
27
0
0
12 Sep 2023
Attention De-sparsification Matters: Inducing Diversity in Digital Pathology Representation Learning
S. Kapse
Srijan Das
Jingwei Zhang
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
Prateek Prasanna
25
9
0
12 Sep 2023
Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for Pitch Analysis
Jerrin Bright
Yuhao Chen
John S. Zelek
23
5
0
02 Sep 2023
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Melissa Dell
Jacob Carlson
Tom Bryan
Emily Silcock
Abhishek Arora
Zejiang Shen
Luca DÁmico-Wong
Q. Le
Pablo Querubin
Leander Heldring
AI4TS
28
12
0
24 Aug 2023
Patch Is Not All You Need
Chang-bo Li
Jie M. Zhang
Yang Wei
Zhilong Ji
Jinfeng Bai
Shiguang Shan
ViT
46
1
0
21 Aug 2023
Blind Face Restoration for Under-Display Camera via Dictionary Guided Transformer
Jingfan Tan
Xiaoxu Chen
Tao Wang
Kaihao Zhang
Wenhan Luo
Xiaocun Cao
30
11
0
20 Aug 2023
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Tobias Christian Nauen
Sebastián M. Palacio
Federico Raue
Andreas Dengel
42
3
0
18 Aug 2023
Dual Aggregation Transformer for Image Super-Resolution
Zheng Chen
Yulun Zhang
Jinjin Gu
L. Kong
Xiaokang Yang
F. I. F. Richard Yu
ViT
16
167
0
07 Aug 2023
A Voting-Stacking Ensemble of Inception Networks for Cervical Cytology Classification
Linyi Qian
Qiang Huang
Yulin Chen
Junzhou Chen
11
1
0
05 Aug 2023
EndoDepthL: Lightweight Endoscopic Monocular Depth Estimation with CNN-Transformer
Yangke Li
MedIm
21
1
0
04 Aug 2023
Deep Learning and Computer Vision for Glaucoma Detection: A Review
Mona Ashtari-Majlan
Mohammad Mahdi Dehshibi
David Masip
32
9
0
31 Jul 2023
On the unreasonable vulnerability of transformers for image restoration -- and an easy fix
Shashank Agnihotri
Kanchana Vaishnavi Gandikota
Julia Grabinski
Paramanand Chandramouli
M. Keuper
32
9
0
25 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
F. Khan
ViT
51
19
0
13 Jul 2023
MiVOLO: Multi-input Transformer for Age and Gender Estimation
Maksim Kuprashevich
Irina Tolstykh
22
32
0
10 Jul 2023
Distill-SODA: Distilling Self-Supervised Vision Transformer for Source-Free Open-Set Domain Adaptation in Computational Pathology
Guillaume Vray
Devavrat Tomar
Jean-Philippe Thiran
Behzad Bozorgtabar
MedIm
29
0
0
10 Jul 2023
EdgeFace: Efficient Face Recognition Model for Edge Devices
Anjith George
Christophe Ecabert
Hatef Otroshi-Shahreza
Ketan Kotwal
S´ebastien Marcel
CVBM
26
23
0
04 Jul 2023
Iterated Piecewise Affine (IPA) Approximation for Language Modeling
Davood Shamsi
Wenhui Hua
Brian Williams
13
0
0
21 Jun 2023
Previous
1
2
3
4
5
6
Next