Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05909
Cited By
Stand-Alone Self-Attention in Vision Models
13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stand-Alone Self-Attention in Vision Models"
50 / 588 papers shown
Title
Marine Debris Detection in Satellite Surveillance using Attention Mechanisms
Ao Shen
Yijie Zhu
Richard Jiang
95
8
0
09 Jul 2023
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers
Jakob Drachmann Havtorn
Amelie Royer
Tijmen Blankevoort
B. Bejnordi
81
8
0
05 Jul 2023
X-MLP: A Patch Embedding-Free MLP Architecture for Vision
Xinyue Wang
Zhicheng Cai
Chenglei Peng
ViT
90
5
0
02 Jul 2023
Dynamic Perceiver for Efficient Visual Recognition
Yizeng Han
Dongchen Han
Zeyu Liu
Yulin Wang
Xuran Pan
Yifan Pu
Chaorui Deng
Junlan Feng
S. Song
Gao Huang
108
30
0
20 Jun 2023
Multi-level Multiple Instance Learning with Transformer for Whole Slide Image Classification
Rui-qi Zhang
Qiaozheng Zhang
Yingzhuang Liu
Hao Xin
Yang Liu
Xinggang Wang
ViT
MedIm
122
8
0
08 Jun 2023
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
102
29
0
07 Jun 2023
Evolution: A Unified Formula for Feature Operators from a High-level Perspective
Zhicheng Cai
26
0
0
23 May 2023
Deep Multiple Instance Learning with Distance-Aware Self-Attention
Georg Wolflein
Lucie Charlotte Magister
Pietro Lio
David J. Harrison
Ognjen Arandjelovic
63
3
0
17 May 2023
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marçal Rusiñol
63
6
0
11 May 2023
PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces
Yiqun Wang
Ivan Skorokhodov
Peter Wonka
96
43
0
09 May 2023
Semantic Segmentation using Vision Transformers: A survey
Hans Thisanke
Chamli Deshan
K. Chamith
Sachith Seneviratne
Rajith Vidanaarachchi
Damayanthi Herath
ViT
87
159
0
05 May 2023
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Arunima Jaiswal
Ananya Sadana
MedIm
32
3
0
01 May 2023
DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation
Yousef Yeganeh
Azade Farshad
Peter Weinberger
Seyed-Ahmad Ahmadi
Ehsan Adeli
Nassir Navab
ViT
MedIm
54
0
0
28 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
Alex Schwing
Alex Colburn
Li Fuxin
103
12
0
24 Apr 2023
DeformableFormer: Classification of Endoscopic Ultrasound Guided Fine Needle Biopsy in Pancreatic Diseases
Taiji Kurami
T. Ishikawa
Kazuhiro Hotta
MedIm
31
0
0
21 Apr 2023
Variational Relational Point Completion Network for Robust 3D Classification
Liang Pan
Xinyi Chen
Zhongang Cai
Junzhe Zhang
Haiyu Zhao
Shuai Yi
Ziwei Liu
3DPC
3DV
64
13
0
18 Apr 2023
How Will It Drape Like? Capturing Fabric Mechanics from Depth Images
Carlos Rodriguez-Pardo
Melania Prieto-Martin
Dan Casas
Elena Garces
81
12
0
13 Apr 2023
Multi-scale Geometry-aware Transformer for 3D Point Cloud Classification
Xian Wei
Muyu Wang
S. J. Lin
Zhengyu Li
Jian Yang
Arafat Al-Jawari
Xuan Tang
3DPC
ViT
59
2
0
12 Apr 2023
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Xuran Pan
Tianzhu Ye
Zhuofan Xia
S. Song
Gao Huang
ViT
86
60
0
09 Apr 2023
RFAConv: Innovating Spatial Attention and Standard Convolutional Operation
Xinyu Zhang
Chen Liu
Degang Yang
Tingting Song
Yichen Ye
Ke Li
Ying Song
80
123
0
06 Apr 2023
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation
Jun-ho Park
Jintae Kim
Chang-Su Kim
74
22
0
05 Apr 2023
Dual Cross-Attention for Medical Image Segmentation
Gorkem Can Ates
P. Mohan
Emrah Çelik
56
85
0
30 Mar 2023
Self-positioning Point-based Transformer for Point Cloud Understanding
Jinyoung Park
S. Lee
S. Kim
Yunyang Xiong
Hyunwoo J. Kim
ViT
3DPC
80
62
0
29 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
189
139
0
29 Mar 2023
Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution
Gang Wu
Junjun Jiang
Yuanchao Bai
Xianming Liu
SupR
ViT
57
6
0
25 Mar 2023
Machine Learning for Brain Disorders: Transformers and Visual Transformers
Robin Courant
Maika Edberg
Nicolas Dufour
Vicky Kalogeiton
MedIm
ViT
58
1
0
21 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
81
3
0
16 Mar 2023
BiFormer: Vision Transformer with Bi-Level Routing Attention
Lei Zhu
Xinjiang Wang
Zhanghan Ke
Wayne Zhang
Rynson W. H. Lau
192
536
0
15 Mar 2023
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
Yongil Kim
Yerin Hwang
Hyeongu Yun
Seunghyun Yoon
Trung Bui
Kyomin Jung
61
6
0
15 Mar 2023
AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+
Tianlin Li
Ying Wang
Ziwei Xuan
Guo-Jun Qi
ViT
75
3
0
14 Mar 2023
Deep-NFA: a Deep
a
contrario
\textit{a contrario}
a contrario
Framework for Small Object Detection
Alina Ciocarlan
S. L. Hégarat-Mascle
S. Lefebvre
Arnaud Woiselle
ObjD
42
9
0
02 Mar 2023
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration
Yawei Li
Yuchen Fan
Xiaoyu Xiang
D. Demandolx
Rakesh Ranjan
Radu Timofte
Luc Van Gool
103
181
0
01 Mar 2023
Sampled Transformer for Point Sets
Shidi Li
Christian J. Walder
Alexander Soen
Lexing Xie
Miaomiao Liu
3DPC
72
1
0
28 Feb 2023
RGB-D Grasp Detection via Depth Guided Learning with Cross-modal Attention
Ran Qin
Haoxiang Ma
Bo-Bin Gao
Di Huang
64
19
0
28 Feb 2023
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Jun Yang
Lizhi Bai
Yaoru Sun
Chunqi Tian
Maoyu Mao
Guorun Wang
SSeg
64
24
0
23 Feb 2023
STB-VMM: Swin Transformer Based Video Motion Magnification
Ricard Lado-Roigé
M. A. Pérez
46
13
0
20 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
78
4
0
18 Feb 2023
Single Motion Diffusion
Sigal Raab
Inbal Leibovitch
Guy Tevet
Moab Arar
Amit H. Bermano
Daniel Cohen-Or
DiffM
VGen
137
60
0
12 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
133
37
0
09 Feb 2023
Semantic Diffusion Network for Semantic Segmentation
Hao Hao Tan
Sitong Wu
Jimin Pi
DiffM
97
33
0
04 Feb 2023
Fairness-aware Vision Transformer via Debiased Self-Attention
Yao Qiang
Chengyin Li
Prashant Khanduri
D. Zhu
ViT
126
9
0
31 Jan 2023
Flow-guided Semi-supervised Video Object Segmentation
Yushan Zhang
Andreas Robinson
M. Magnusson
Michael Felsberg
VOS
69
1
0
25 Jan 2023
HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial Images
Kun Li
G. Vosselman
M. Yang
80
7
0
23 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou
Erik Chavez
Stefanos Zafeiriou
ViT
86
56
0
12 Jan 2023
Deep Residual Axial Networks
Nazmul Shahadat
Anthony Maida
3DPC
116
6
0
11 Jan 2023
Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review
Reza Azad
Amirhossein Kazerouni
Moein Heidari
Ehsan Khodapanah Aghdam
Amir Molaei
Yiwei Jia
Abin Jose
Rijo Roy
Dorit Merhof
MedIm
ViT
118
187
0
09 Jan 2023
SLGTformer: An Attention-Based Approach to Sign Language Recognition
Neil Song
Yu Xiang
SLR
58
0
0
21 Dec 2022
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
102
6
0
16 Dec 2022
GAMMA: Generative Augmentation for Attentive Marine Debris Detection
Vaishnavi Khindkar
Janhavi Khindkar
ViT
64
1
0
07 Dec 2022
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data
Matthias Zeller
Jens Behley
Michael Heidingsfeld
C. Stachniss
95
24
0
07 Dec 2022
Previous
1
2
3
4
5
6
...
10
11
12
Next