Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05909
Cited By
Stand-Alone Self-Attention in Vision Models
13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stand-Alone Self-Attention in Vision Models"
50 / 588 papers shown
Title
Neural Distributed Image Compression with Cross-Attention Feature Alignment
N. Mital
Ezgi Ozyilkan
Ali Garjani
Deniz Gunduz
57
23
0
18 Jul 2022
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Nigamaa Nayakanti
Rami Al-Rfou
Aurick Zhou
Kratarth Goel
Khaled S. Refaat
Benjamin Sapp
AI4TS
135
259
0
12 Jul 2022
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin
Yen-Chen Lin
Wei-Sheng Lai
Nayeon Lee
Yichang Shih
R. Ramamoorthi
ViT
100
114
0
12 Jul 2022
Improving Domain Generalization by Learning without Forgetting: Application in Retail Checkout
Thuy C. Nguyen
N. Phan
S. T. Nguyen
OOD
56
3
0
12 Jul 2022
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
108
2
0
12 Jul 2022
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Ting Yao
Yingwei Pan
Yehao Li
Chong-Wah Ngo
Tao Mei
ViT
225
142
0
11 Jul 2022
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
233
81
0
11 Jul 2022
Snow Mask Guided Adaptive Residual Network for Image Snow Removal
Bodong Cheng
Juncheng Li
Ying-Cong Chen
Shuyi Zhang
T. Zeng
70
44
0
11 Jul 2022
Attention and Self-Attention in Random Forests
Lev V. Utkin
A. Konstantinov
71
7
0
09 Jul 2022
Cross-Attention Transformer for Video Interpolation
Hannah Kim
Shuzhi Yu
Shuai Yuan
Carlo Tomasi
ViT
70
16
0
08 Jul 2022
kMaX-DeepLab: k-means Mask Transformer
Qihang Yu
Huiyu Wang
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
179
19
0
08 Jul 2022
MaiT: Leverage Attention Masks for More Efficient Image Transformers
Ling Li
Ali Shafiee Ardestani
Joseph Hassoun
41
1
0
06 Jul 2022
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation
Samuel Cognolato
Alberto Testolin
71
7
0
06 Jul 2022
Polarized Color Image Denoising using Pocoformer
Zhuoxiao Li
Hai-bo Jiang
Yinqiang Zheng
85
3
0
01 Jul 2022
PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
Lilang Zheng
Jiaxuan Fang
Xiaorun Tang
Hanzhang Li
Jiaxin Fan
Tianyi Wang
Rui Zhou
Zhaoyan Yan
ViT
MedIm
88
2
0
30 Jun 2022
The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge
Bo Yan
Leilei Cao
Zhuang Li
Hongbin Wang
48
0
0
28 Jun 2022
Excavating RoI Attention for Underwater Object Detection
Xutao Liang
Pinhao Song
38
37
0
24 Jun 2022
Vicinity Vision Transformer
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
73
34
0
21 Jun 2022
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu
Huiyu Wang
Dahun Kim
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
MedIm
124
92
0
17 Jun 2022
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
Yuxuan Zhou
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Lei Zhang
Margret Keuper
Xia Hua
ViT
71
15
0
15 Jun 2022
Peripheral Vision Transformer
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViT
MDE
79
33
0
14 Jun 2022
Positional Label for Self-Supervised Vision Transformer
Zhemin Zhang
Xun Gong
ViT
MDE
59
6
0
10 Jun 2022
GAMR: A Guided Attention Model for (visual) Reasoning
Mohit Vaishnav
Thomas Serre
LRM
88
16
0
10 Jun 2022
SwinCheX: Multi-label classification on chest X-ray images with transformers
Sina Taslimi
Soroush Taslimi
Nima Fathi
Mohammad Salehi
M. Rohban
ViT
MedIm
83
27
0
09 Jun 2022
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Puyang Zhang
Kaihao Zhang
Wenhan Luo
Changsheng Li
Guoren Wang
CVBM
74
17
0
08 Jun 2022
Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images
Tom Ron
M. Weiler-Sagie
Tamir Hazan
FAtt
MedIm
81
6
0
06 Jun 2022
A Survey on Deep Learning for Skin Lesion Segmentation
Z. Mirikharaji
Kumar Abhishek
Alceu Bissoto
Catarina Barata
Sandra Avila
Eduardo Valle
M. Celebi
Ghassan Hamarneh
117
88
0
01 Jun 2022
Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices
Bahri Batuhan Bilecen
Alparslan Fisne
Mustafa Ayazoglu
65
2
0
01 Jun 2022
Future Transformer for Long-term Action Anticipation
Dayoung Gong
Joonseok Lee
Manjin Kim
S. Ha
Minsu Cho
AI4TS
53
66
0
27 May 2022
Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging
Yuanhao Cai
Jing Lin
Haoqian Wang
Xin Yuan
Henghui Ding
Yulun Zhang
Radu Timofte
Luc Van Gool
165
124
0
20 May 2022
VNT-Net: Rotational Invariant Vector Neuron Transformers
Hedi Zisling
Andrei Sharf
3DPC
53
1
0
19 May 2022
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous Driving Tasks
A. Pak
Hemanth Manjunatha
Dimitar Filev
Panagiotis Tsiotras
40
5
0
18 May 2022
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
116
68
0
17 May 2022
Transformers in 3D Point Clouds: A Survey
Dening Lu
Qian Xie
Mingqiang Wei
Kyle Gao
Linlin Xu
Jonathan Li
3DPC
ViT
139
53
0
16 May 2022
Dense residual Transformer for image denoising
Chao Yao
Shuo Jin
Meiqin Liu
Xiaojuan Ban
ViT
80
30
0
14 May 2022
RCMNet: A deep learning model assists CAR-T therapy for leukemia
Ruitao Zhang
Xue-ying Han
I. Gul
Shiyao Zhai
Yang Liu
...
Yuhan Dong
Lan Ma
Dongmei Yu
Jingqian Zhou
Peiwu Qin
30
32
0
06 May 2022
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification
Haowei Zhu
Wenjing Ke
Dong Li
Ji Liu
Lu Tian
Yi Shan
104
143
0
04 May 2022
Coarse-to-Fine Video Denoising with Dual-Stage Spatial-Channel Transformer
Wu Yun
Mengshi Qi
Chuanming Wang
Huiyuan Fu
Huadong Ma
ViT
71
6
0
30 Apr 2022
Learning Adaptive Warping for Real-World Rolling Shutter Correction
Ming Cao
Zhihang Zhong
Jiahao Wang
Yinqiang Zheng
Yujiu Yang
42
17
0
29 Apr 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
125
182
0
27 Apr 2022
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
Xianing Chen
Qiong Cao
Yujie Zhong
Jing Zhang
Shenghua Gao
Dacheng Tao
ViT
100
80
0
27 Apr 2022
A Spatio-Temporal Multilayer Perceptron for Gesture Recognition
Adrian Holzbock
Alexander Tsaregorodtsev
Youssef Dawoud
Klaus C. J. Dietmayer
Vasileios Belagiannis
80
12
0
25 Apr 2022
Twitter-Based Gender Recognition Using Transformers
Z. Nia
A. Ahmadi
Bruce Mellado
Jianhong Wu
J. Orbinski
A. Asgary
J. Kong
ViT
45
5
0
24 Apr 2022
Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles
Pha Nguyen
Kha Gia Quach
C. Duong
Ngan Le
Xuan-Bac Nguyen
Khoa Luu
3DPC
68
19
0
19 Apr 2022
A Convolutional-Attentional Neural Framework for Structure-Aware Performance-Score Synchronization
Ruchit Agrawal
Daniel Wolff
S. Dixon
44
7
0
19 Apr 2022
VDTR: Video Deblurring with Transformer
Ming Cao
Yanbo Fan
Yong Zhang
Jue Wang
Yujiu Yang
ViT
66
41
0
17 Apr 2022
MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction
Yuanhao Cai
Jing Lin
Zudi Lin
Haoqian Wang
Yulun Zhang
Hanspeter Pfister
Radu Timofte
Luc Van Gool
50
183
0
17 Apr 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Ajmal Mian
136
166
0
16 Apr 2022
Neighborhood Attention Transformer
Ali Hassani
Steven Walton
Jiacheng Li
Shengjia Li
Humphrey Shi
ViT
AI4TS
117
276
0
14 Apr 2022
OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Rohan Sarkar
Navaneeth Bodla
Mariya I. Vasileva
Yen-Liang Lin
Anu Beniwal
Alan Lu
Gérard Medioni
57
36
0
11 Apr 2022
Previous
1
2
3
4
5
6
...
10
11
12
Next