Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05909
Cited By
Stand-Alone Self-Attention in Vision Models
13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stand-Alone Self-Attention in Vision Models"
50 / 588 papers shown
Title
Axial-LOB: High-Frequency Trading with Axial Attention
Damian Kisiel
D. Gorse
80
8
0
04 Dec 2022
SARAS-Net: Scale and Relation Aware Siamese Network for Change Detection
Chao Chen
J. Hsieh
Ping-Yang Chen
Yi-Kuan Hsieh
Bo Wang
148
43
0
02 Dec 2022
Lightweight Structure-Aware Attention for Visual Understanding
Heeseung Kwon
F. M. Castro
M. Marín-Jiménez
N. Guil
Alahari Karteek
79
2
0
29 Nov 2022
Semantic-Aware Local-Global Vision Transformer
Jiatong Zhang
Zengwei Yao
Fanglin Chen
Guangming Lu
Wenjie Pei
ViT
52
0
0
27 Nov 2022
Spatial-Temporal Attention Network for Open-Set Fine-Grained Image Recognition
Qiulei Dong
Hong Wang
Qiulei Dong
3DPC
ViT
60
1
0
25 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Mingg-Ming Cheng
Jiashi Feng
ViT
126
141
0
22 Nov 2022
Patch-level Gaze Distribution Prediction for Gaze Following
Qiaomu Miao
Minh Hoai
Dimitris Samaras
78
16
0
20 Nov 2022
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViT
MedIm
86
36
0
18 Nov 2022
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach
Pha Nguyen
Kha Gia Quach
C. Duong
Son Lam Phung
Ngan Le
Khoa Luu
123
13
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
61
21
0
16 Nov 2022
Dual Complementary Dynamic Convolution for Image Recognition
Longbin Yan
Yunxiao Qin
Shumin Liu
Jie Chen
69
0
0
11 Nov 2022
Efficient Image Generation with Variadic Attention Heads
Steven Walton
Ali Hassani
Xingqian Xu
Zhangyang Wang
Humphrey Shi
ViT
84
23
0
10 Nov 2022
FedTP: Federated Learning by Transformer Personalization
Hongxia Li
Zhongyi Cai
Jingya Wang
Jiangnan Tang
Weiping Ding
Chin-Teng Lin
Ye-ling Shi
FedML
98
66
0
03 Nov 2022
Studying inductive biases in image classification task
N. Arizumi
59
1
0
31 Oct 2022
Relative Attention-based One-Class Adversarial Autoencoder for Continuous Authentication of Smartphone Users
Mingming Hu
Kun Zhang
Ruibang You
Bibo Tu
AAML
60
1
0
30 Oct 2022
Valuing Vicinity: Memory attention framework for context-based semantic segmentation in histopathology
Oliver Ester
Fabian Horst
C. Seibold
J. Keyl
Saskia Ting
...
P. Ivanyi
Viktor Grünwald
J. Bräsen
Jan Egger
Jens Kleesiek
51
9
0
21 Oct 2022
Scratching Visual Transformer's Back with Uniform Attention
Nam Hyeon-Woo
Kim Yu-Ji
Byeongho Heo
Doonyoon Han
Seong Joon Oh
Tae-Hyun Oh
534
23
0
16 Oct 2022
Reconstructed Student-Teacher and Discriminative Networks for Anomaly Detection
Shinji Yamada
Satoshi Kamiya
Kazuhiro Hotta
81
32
0
14 Oct 2022
SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
Pei Sun
Mingxing Tan
Weiyue Wang
Chenxi Liu
Fei Xia
Zhaoqi Leng
Drago Anguelov
ViT
81
121
0
13 Oct 2022
Vision Transformers provably learn spatial structure
Samy Jelassi
Michael E. Sander
Yuan-Fang Li
ViT
MLT
100
83
0
13 Oct 2022
ConvTransSeg: A Multi-resolution Convolution-Transformer Network for Medical Image Segmentation
Zhendi Gong
Andrew P French
Guoping Qiu
Xin Chen
ViT
MedIm
87
8
0
13 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic Segmentation
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
66
10
0
13 Oct 2022
Attention-Based Generative Neural Image Compression on Solar Dynamics Observatory
Ali Zafari
Atefeh Khoshkhahtinat
P. Mehta
Nasser M. Nasrabadi
B. Thompson
D. D. Silva
M. Kirk
64
9
0
12 Oct 2022
Centralized Feature Pyramid for Object Detection
Yu Quan
Dong Zhang
Liyan Zhang
Jinhui Tang
ObjD
111
166
0
05 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
120
66
0
04 Oct 2022
Accurate Image Restoration with Attention Retractable Transformer
Jiale Zhang
Yulun Zhang
Jinjin Gu
Yongbing Zhang
Lingyu Kong
X. Yuan
ViT
97
100
0
04 Oct 2022
Feature Embedding by Template Matching as a ResNet Block
Ada Gorgun
Y. Z. Gürbüz
A. Aydin Alatan
51
1
0
03 Oct 2022
Verifiable and Energy Efficient Medical Image Analysis with Quantised Self-attentive Deep Neural Networks
Rakshith Sathish
S. Khare
Debdoot Sheet
53
4
0
30 Sep 2022
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation
Ho Hin Lee
Shunxing Bao
Yuankai Huo
Bennett A. Landman
OOD
MedIm
159
143
0
29 Sep 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
114
73
0
29 Sep 2022
Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation
Xuesu Xiao
Tingnan Zhang
K. Choromanski
Edward J. Lee
Anthony G. Francis
...
Leila Takayama
Roy Frostig
Jie Tan
Carolina Parada
Vikas Sindhwani
153
55
0
22 Sep 2022
Towards self-attention based visual navigation in the real world
Jaime Ruiz-Serra
Jack White
Stephen M. Petrie
T. Kameneva
C. McCarthy
70
1
0
15 Sep 2022
A lightweight Transformer-based model for fish landmark detection
Alzayat Saleh
David Jones
D. Jerry
M. R. Azghadi
28
1
0
13 Sep 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Mengqi Huang
Zhendong Mao
Penghui Wang
Quang Wang
Yongdong Zhang
68
21
0
03 Sep 2022
Real-time 3D Single Object Tracking with Transformer
Jiayao Shan
Sifan Zhou
Yubo Cui
Zheng Fang
ViT
74
50
0
02 Sep 2022
HistoSeg : Quick attention with multi-loss function for multi-structure segmentation in digital histology images
Saad Wazir
M. Fraz
85
35
0
01 Sep 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
68
15
0
31 Aug 2022
MRL: Learning to Mix with Attention and Convolutions
Shlok Mohta
Hisahiro Suganuma
Yoshiki Tanaka
104
2
0
30 Aug 2022
Conviformers: Convolutionally guided Vision Transformer
Mohit Vaishnav
Thomas Fel
I. F. Rodriguez
Thomas Serre
ViT
99
1
0
17 Aug 2022
Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences
Qianying Lin
Wen-Ji Zhou
Yanshi Wang
Qing Da
Qingguo Chen
Bing Wang
VLM
39
9
0
08 Aug 2022
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation
Sebastian Lutz
R. Blythman
Koustav Ghosal
Matthew Moynihan
C. Simms
A. Smolic
ViT
90
15
0
07 Aug 2022
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
É. Mathian
H. Liu
L. Fernandez-Cuesta
Dimitris Samaras
M. Foll
L. Chen
ViT
90
12
0
06 Aug 2022
PointConvFormer: Revenge of the Point-based Convolution
Wenxuan Wu
Li Fuxin
Qi Shan
3DPC
84
31
0
04 Aug 2022
SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling
Ho Man Kwan
Shenghui Song
46
1
0
23 Jul 2022
Orientation and Context Entangled Network for Retinal Vessel Segmentation
Xinxu Wei
Kaifu Yang
D. Bzdok
Y. Li
56
35
0
23 Jul 2022
SplitMixer: Fat Trimmed From MLP-like Models
Ali Borji
Sikun Lin
46
3
0
21 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
72
1
0
20 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
99
8
0
19 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
75
33
0
18 Jul 2022
Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross Modulation
Hegui Zhu
Zhan Gao
Jiayi Wang
Yangqiaoyu Zhou
Chengqing Li
108
7
0
18 Jul 2022
Previous
1
2
3
4
5
...
10
11
12
Next