ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.05909
  4. Cited By
Stand-Alone Self-Attention in Vision Models

Stand-Alone Self-Attention in Vision Models

13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
    VLMSLRViT
ArXiv (abs)PDFHTML

Papers citing "Stand-Alone Self-Attention in Vision Models"

50 / 588 papers shown
Title
Axial-LOB: High-Frequency Trading with Axial Attention
Axial-LOB: High-Frequency Trading with Axial Attention
Damian Kisiel
D. Gorse
80
8
0
04 Dec 2022
SARAS-Net: Scale and Relation Aware Siamese Network for Change Detection
SARAS-Net: Scale and Relation Aware Siamese Network for Change Detection
Chao Chen
J. Hsieh
Ping-Yang Chen
Yi-Kuan Hsieh
Bo Wang
148
43
0
02 Dec 2022
Lightweight Structure-Aware Attention for Visual Understanding
Lightweight Structure-Aware Attention for Visual Understanding
Heeseung Kwon
F. M. Castro
M. Marín-Jiménez
N. Guil
Alahari Karteek
79
2
0
29 Nov 2022
Semantic-Aware Local-Global Vision Transformer
Semantic-Aware Local-Global Vision Transformer
Jiatong Zhang
Zengwei Yao
Fanglin Chen
Guangming Lu
Wenjie Pei
ViT
52
0
0
27 Nov 2022
Spatial-Temporal Attention Network for Open-Set Fine-Grained Image
  Recognition
Spatial-Temporal Attention Network for Open-Set Fine-Grained Image Recognition
Qiulei Dong
Hong Wang
Qiulei Dong
3DPCViT
60
1
0
25 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Mingg-Ming Cheng
Jiashi Feng
ViT
126
141
0
22 Nov 2022
Patch-level Gaze Distribution Prediction for Gaze Following
Patch-level Gaze Distribution Prediction for Gaze Following
Qiaomu Miao
Minh Hoai
Dimitris Samaras
78
16
0
20 Nov 2022
Vision Transformers in Medical Imaging: A Review
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViTMedIm
86
36
0
18 Nov 2022
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global
  Association Approach
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach
Pha Nguyen
Kha Gia Quach
C. Duong
Son Lam Phung
Ngan Le
Khoa Luu
123
13
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
61
21
0
16 Nov 2022
Dual Complementary Dynamic Convolution for Image Recognition
Dual Complementary Dynamic Convolution for Image Recognition
Longbin Yan
Yunxiao Qin
Shumin Liu
Jie Chen
69
0
0
11 Nov 2022
Efficient Image Generation with Variadic Attention Heads
Efficient Image Generation with Variadic Attention Heads
Steven Walton
Ali Hassani
Xingqian Xu
Zhangyang Wang
Humphrey Shi
ViT
84
23
0
10 Nov 2022
FedTP: Federated Learning by Transformer Personalization
FedTP: Federated Learning by Transformer Personalization
Hongxia Li
Zhongyi Cai
Jingya Wang
Jiangnan Tang
Weiping Ding
Chin-Teng Lin
Ye-ling Shi
FedML
98
66
0
03 Nov 2022
Studying inductive biases in image classification task
Studying inductive biases in image classification task
N. Arizumi
59
1
0
31 Oct 2022
Relative Attention-based One-Class Adversarial Autoencoder for
  Continuous Authentication of Smartphone Users
Relative Attention-based One-Class Adversarial Autoencoder for Continuous Authentication of Smartphone Users
Mingming Hu
Kun Zhang
Ruibang You
Bibo Tu
AAML
60
1
0
30 Oct 2022
Valuing Vicinity: Memory attention framework for context-based semantic
  segmentation in histopathology
Valuing Vicinity: Memory attention framework for context-based semantic segmentation in histopathology
Oliver Ester
Fabian Horst
C. Seibold
J. Keyl
Saskia Ting
...
P. Ivanyi
Viktor Grünwald
J. Bräsen
Jan Egger
Jens Kleesiek
51
9
0
21 Oct 2022
Scratching Visual Transformer's Back with Uniform Attention
Scratching Visual Transformer's Back with Uniform Attention
Nam Hyeon-Woo
Kim Yu-Ji
Byeongho Heo
Doonyoon Han
Seong Joon Oh
Tae-Hyun Oh
534
23
0
16 Oct 2022
Reconstructed Student-Teacher and Discriminative Networks for Anomaly
  Detection
Reconstructed Student-Teacher and Discriminative Networks for Anomaly Detection
Shinji Yamada
Satoshi Kamiya
Kazuhiro Hotta
81
32
0
14 Oct 2022
SWFormer: Sparse Window Transformer for 3D Object Detection in Point
  Clouds
SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
Pei Sun
Mingxing Tan
Weiyue Wang
Chenxi Liu
Fei Xia
Zhaoqi Leng
Drago Anguelov
ViT
81
121
0
13 Oct 2022
Vision Transformers provably learn spatial structure
Vision Transformers provably learn spatial structure
Samy Jelassi
Michael E. Sander
Yuan-Fang Li
ViTMLT
100
83
0
13 Oct 2022
ConvTransSeg: A Multi-resolution Convolution-Transformer Network for
  Medical Image Segmentation
ConvTransSeg: A Multi-resolution Convolution-Transformer Network for Medical Image Segmentation
Zhendi Gong
Andrew P French
Guoping Qiu
Xin Chen
ViTMedIm
87
8
0
13 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic
  Segmentation
DCANet: Differential Convolution Attention Network for RGB-D Semantic Segmentation
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
66
10
0
13 Oct 2022
Attention-Based Generative Neural Image Compression on Solar Dynamics
  Observatory
Attention-Based Generative Neural Image Compression on Solar Dynamics Observatory
Ali Zafari
Atefeh Khoshkhahtinat
P. Mehta
Nasser M. Nasrabadi
B. Thompson
D. D. Silva
M. Kirk
64
9
0
12 Oct 2022
Centralized Feature Pyramid for Object Detection
Centralized Feature Pyramid for Object Detection
Yu Quan
Dong Zhang
Liyan Zhang
Jinhui Tang
ObjD
111
166
0
05 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision
  Models
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViTMoE
120
66
0
04 Oct 2022
Accurate Image Restoration with Attention Retractable Transformer
Accurate Image Restoration with Attention Retractable Transformer
Jiale Zhang
Yulun Zhang
Jinjin Gu
Yongbing Zhang
Lingyu Kong
X. Yuan
ViT
97
100
0
04 Oct 2022
Feature Embedding by Template Matching as a ResNet Block
Feature Embedding by Template Matching as a ResNet Block
Ada Gorgun
Y. Z. Gürbüz
A. Aydin Alatan
51
1
0
03 Oct 2022
Verifiable and Energy Efficient Medical Image Analysis with Quantised
  Self-attentive Deep Neural Networks
Verifiable and Energy Efficient Medical Image Analysis with Quantised Self-attentive Deep Neural Networks
Rakshith Sathish
S. Khare
Debdoot Sheet
53
4
0
30 Sep 2022
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical
  Transformer for Medical Image Segmentation
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation
Ho Hin Lee
Shunxing Bao
Yuankai Huo
Bennett A. Landman
OODMedIm
159
143
0
29 Sep 2022
Dilated Neighborhood Attention Transformer
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViTMedIm
114
73
0
29 Sep 2022
Learning Model Predictive Controllers with Real-Time Attention for
  Real-World Navigation
Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation
Xuesu Xiao
Tingnan Zhang
K. Choromanski
Edward J. Lee
Anthony G. Francis
...
Leila Takayama
Roy Frostig
Jie Tan
Carolina Parada
Vikas Sindhwani
153
55
0
22 Sep 2022
Towards self-attention based visual navigation in the real world
Towards self-attention based visual navigation in the real world
Jaime Ruiz-Serra
Jack White
Stephen M. Petrie
T. Kameneva
C. McCarthy
70
1
0
15 Sep 2022
A lightweight Transformer-based model for fish landmark detection
A lightweight Transformer-based model for fish landmark detection
Alzayat Saleh
David Jones
D. Jerry
M. R. Azghadi
28
1
0
13 Sep 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for
  Text-to-Image Generation
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Mengqi Huang
Zhendong Mao
Penghui Wang
Quang Wang
Yongdong Zhang
68
21
0
03 Sep 2022
Real-time 3D Single Object Tracking with Transformer
Real-time 3D Single Object Tracking with Transformer
Jiayao Shan
Sifan Zhou
Yubo Cui
Zheng Fang
ViT
74
50
0
02 Sep 2022
HistoSeg : Quick attention with multi-loss function for multi-structure
  segmentation in digital histology images
HistoSeg : Quick attention with multi-loss function for multi-structure segmentation in digital histology images
Saad Wazir
M. Fraz
85
35
0
01 Sep 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for
  Visual Recognition
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
68
15
0
31 Aug 2022
MRL: Learning to Mix with Attention and Convolutions
MRL: Learning to Mix with Attention and Convolutions
Shlok Mohta
Hisahiro Suganuma
Yoshiki Tanaka
104
2
0
30 Aug 2022
Conviformers: Convolutionally guided Vision Transformer
Conviformers: Convolutionally guided Vision Transformer
Mohit Vaishnav
Thomas Fel
I. F. Rodriguez
Thomas Serre
ViT
99
1
0
17 Aug 2022
Sparse Attentive Memory Network for Click-through Rate Prediction with
  Long Sequences
Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences
Qianying Lin
Wen-Ji Zhou
Yanshi Wang
Qing Da
Qingguo Chen
Bing Wang
VLM
39
9
0
08 Aug 2022
Jointformer: Single-Frame Lifting Transformer with Error Prediction and
  Refinement for 3D Human Pose Estimation
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation
Sebastian Lutz
R. Blythman
Koustav Ghosal
Matthew Moynihan
C. Simms
A. Smolic
ViT
90
15
0
07 Aug 2022
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly
  Detection and Localization
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
É. Mathian
H. Liu
L. Fernandez-Cuesta
Dimitris Samaras
M. Foll
L. Chen
ViT
90
12
0
06 Aug 2022
PointConvFormer: Revenge of the Point-based Convolution
PointConvFormer: Revenge of the Point-based Convolution
Wenxuan Wu
Li Fuxin
Qi Shan
3DPC
84
31
0
04 Aug 2022
SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling
SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling
Ho Man Kwan
Shenghui Song
46
1
0
23 Jul 2022
Orientation and Context Entangled Network for Retinal Vessel
  Segmentation
Orientation and Context Entangled Network for Retinal Vessel Segmentation
Xinxu Wei
Kaifu Yang
D. Bzdok
Y. Li
56
35
0
23 Jul 2022
SplitMixer: Fat Trimmed From MLP-like Models
SplitMixer: Fat Trimmed From MLP-like Models
Ali Borji
Sikun Lin
46
3
0
21 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
72
1
0
20 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
Vision Transformers: From Semantic Segmentation to Dense Prediction
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
99
8
0
19 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
75
33
0
18 Jul 2022
Few-shot Fine-grained Image Classification via Multi-Frequency
  Neighborhood and Double-cross Modulation
Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross Modulation
Hegui Zhu
Zhan Gao
Jiayi Wang
Yangqiaoyu Zhou
Chengqing Li
108
7
0
18 Jul 2022
Previous
12345...101112
Next