ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.05909
  4. Cited By
Stand-Alone Self-Attention in Vision Models

Stand-Alone Self-Attention in Vision Models

13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
    VLMSLRViT
ArXiv (abs)PDFHTML

Papers citing "Stand-Alone Self-Attention in Vision Models"

50 / 588 papers shown
Title
Marine Debris Detection in Satellite Surveillance using Attention
  Mechanisms
Marine Debris Detection in Satellite Surveillance using Attention Mechanisms
Ao Shen
Yijie Zhu
Richard Jiang
95
8
0
09 Jul 2023
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers
Jakob Drachmann Havtorn
Amelie Royer
Tijmen Blankevoort
B. Bejnordi
81
8
0
05 Jul 2023
X-MLP: A Patch Embedding-Free MLP Architecture for Vision
X-MLP: A Patch Embedding-Free MLP Architecture for Vision
Xinyue Wang
Zhicheng Cai
Chenglei Peng
ViT
90
5
0
02 Jul 2023
Dynamic Perceiver for Efficient Visual Recognition
Dynamic Perceiver for Efficient Visual Recognition
Yizeng Han
Dongchen Han
Zeyu Liu
Yulin Wang
Xuran Pan
Yifan Pu
Chaorui Deng
Junlan Feng
S. Song
Gao Huang
108
30
0
20 Jun 2023
Multi-level Multiple Instance Learning with Transformer for Whole Slide
  Image Classification
Multi-level Multiple Instance Learning with Transformer for Whole Slide Image Classification
Rui-qi Zhang
Qiaozheng Zhang
Yingzhuang Liu
Hao Xin
Yang Liu
Xinggang Wang
ViTMedIm
122
8
0
08 Jun 2023
Object Detection with Transformers: A Review
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViTMU
102
29
0
07 Jun 2023
Evolution: A Unified Formula for Feature Operators from a High-level
  Perspective
Evolution: A Unified Formula for Feature Operators from a High-level Perspective
Zhicheng Cai
26
0
0
23 May 2023
Deep Multiple Instance Learning with Distance-Aware Self-Attention
Deep Multiple Instance Learning with Distance-Aware Self-Attention
Georg Wolflein
Lucie Charlotte Magister
Pietro Lio
David J. Harrison
Ognjen Arandjelovic
63
3
0
17 May 2023
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document
  Image Classification
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marçal Rusiñol
63
6
0
11 May 2023
PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces
PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces
Yiqun Wang
Ivan Skorokhodov
Peter Wonka
96
43
0
09 May 2023
Semantic Segmentation using Vision Transformers: A survey
Semantic Segmentation using Vision Transformers: A survey
Hans Thisanke
Chamli Deshan
K. Chamith
Sachith Seneviratne
Rajith Vidanaarachchi
Damayanthi Herath
ViT
87
159
0
05 May 2023
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Arunima Jaiswal
Ananya Sadana
MedIm
32
3
0
01 May 2023
DIAMANT: Dual Image-Attention Map Encoders For Medical Image
  Segmentation
DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation
Yousef Yeganeh
Azade Farshad
Peter Weinberger
Seyed-Ahmad Ahmadi
Ehsan Adeli
Nassir Navab
ViTMedIm
54
0
0
28 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid
AutoFocusFormer: Image Segmentation off the Grid
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
Alex Schwing
Alex Colburn
Li Fuxin
103
12
0
24 Apr 2023
DeformableFormer: Classification of Endoscopic Ultrasound Guided Fine
  Needle Biopsy in Pancreatic Diseases
DeformableFormer: Classification of Endoscopic Ultrasound Guided Fine Needle Biopsy in Pancreatic Diseases
Taiji Kurami
T. Ishikawa
Kazuhiro Hotta
MedIm
31
0
0
21 Apr 2023
Variational Relational Point Completion Network for Robust 3D
  Classification
Variational Relational Point Completion Network for Robust 3D Classification
Liang Pan
Xinyi Chen
Zhongang Cai
Junzhe Zhang
Haiyu Zhao
Shuai Yi
Ziwei Liu
3DPC3DV
64
13
0
18 Apr 2023
How Will It Drape Like? Capturing Fabric Mechanics from Depth Images
How Will It Drape Like? Capturing Fabric Mechanics from Depth Images
Carlos Rodriguez-Pardo
Melania Prieto-Martin
Dan Casas
Elena Garces
81
12
0
13 Apr 2023
Multi-scale Geometry-aware Transformer for 3D Point Cloud Classification
Multi-scale Geometry-aware Transformer for 3D Point Cloud Classification
Xian Wei
Muyu Wang
S. J. Lin
Zhengyu Li
Jian Yang
Arafat Al-Jawari
Xuan Tang
3DPCViT
59
2
0
12 Apr 2023
Slide-Transformer: Hierarchical Vision Transformer with Local
  Self-Attention
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Xuran Pan
Tianzhu Ye
Zhuofan Xia
S. Song
Gao Huang
ViT
86
60
0
09 Apr 2023
RFAConv: Innovating Spatial Attention and Standard Convolutional
  Operation
RFAConv: Innovating Spatial Attention and Standard Convolutional Operation
Xinyu Zhang
Chen Liu
Degang Yang
Tingting Song
Yichen Ye
Ke Li
Ying Song
80
123
0
06 Apr 2023
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer
  for 4K Video Frame Interpolation
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation
Jun-ho Park
Jintae Kim
Chang-Su Kim
74
22
0
05 Apr 2023
Dual Cross-Attention for Medical Image Segmentation
Dual Cross-Attention for Medical Image Segmentation
Gorkem Can Ates
P. Mohan
Emrah Çelik
56
85
0
30 Mar 2023
Self-positioning Point-based Transformer for Point Cloud Understanding
Self-positioning Point-based Transformer for Point Cloud Understanding
Jinyoung Park
S. Lee
S. Kim
Yunyang Xiong
Hyunwoo J. Kim
ViT3DPC
80
62
0
29 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
189
139
0
29 Mar 2023
Incorporating Transformer Designs into Convolutions for Lightweight
  Image Super-Resolution
Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution
Gang Wu
Junjun Jiang
Yuanchao Bai
Xianming Liu
SupRViT
57
6
0
25 Mar 2023
Machine Learning for Brain Disorders: Transformers and Visual
  Transformers
Machine Learning for Brain Disorders: Transformers and Visual Transformers
Robin Courant
Maika Edberg
Nicolas Dufour
Vicky Kalogeiton
MedImViT
58
1
0
21 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
81
3
0
16 Mar 2023
BiFormer: Vision Transformer with Bi-Level Routing Attention
BiFormer: Vision Transformer with Bi-Level Routing Attention
Lei Zhu
Xinjiang Wang
Zhanghan Ke
Wayne Zhang
Rynson W. H. Lau
192
536
0
15 Mar 2023
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
Yongil Kim
Yerin Hwang
Hyeongu Yun
Seunghyun Yoon
Trung Bui
Kyomin Jung
61
6
0
15 Mar 2023
AdPE: Adversarial Positional Embeddings for Pretraining Vision
  Transformers via MAE+
AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+
Tianlin Li
Ying Wang
Ziwei Xuan
Guo-Jun Qi
ViT
75
3
0
14 Mar 2023
Deep-NFA: a Deep $\textit{a contrario}$ Framework for Small Object
  Detection
Deep-NFA: a Deep a contrario\textit{a contrario}a contrario Framework for Small Object Detection
Alina Ciocarlan
S. L. Hégarat-Mascle
S. Lefebvre
Arnaud Woiselle
ObjD
42
9
0
02 Mar 2023
Efficient and Explicit Modelling of Image Hierarchies for Image
  Restoration
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration
Yawei Li
Yuchen Fan
Xiaoyu Xiang
D. Demandolx
Rakesh Ranjan
Radu Timofte
Luc Van Gool
103
181
0
01 Mar 2023
Sampled Transformer for Point Sets
Sampled Transformer for Point Sets
Shidi Li
Christian J. Walder
Alexander Soen
Lexing Xie
Miaomiao Liu
3DPC
72
1
0
28 Feb 2023
RGB-D Grasp Detection via Depth Guided Learning with Cross-modal
  Attention
RGB-D Grasp Detection via Depth Guided Learning with Cross-modal Attention
Ran Qin
Haoxiang Ma
Bo-Bin Gao
Di Huang
64
19
0
28 Feb 2023
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Jun Yang
Lizhi Bai
Yaoru Sun
Chunqi Tian
Maoyu Mao
Guorun Wang
SSeg
64
24
0
23 Feb 2023
STB-VMM: Swin Transformer Based Video Motion Magnification
STB-VMM: Swin Transformer Based Video Motion Magnification
Ricard Lado-Roigé
M. A. Pérez
46
13
0
20 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
78
4
0
18 Feb 2023
Single Motion Diffusion
Single Motion Diffusion
Sigal Raab
Inbal Leibovitch
Guy Tevet
Moab Arar
Amit H. Bermano
Daniel Cohen-Or
DiffMVGen
137
60
0
12 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference
  Frames
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
133
37
0
09 Feb 2023
Semantic Diffusion Network for Semantic Segmentation
Semantic Diffusion Network for Semantic Segmentation
Hao Hao Tan
Sitong Wu
Jimin Pi
DiffM
97
33
0
04 Feb 2023
Fairness-aware Vision Transformer via Debiased Self-Attention
Fairness-aware Vision Transformer via Debiased Self-Attention
Yao Qiang
Chengyin Li
Prashant Khanduri
D. Zhu
ViT
126
9
0
31 Jan 2023
Flow-guided Semi-supervised Video Object Segmentation
Flow-guided Semi-supervised Video Object Segmentation
Yushan Zhang
Andreas Robinson
M. Magnusson
Michael Felsberg
VOS
69
1
0
25 Jan 2023
HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial
  Images
HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial Images
Kun Li
G. Vosselman
M. Yang
80
7
0
23 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou
Erik Chavez
Stefanos Zafeiriou
ViT
86
56
0
12 Jan 2023
Deep Residual Axial Networks
Deep Residual Axial Networks
Nazmul Shahadat
Anthony Maida
3DPC
116
6
0
11 Jan 2023
Advances in Medical Image Analysis with Vision Transformers: A
  Comprehensive Review
Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review
Reza Azad
Amirhossein Kazerouni
Moein Heidari
Ehsan Khodapanah Aghdam
Amir Molaei
Yiwei Jia
Abin Jose
Rijo Roy
Dorit Merhof
MedImViT
118
187
0
09 Jan 2023
SLGTformer: An Attention-Based Approach to Sign Language Recognition
SLGTformer: An Attention-Based Approach to Sign Language Recognition
Neil Song
Yu Xiang
SLR
58
0
0
21 Dec 2022
Convolution-enhanced Evolving Attention Networks
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
102
6
0
16 Dec 2022
GAMMA: Generative Augmentation for Attentive Marine Debris Detection
GAMMA: Generative Augmentation for Attentive Marine Debris Detection
Vaishnavi Khindkar
Janhavi Khindkar
ViT
64
1
0
07 Dec 2022
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data
Matthias Zeller
Jens Behley
Michael Heidingsfeld
C. Stachniss
95
24
0
07 Dec 2022
Previous
123456...101112
Next