ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.05909
  4. Cited By
Stand-Alone Self-Attention in Vision Models

Stand-Alone Self-Attention in Vision Models

13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
    VLMSLRViT
ArXiv (abs)PDFHTML

Papers citing "Stand-Alone Self-Attention in Vision Models"

50 / 588 papers shown
Title
The Self-Optimal-Transport Feature Transform
The Self-Optimal-Transport Feature Transform
Daniel Shalam
Simon Korman
OT
65
22
0
06 Apr 2022
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and
  Sparse Network
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network
Byung-Kwan Lee
Junho Kim
Y. Ro
AAML
57
20
0
06 Apr 2022
RobIn: A Robust Interpretable Deep Network for Schizophrenia Diagnosis
RobIn: A Robust Interpretable Deep Network for Schizophrenia Diagnosis
Daniel Organisciak
Hubert P. H. Shum
E. Nwoye
Wai Lok Woo
OOD
68
19
0
31 Mar 2022
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
N. Kim
Dongwon Kim
Cuiling Lan
Wenjun Zeng
Suha Kwak
189
142
0
31 Mar 2022
Integrative Few-Shot Learning for Classification and Segmentation
Integrative Few-Shot Learning for Classification and Segmentation
Dahyun Kang
Minsu Cho
VLM
99
61
0
29 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
Fan Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViTMedIm
118
28
0
24 Mar 2022
Lane detection with Position Embedding
Lane detection with Position Embedding
Jun Xie
Jiacheng Han
Dezhen Qi
F. Chen
Kaer Huang
Jia Shuai
61
5
0
23 Mar 2022
Global Matching with Overlapping Attention for Optical Flow Estimation
Global Matching with Overlapping Attention for Optical Flow Estimation
Shiyu Zhao
Long Zhao
Zhixing Zhang
Enyu Zhou
Dimitris N. Metaxas
3DPC
131
79
0
21 Mar 2022
ScalableViT: Rethinking the Context-oriented Generalization of Vision
  Transformer
ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer
Rui Yang
Hailong Ma
Jie Wu
Yansong Tang
Xuefeng Xiao
Min Zheng
Xiu Li
ViT
160
57
0
21 Mar 2022
TVConv: Efficient Translation Variant Convolution for Layout-aware
  Visual Processing
TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Jie Chen
Tianlang He
Weipeng Zhuo
Li Ma
Sangtae Ha
Shueng-Han Gary Chan
CVBM
103
25
0
20 Mar 2022
Adversarial Mutual Leakage Network for Cell Image Segmentation
Adversarial Mutual Leakage Network for Cell Image Segmentation
Hiroki Tsuda
Kazuhiro Hotta
GAN
70
0
0
20 Mar 2022
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene
  Understanding
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
Hanrong Ye
Dan Xu
ViT
71
90
0
15 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
Xinming Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian Sun
VLM
151
554
0
13 Mar 2022
Efficient Long-Range Attention Network for Image Super-resolution
Efficient Long-Range Attention Network for Image Super-resolution
Xindong Zhang
Huiyu Zeng
Shi Guo
Lei Zhang
ViT
89
303
0
13 Mar 2022
Joint Learning of Salient Object Detection, Depth Estimation and Contour
  Extraction
Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction
Xiaoqi Zhao
Youwei Pang
Lihe Zhang
Huchuan Lu
110
23
0
09 Mar 2022
Plug-and-play Shape Refinement Framework for Multi-site and Lifespan
  Brain Skull Stripping
Plug-and-play Shape Refinement Framework for Multi-site and Lifespan Brain Skull Stripping
Yunxiang Li
Ruilong Dan
Shuai Wang
Yifan Cao
Xiangde Luo
...
Gangyong Jia
Huiyu Zhou
You Zhang
Yaqi Wang
Liwen Wang
62
3
0
08 Mar 2022
Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy
  for Image Recognition without Convolutions
Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy for Image Recognition without Convolutions
Ruikang Ju
Ting-Yu Lin
Jen-Shiun Chiang
Jia-Hao Jian
Yu-Shian Lin
Liu-Rui-Yi Huang
ViT
30
2
0
02 Mar 2022
Enhancing Local Feature Learning for 3D Point Cloud Processing using
  Unary-Pairwise Attention
Enhancing Local Feature Learning for 3D Point Cloud Processing using Unary-Pairwise Attention
H. Xiu
Xin Liu
Weimin Wang
Kyoung-Sook Kim
T. Shinohara
Qiong Chang
M. Matsuoka
3DPC
59
5
0
01 Mar 2022
Equivariant Graph Attention Networks for Molecular Property Prediction
Equivariant Graph Attention Networks for Molecular Property Prediction
Tuan Le
Frank Noé
Djork-Arné Clevert
79
22
0
20 Feb 2022
Visual Attention Network
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViTVLM
140
679
0
20 Feb 2022
Patch-Based Stochastic Attention for Image Editing
Patch-Based Stochastic Attention for Image Editing
Nicolas Cherel
Andrés Almansa
Y. Gousseau
A. Newson
55
6
0
07 Feb 2022
Regression Transformer: Concurrent sequence regression and generation
  for molecular language modeling
Regression Transformer: Concurrent sequence regression and generation for molecular language modeling
Jannis Born
Matteo Manica
112
97
0
01 Feb 2022
Query Efficient Decision Based Sparse Attacks Against Black-Box Deep
  Learning Models
Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning Models
Viet Vo
Ehsan Abbasnejad
Damith C. Ranasinghe
AAML
110
14
0
31 Jan 2022
Fast Monte-Carlo Approximation of the Attention Mechanism
Fast Monte-Carlo Approximation of the Attention Mechanism
Hyunjun Kim
Jeonggil Ko
95
3
0
30 Jan 2022
Generalised Image Outpainting with U-Transformer
Generalised Image Outpainting with U-Transformer
Penglei Gao
Xi Yang
Rui Zhang
John Y. Goulermas
Yujie Geng
Yuyao Yan
Kaizhu Huang
ViT
88
17
0
27 Jan 2022
Transformers in Medical Imaging: A Survey
Transformers in Medical Imaging: A Survey
Fahad Shamshad
Salman Khan
Syed Waqas Zamir
Muhammad Haris Khan
Munawar Hayat
Fahad Shahbaz Khan
Huazhu Fu
ViTLM&MAMedIm
195
709
0
24 Jan 2022
Patches Are All You Need?
Patches Are All You Need?
Asher Trockman
J. Zico Kolter
ViT
272
414
0
24 Jan 2022
UniFormer: Unifying Convolution and Self-attention for Visual
  Recognition
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
227
383
0
24 Jan 2022
Video Transformers: A Survey
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
141
107
0
16 Jan 2022
UniFormer: Unified Transformer for Efficient Spatiotemporal
  Representation Learning
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning
Kunchang Li
Yali Wang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
137
254
0
12 Jan 2022
A ConvNet for the 2020s
A ConvNet for the 2020s
Zhuang Liu
Hanzi Mao
Chaozheng Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
ViT
202
5,261
0
10 Jan 2022
Flow-Guided Sparse Transformer for Video Deblurring
Flow-Guided Sparse Transformer for Video Deblurring
Jing Lin
Yuanhao Cai
Xiaowan Hu
Haoqian Wang
Youliang Yan
X. Zou
Henghui Ding
Yulun Zhang
Radu Timofte
Luc Van Gool
ViT
94
67
0
06 Jan 2022
GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD
  Drawings
GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings
Zhaohua Zheng
Jianfang Li
Lingjie Zhu
Honghua Li
F. Petzold
Ping Tan
47
15
0
03 Jan 2022
Dynamic Scene Video Deblurring using Non-Local Attention
Dynamic Scene Video Deblurring using Non-Local Attention
Maitreya Suin
A. N. Rajagopalan
34
0
0
01 Jan 2022
Adaptive Single Image Deblurring
Adaptive Single Image Deblurring
Maitreya Suin
Kuldeep Purohit
A. N. Rajagopalan
29
0
0
01 Jan 2022
APRIL: Finding the Achilles' Heel on Privacy for Vision Transformers
APRIL: Finding the Achilles' Heel on Privacy for Vision Transformers
Jiahao Lu
Xi Sheryl Zhang
Tianli Zhao
Xiangyu He
Jian Cheng
ViTPILM
72
23
0
28 Dec 2021
Augmenting Convolutional networks with attention-based aggregation
Augmenting Convolutional networks with attention-based aggregation
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Piotr Bojanowski
Armand Joulin
Gabriel Synnaeve
Hervé Jégou
ViT
114
49
0
27 Dec 2021
Miti-DETR: Object Detection based on Transformers with Mitigatory
  Self-Attention Convergence
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
96
15
0
26 Dec 2021
SimViT: Exploring a Simple Vision Transformer with sliding windows
SimViT: Exploring a Simple Vision Transformer with sliding windows
Gang Li
Di Xu
Xingyi Cheng
Hui Xiong
Changwen Zheng
ViT
67
16
0
24 Dec 2021
Assessing the Impact of Attention and Self-Attention Mechanisms on the
  Classification of Skin Lesions
Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions
Rafael Pedro
Arlindo L. Oliveira
53
14
0
23 Dec 2021
Learned Queries for Efficient Local Attention
Learned Queries for Efficient Local Attention
Moab Arar
Ariel Shamir
Amit H. Bermano
ViT
143
30
0
21 Dec 2021
Lite Vision Transformer with Enhanced Self-Attention
Lite Vision Transformer with Enhanced Self-Attention
Chenglin Yang
Yilin Wang
Jianming Zhang
He Zhang
Zijun Wei
Zhe Lin
Alan Yuille
ViT
79
118
0
20 Dec 2021
Improving Face-Based Age Estimation with Attention-Based Dynamic Patch
  Fusion
Improving Face-Based Age Estimation with Attention-Based Dynamic Patch Fusion
Haoyi Wang
Victor Sanchez
Chang-Tsun Li
CVBM
53
31
0
19 Dec 2021
A Simple Single-Scale Vision Transformer for Object Localization and
  Instance Segmentation
A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Wuyang Chen
Xianzhi Du
Fan Yang
Lucas Beyer
Xiaohua Zhai
...
Huizhong Chen
Jing Li
Xiaodan Song
Zhangyang Wang
Denny Zhou
ViT
111
23
0
17 Dec 2021
Efficient Visual Tracking with Exemplar Transformers
Efficient Visual Tracking with Exemplar Transformers
Philippe Blatter
Menelaos Kanakis
Martin Danelljan
Luc Van Gool
ViT
128
84
0
17 Dec 2021
Full Transformer Framework for Robust Point Cloud Registration with Deep
  Information Interaction
Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction
Guang-Sheng Chen
Meiling Wang
Yufeng Yue
Qingxiang Zhang
Li-xin Yuan
ViT
86
18
0
17 Dec 2021
Embracing Single Stride 3D Object Detector with Sparse Transformer
Embracing Single Stride 3D Object Detector with Sparse Transformer
Lue Fan
Ziqi Pang
Tianyuan Zhang
Yu-Xiong Wang
Hang Zhao
Feng Wang
Naiyan Wang
Zhaoxiang Zhang
ViT
96
269
0
13 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
84
3
0
10 Dec 2021
Spatio-temporal Relation Modeling for Few-shot Action Recognition
Spatio-temporal Relation Modeling for Few-shot Action Recognition
Anirudh Thatipelli
Sanath Narayan
Salman Khan
Rao Muhammad Anwer
Fahad Shahbaz Khan
Guohao Li
ViT
83
92
0
09 Dec 2021
3D Medical Point Transformer: Introducing Convolution to Attention
  Networks for Medical Point Cloud Analysis
3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
Jianhui Yu
Chaoyi Zhang
Heng Wang
Dingxin Zhang
Yang Song
Tiange Xiang
Dongnan Liu
Weidong (Tom) Cai
ViTMedIm
81
32
0
09 Dec 2021
Previous
123...567...101112
Next