ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.05909
  4. Cited By
Stand-Alone Self-Attention in Vision Models

Stand-Alone Self-Attention in Vision Models

13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
    VLMSLRViT
ArXiv (abs)PDFHTML

Papers citing "Stand-Alone Self-Attention in Vision Models"

50 / 588 papers shown
Title
Neural Distributed Image Compression with Cross-Attention Feature
  Alignment
Neural Distributed Image Compression with Cross-Attention Feature Alignment
N. Mital
Ezgi Ozyilkan
Ali Garjani
Deniz Gunduz
57
23
0
18 Jul 2022
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Nigamaa Nayakanti
Rami Al-Rfou
Aurick Zhou
Kratarth Goel
Khaled S. Refaat
Benjamin Sapp
AI4TS
135
259
0
12 Jul 2022
Vision Transformer for NeRF-Based View Synthesis from a Single Input
  Image
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin
Yen-Chen Lin
Wei-Sheng Lai
Nayeon Lee
Yichang Shih
R. Ramamoorthi
ViT
100
114
0
12 Jul 2022
Improving Domain Generalization by Learning without Forgetting:
  Application in Retail Checkout
Improving Domain Generalization by Learning without Forgetting: Application in Retail Checkout
Thuy C. Nguyen
N. Phan
S. T. Nguyen
OOD
56
3
0
12 Jul 2022
Efficient Human Vision Inspired Action Recognition using Adaptive
  Spatiotemporal Sampling
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
108
2
0
12 Jul 2022
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation
  Learning
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Ting Yao
Yingwei Pan
Yehao Li
Chong-Wah Ngo
Tao Mei
ViT
225
142
0
11 Jul 2022
Dual Vision Transformer
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
233
81
0
11 Jul 2022
Snow Mask Guided Adaptive Residual Network for Image Snow Removal
Snow Mask Guided Adaptive Residual Network for Image Snow Removal
Bodong Cheng
Juncheng Li
Ying-Cong Chen
Shuyi Zhang
T. Zeng
70
44
0
11 Jul 2022
Attention and Self-Attention in Random Forests
Attention and Self-Attention in Random Forests
Lev V. Utkin
A. Konstantinov
71
7
0
09 Jul 2022
Cross-Attention Transformer for Video Interpolation
Cross-Attention Transformer for Video Interpolation
Hannah Kim
Shuzhi Yu
Shuai Yuan
Carlo Tomasi
ViT
70
16
0
08 Jul 2022
kMaX-DeepLab: k-means Mask Transformer
kMaX-DeepLab: k-means Mask Transformer
Qihang Yu
Huiyu Wang
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
179
19
0
08 Jul 2022
MaiT: Leverage Attention Masks for More Efficient Image Transformers
MaiT: Leverage Attention Masks for More Efficient Image Transformers
Ling Li
Ali Shafiee Ardestani
Joseph Hassoun
41
1
0
06 Jul 2022
Transformers discover an elementary calculation system exploiting local
  attention and grid-like problem representation
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation
Samuel Cognolato
Alberto Testolin
71
7
0
06 Jul 2022
Polarized Color Image Denoising using Pocoformer
Zhuoxiao Li
Hai-bo Jiang
Yinqiang Zheng
85
3
0
01 Jul 2022
PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
Lilang Zheng
Jiaxuan Fang
Xiaorun Tang
Hanzhang Li
Jiaxin Fan
Tianyi Wang
Rui Zhou
Zhaoyan Yan
ViTMedIm
88
2
0
30 Jun 2022
The Third Place Solution for CVPR2022 AVA Accessibility Vision and
  Autonomy Challenge
The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge
Bo Yan
Leilei Cao
Zhuang Li
Hongbin Wang
48
0
0
28 Jun 2022
Excavating RoI Attention for Underwater Object Detection
Excavating RoI Attention for Underwater Object Detection
Xutao Liang
Pinhao Song
38
37
0
24 Jun 2022
Vicinity Vision Transformer
Vicinity Vision Transformer
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
73
34
0
21 Jun 2022
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu
Huiyu Wang
Dahun Kim
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViTMedIm
124
92
0
17 Jun 2022
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
Yuxuan Zhou
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Lei Zhang
Margret Keuper
Xia Hua
ViT
71
15
0
15 Jun 2022
Peripheral Vision Transformer
Peripheral Vision Transformer
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViTMDE
79
33
0
14 Jun 2022
Positional Label for Self-Supervised Vision Transformer
Positional Label for Self-Supervised Vision Transformer
Zhemin Zhang
Xun Gong
ViTMDE
59
6
0
10 Jun 2022
GAMR: A Guided Attention Model for (visual) Reasoning
GAMR: A Guided Attention Model for (visual) Reasoning
Mohit Vaishnav
Thomas Serre
LRM
88
16
0
10 Jun 2022
SwinCheX: Multi-label classification on chest X-ray images with
  transformers
SwinCheX: Multi-label classification on chest X-ray images with transformers
Sina Taslimi
Soroush Taslimi
Nima Fathi
Mohammad Salehi
M. Rohban
ViTMedIm
83
27
0
09 Jun 2022
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Puyang Zhang
Kaihao Zhang
Wenhan Luo
Changsheng Li
Guoren Wang
CVBM
74
17
0
08 Jun 2022
Dual Decomposition of Convex Optimization Layers for Consistent
  Attention in Medical Images
Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images
Tom Ron
M. Weiler-Sagie
Tamir Hazan
FAttMedIm
81
6
0
06 Jun 2022
A Survey on Deep Learning for Skin Lesion Segmentation
A Survey on Deep Learning for Skin Lesion Segmentation
Z. Mirikharaji
Kumar Abhishek
Alceu Bissoto
Catarina Barata
Sandra Avila
Eduardo Valle
M. Celebi
Ghassan Hamarneh
117
88
0
01 Jun 2022
Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for
  Edge Devices
Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices
Bahri Batuhan Bilecen
Alparslan Fisne
Mustafa Ayazoglu
65
2
0
01 Jun 2022
Future Transformer for Long-term Action Anticipation
Future Transformer for Long-term Action Anticipation
Dayoung Gong
Joonseok Lee
Manjin Kim
S. Ha
Minsu Cho
AI4TS
53
66
0
27 May 2022
Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral
  Compressive Imaging
Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging
Yuanhao Cai
Jing Lin
Haoqian Wang
Xin Yuan
Henghui Ding
Yulun Zhang
Radu Timofte
Luc Van Gool
165
124
0
20 May 2022
VNT-Net: Rotational Invariant Vector Neuron Transformers
VNT-Net: Rotational Invariant Vector Neuron Transformers
Hedi Zisling
Andrei Sharf
3DPC
53
1
0
19 May 2022
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous
  Driving Tasks
CARNet: A Dynamic Autoencoder for Learning Latent Dynamics in Autonomous Driving Tasks
A. Pak
Hemanth Manjunatha
Dimitar Filev
Panagiotis Tsiotras
40
5
0
18 May 2022
MulT: An End-to-End Multitask Learning Transformer
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
116
68
0
17 May 2022
Transformers in 3D Point Clouds: A Survey
Transformers in 3D Point Clouds: A Survey
Dening Lu
Qian Xie
Mingqiang Wei
Kyle Gao
Linlin Xu
Jonathan Li
3DPCViT
139
53
0
16 May 2022
Dense residual Transformer for image denoising
Dense residual Transformer for image denoising
Chao Yao
Shuo Jin
Meiqin Liu
Xiaojuan Ban
ViT
80
30
0
14 May 2022
RCMNet: A deep learning model assists CAR-T therapy for leukemia
RCMNet: A deep learning model assists CAR-T therapy for leukemia
Ruitao Zhang
Xue-ying Han
I. Gul
Shiyao Zhai
Yang Liu
...
Yuhan Dong
Lan Ma
Dongmei Yu
Jingqian Zhou
Peiwu Qin
30
32
0
06 May 2022
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and
  Object Re-Identification
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification
Haowei Zhu
Wenjing Ke
Dong Li
Ji Liu
Lu Tian
Yi Shan
104
143
0
04 May 2022
Coarse-to-Fine Video Denoising with Dual-Stage Spatial-Channel
  Transformer
Coarse-to-Fine Video Denoising with Dual-Stage Spatial-Channel Transformer
Wu Yun
Mengshi Qi
Chuanming Wang
Huiyuan Fu
Huadong Ma
ViT
71
6
0
30 Apr 2022
Learning Adaptive Warping for Real-World Rolling Shutter Correction
Learning Adaptive Warping for Real-World Rolling Shutter Correction
Ming Cao
Zhihang Zhong
Jiahao Wang
Yinqiang Zheng
Yujiu Yang
42
17
0
29 Apr 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
125
182
0
27 Apr 2022
DearKD: Data-Efficient Early Knowledge Distillation for Vision
  Transformers
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
Xianing Chen
Qiong Cao
Yujie Zhong
Jing Zhang
Shenghua Gao
Dacheng Tao
ViT
100
80
0
27 Apr 2022
A Spatio-Temporal Multilayer Perceptron for Gesture Recognition
A Spatio-Temporal Multilayer Perceptron for Gesture Recognition
Adrian Holzbock
Alexander Tsaregorodtsev
Youssef Dawoud
Klaus C. J. Dietmayer
Vasileios Belagiannis
80
12
0
25 Apr 2022
Twitter-Based Gender Recognition Using Transformers
Twitter-Based Gender Recognition Using Transformers
Z. Nia
A. Ahmadi
Bruce Mellado
Jianhong Wu
J. Orbinski
A. Asgary
J. Kong
ViT
45
5
0
24 Apr 2022
Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous
  Vehicles
Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles
Pha Nguyen
Kha Gia Quach
C. Duong
Ngan Le
Xuan-Bac Nguyen
Khoa Luu
3DPC
68
19
0
19 Apr 2022
A Convolutional-Attentional Neural Framework for Structure-Aware
  Performance-Score Synchronization
A Convolutional-Attentional Neural Framework for Structure-Aware Performance-Score Synchronization
Ruchit Agrawal
Daniel Wolff
S. Dixon
44
7
0
19 Apr 2022
VDTR: Video Deblurring with Transformer
VDTR: Video Deblurring with Transformer
Ming Cao
Yanbo Fan
Yong Zhang
Jue Wang
Yujiu Yang
ViT
66
41
0
17 Apr 2022
MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral
  Reconstruction
MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction
Yuanhao Cai
Jing Lin
Zudi Lin
Haoqian Wang
Yulun Zhang
Hanspeter Pfister
Radu Timofte
Luc Van Gool
50
183
0
17 Apr 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Visual Attention Methods in Deep Learning: An In-Depth Survey
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Ajmal Mian
136
166
0
16 Apr 2022
Neighborhood Attention Transformer
Neighborhood Attention Transformer
Ali Hassani
Steven Walton
Jiacheng Li
Shengjia Li
Humphrey Shi
ViTAI4TS
117
276
0
14 Apr 2022
OutfitTransformer: Learning Outfit Representations for Fashion
  Recommendation
OutfitTransformer: Learning Outfit Representations for Fashion Recommendation
Rohan Sarkar
Navaneeth Bodla
Mariya I. Vasileva
Yen-Liang Lin
Anu Beniwal
Alan Lu
Gérard Medioni
57
36
0
11 Apr 2022
Previous
123456...101112
Next