1906.05909
Stand-Alone Self-Attention in Vision Models
13 June 2019
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
Papers citing "Stand-Alone Self-Attention in Vision Models"
50 / 588 papers shown
Title
LIPT: Latency-aware Image Processing Transformer
Junbo Qiao
Wei Li
Haizhen Xie
Hanting Chen
Yunshuai Zhou
Zhijun Tu
Jie Hu
Shaohui Lin
SupR
VLM
104
2
0
09 Apr 2024
Learning Correlation Structures for Vision Transformers
Manjin Kim
Paul Hongsuck Seo
Cordelia Schmid
Minsu Cho
ViT
91
11
0
05 Apr 2024
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Jieneng Chen
Qihang Yu
Xiaohui Shen
Alan Yuille
Liang-Chieh Chen
3DV
VLM
101
29
0
02 Apr 2024
Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping
Hyeongjun Kwon
Jinhyun Jang
Jin-Hwa Kim
Kwonyoung Kim
Kwanghoon Sohn
132
3
0
01 Apr 2024
KeyPoint Relative Position Encoding for Face Recognition
Minchul Kim
Yiyang Su
Feng Liu
Anil Jain
Xiaoming Liu
CVBM
88
10
0
21 Mar 2024
Frequency Attention for Knowledge Distillation
Cuong Pham
Van-Anh Nguyen
Trung Le
Dinh Q. Phung
Gustavo Carneiro
Thanh-Toan Do
73
18
0
09 Mar 2024
Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level
Ali Hassani
Wen-mei W. Hwu
Humphrey Shi
66
9
0
07 Mar 2024
A Transformer Model for Boundary Detection in Continuous Sign Language
R. Rastgoo
Kourosh Kiani
Sergio Escalera
SLR
64
2
0
22 Feb 2024
CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory
Zexue He
Leonid Karlinsky
Donghyun Kim
Julian McAuley
Dmitry Krotov
Rogerio Feris
KELM
RALM
84
11
0
21 Feb 2024
Stochastic Spiking Attention: Accelerating Attention with Stochastic Computing in Spiking Networks
Zihang Song
Prabodh Katti
Osvaldo Simeone
Bipin Rajendran
98
3
0
14 Feb 2024
Spatially-Attentive Patch-Hierarchical Network with Adaptive Sampling for Motion Deblurring
Maitreya Suin
Kuldeep Purohit
A. N. Rajagopalan
58
0
0
09 Feb 2024
TSJNet: A Multi-modality Target and Semantic Awareness Joint-driven Image Fusion Network
Yuchan Jie
Yushen Xu
Xiaosong Li
Haishu Tan
63
7
0
02 Feb 2024
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Amirhosein Ghasemabadi
Muhammad Kamran Janjua
Mohammad Salameh
Chunhua Zhou
Fengyu Sun
Di Niu
95
12
0
26 Jan 2024
AMANet: Advancing SAR Ship Detection with Adaptive Multi-Hierarchical Attention Network
Xiaolin Ma
Junkai Cheng
Aihua Li
Yuhua Zhang
Zhilong Lin
69
2
0
24 Jan 2024
Locality enhanced dynamic biasing and sampling strategies for contextual ASR
Md. Asif Jalal
Pablo Peso Parada
George Pavlidis
Vasileios Moschopoulos
Karthikeyan P. Saravanan
...
Jisi Zhang
Anastasios Drosou
Gil Ho Lee
Jungin Lee
Seokyeong Jung
81
2
0
23 Jan 2024
Fast Registration of Photorealistic Avatars for VR Facial Animation
Chaitanya Patel
Shaojie Bai
Tenia Wang
Jason M. Saragih
S. Wei
80
0
0
19 Jan 2024
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation
Jiahui Zhong
Wenhong Tian
Yuanlun Xie
Zhijia Liu
Jie Ou
Taoran Tian
Lei Zhang
49
10
0
15 Jan 2024
3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework
Fan Zhang
Shuyi Mao
Qing Li
Xiaojiang Peng
3DPC
3DH
59
0
0
14 Jan 2024
A Temporal-Spectral Fusion Transformer with Subject-Specific Adapter for Enhancing RSVP-BCI Decoding
Xujin Li
Wei Wei
Shuang Qiu
Huiguang He
59
2
0
12 Jan 2024
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Zhaoyang Zhang
Wenqi Shao
Yixiao Ge
Xiaogang Wang
Liang Feng
Ping Luo
58
3
0
20 Dec 2023
Integrating Human Vision Perception in Vision Transformers for Classifying Waste Items
Akshat Shrivastava
Tapan K. Gandhi
63
1
0
19 Dec 2023
Delving Deeper Into Astromorphic Transformers
Md. Zesun Ahmed Mia
Malyaban Bal
Abhronil Sengupta
176
1
0
18 Dec 2023
A Comprehensive Study of Vision Transformers in Image Classification Tasks
Mahmoud Khalil
Ahmad Khalil
A. Ngom
ViT
64
10
0
02 Dec 2023
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
162
0
0
01 Dec 2023
TransCORALNet: A Two-Stream Transformer CORAL Networks for Supply Chain Credit Assessment Cold Start
Jie Shi
A. Siebes
S. Mehrkanoon
87
0
0
30 Nov 2023
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Xiaohan Ding
Yiyuan Zhang
Yixiao Ge
Sijie Zhao
Lin Song
Xiangyu Yue
Ying Shan
VLM
AI4TS
SSL
104
129
0
27 Nov 2023
Eye Disease Prediction using Ensemble Learning and Attention on OCT Scans
Gauri Naik
Nandini Narvekar
Dimple Agarwal
Nishita Nandanwar
Himangi Pande
22
6
0
26 Nov 2023
The Heat is On: Thermal Facial Landmark Tracking
James Baker
CVBM
37
0
0
14 Nov 2023
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
Sheng-Hsuan Peng
Seongmin Lee
Xiaojing Wang
Rajarajeswari Balasubramaniyan
Duen Horng Chau
ViT
LMTD
46
3
0
09 Nov 2023
p-Laplacian Transformer
Tuan Nguyen
Tam Nguyen
Vinh-Tiep Nguyen
Tan-Minh Nguyen
104
0
0
06 Nov 2023
Neural-based Compression Scheme for Solar Image Data
Ali Zafari
Atefeh Khoshkhahtinat
Jeremy A. Grajeda
P. Mehta
Nasser M. Nasrabadi
L. Boucheron
Barbara J. Thompson
M. Kirk
D. D. da Silva
50
0
0
06 Nov 2023
CCMR: High Resolution Optical Flow Estimation via Coarse-to-Fine Context-Guided Motion Reasoning
Azin Jahedi
Maximilian Luz
Marc Rivinius
Andrés Bruhn
80
5
0
05 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
75
4
0
01 Nov 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
159
15
0
16 Oct 2023
Reinforcement Learning-based Mixture of Vision Transformers for Video Violence Recognition
Hamid Reza Mohammadi
Ehsan Nazerfard
Tahereh Firoozi
ViT
64
2
0
04 Oct 2023
Superpixel Transformers for Efficient Semantic Segmentation
Xiao Han
Jieru Mei
Lu Zhang
Hang Yan
Yongkai Wu
Liang-Chieh Chen
Henrik Kretzschmar
ViT
59
11
0
28 Sep 2023
ZS6D: Zero-shot 6D Object Pose Estimation using Vision Transformers
P. Ausserlechner
David Haberger
Stefan Thalhammer
Jean-Baptiste Weibel
Markus Vincze
78
30
0
21 Sep 2023
Multi-spectral Entropy Constrained Neural Compression of Solar Imagery
Ali Zafari
Atefeh Khoshkhahtinat
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
72
0
0
19 Sep 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
81
11
0
16 Sep 2023
HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen
Xintao Wang
Wenlong Zhang
Xiangtao Kong
Yu Qiao
Jiantao Zhou
Chao Dong
101
53
0
11 Sep 2023
Short-Term Load Forecasting Using A Particle-Swarm Optimized Multi-Head Attention-Augmented CNN-LSTM Network
P. K. Quansah
Edwin Kwesi Ansah Tenkorang
AI4TS
24
0
0
07 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
91
27
0
04 Sep 2023
A comprehensive review on Plant Leaf Disease detection using Deep learning
Sumaya Mustofa
Mehedi Hasan Munna
Yousuf Rayhan Emon
Golam Rabbany
Md. Taimur Ahad
18
14
0
27 Aug 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Wenhan Luo
Hongdong Li
Zhi Jin
ViT
80
103
0
27 Aug 2023
Weakly Supervised Face and Whole Body Recognition in Turbulent Environments
Kshitij Nikhal
B. Riggan
CVBM
75
2
0
22 Aug 2023
Vision Relation Transformer for Unbiased Scene Graph Generation
Gopika Sudhakaran
Devendra Singh Dhami
Kristian Kersting
Stefan Roth
ViT
117
18
0
18 Aug 2023
Real-time Automatic M-mode Echocardiography Measurement with Panel Attention from Local-to-Global Pixels
Ching-Hsun Tseng
S. Chien
Po-Shen Wang
Shin-Jye Lee
Wei-Huan Hu
Bin Pu
Xiaojun Zeng
79
2
0
15 Aug 2023
Seed Kernel Counting using Domain Randomization and Object Tracking Neural Networks
Venkat Margapuri
Prapti Thapaliya
Michael L. Neilsen
95
0
0
10 Aug 2023
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification
Subeen Lee
WonJun Moon
Hyun Seok Seong
Jae-Pil Heo
83
1
0
28 Jul 2023
Class Attention to Regions of Lesion for Imbalanced Medical Image Recognition
Jia-Xin Zhuang
Jiabin Cai
Jianguo Zhang
Wei-Shi Zheng
Ruixuan Wang
45
11
0
19 Jul 2023