Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.13621
Cited By
Exploring Self-attention for Image Recognition
28 April 2020
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring Self-attention for Image Recognition"
50 / 316 papers shown
Title
MixFormer: Mixing Features across Windows and Dimensions
Qiang Chen
Qiman Wu
Jian Wang
Qinghao Hu
T. Hu
Errui Ding
Jian Cheng
Jingdong Wang
MDE
ViT
31
101
0
06 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty
Liqian Ma
Stamatios Georgoulis
Xu Jia
Luc Van Gool
29
6
0
04 Apr 2022
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
N. Kim
Dongwon Kim
Cuiling Lan
Wenjun Zeng
Suha Kwak
21
136
0
31 Mar 2022
NL-FCOS: Improving FCOS through Non-Local Modules for Object Detection
Lukas Pavez
Jose M. Saavedra Rondo
ObjD
29
0
0
29 Mar 2022
Domain Invariant Siamese Attention Mask for Small Object Change Detection via Everyday Indoor Robot Navigation
Koji Takeda
Kanji Tanaka
Yoshimasa Nakamura
3DPC
19
3
0
29 Mar 2022
Stratified Transformer for 3D Point Cloud Segmentation
Xin Lai
Jianhui Liu
Li Jiang
Liwei Wang
Hengshuang Zhao
Shu-Lin Liu
Xiaojuan Qi
Jiaya Jia
3DPC
ViT
24
261
0
28 Mar 2022
A General Survey on Attention Mechanisms in Deep Learning
Gianni Brauwers
Flavius Frasincar
31
296
0
27 Mar 2022
Exploring Self-Attention for Visual Intersection Classification
Haruki Nakata
Kanji Tanaka
Koji Takeda
15
0
0
26 Mar 2022
Global Tracking Transformers
Xingyi Zhou
Tianwei Yin
V. Koltun
Philipp Krahenbuhl
VOT
24
133
0
24 Mar 2022
Lane detection with Position Embedding
Jun Xie
Jiacheng Han
Dezhen Qi
F. Chen
Kaer Huang
Jia Shuai
30
4
0
23 Mar 2022
ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer
Rui Yang
Hailong Ma
Jie Wu
Yansong Tang
Xuefeng Xiao
Min Zheng
Xiu Li
ViT
19
53
0
21 Mar 2022
CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance
Tianchen Zhao
Niansong Zhang
Xuefei Ning
He-Nan Wang
Li Yi
Yu Wang
3DPC
ViT
22
8
0
18 Mar 2022
DuMLP-Pin: A Dual-MLP-dot-product Permutation-invariant Network for Set Feature Extraction
Jiajun Fei
Ziyu Zhu
Wenlei Liu
Zhidong Deng
Mingyang Li
Huanjun Deng
Shuo Zhang
3DPC
8
6
0
08 Mar 2022
Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy for Image Recognition without Convolutions
Ruikang Ju
Ting-Yu Lin
Jen-Shiun Chiang
Jia-Hao Jian
Yu-Shian Lin
Liu-Rui-Yi Huang
ViT
16
1
0
02 Mar 2022
3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification
Dening Lu
Qian Xie
Linlin Xu
Jonathan Li
3DV
19
68
0
02 Mar 2022
Learn From the Past: Experience Ensemble Knowledge Distillation
Chaofei Wang
Shaowei Zhang
S. Song
Gao Huang
27
4
0
25 Feb 2022
Guided Visual Attention Model Based on Interactions Between Top-down and Bottom-up Information for Robot Pose Prediction
Hyogo Hiruma
Hiroki Mori
Hiroshi Ito
Tetsuya Ogata
9
0
0
21 Feb 2022
Equivariant Graph Attention Networks for Molecular Property Prediction
Tuan Le
Frank Noé
Djork-Arné Clevert
16
21
0
20 Feb 2022
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism
Guangting Wang
Yucheng Zhao
Chuanxin Tang
Chong Luo
Wenjun Zeng
22
68
0
26 Jan 2022
Attention-based Proposals Refinement for 3D Object Detection
Minh-Quan Dao
Elwan Héry
Vincent Frémont
3DPC
16
2
0
18 Jan 2022
GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings
Zhaohua Zheng
Jianfang Li
Lingjie Zhu
Honghua Li
F. Petzold
Ping Tan
15
14
0
03 Jan 2022
Few-shot Backdoor Defense Using Shapley Estimation
Jiyang Guan
Zhuozhuo Tu
Ran He
Dacheng Tao
AAML
31
53
0
30 Dec 2021
Learned Queries for Efficient Local Attention
Moab Arar
Ariel Shamir
Amit H. Bermano
ViT
38
29
0
21 Dec 2021
Lite Vision Transformer with Enhanced Self-Attention
Chenglin Yang
Yilin Wang
Jianming Zhang
He Zhang
Zijun Wei
Zhe-nan Lin
Alan Yuille
ViT
21
112
0
20 Dec 2021
A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Wuyang Chen
Xianzhi Du
Fan Yang
Lucas Beyer
Xiaohua Zhai
...
Huizhong Chen
Jing Li
Xiaodan Song
Zhangyang Wang
Denny Zhou
ViT
29
20
0
17 Dec 2021
EEG-Transformer: Self-attention from Transformer Architecture for Decoding EEG of Imagined Speech
Y. E. Lee
Seo-Hyun Lee
15
45
0
15 Dec 2021
Embracing Single Stride 3D Object Detector with Sparse Transformer
Lue Fan
Ziqi Pang
Tianyuan Zhang
Yu-xiong Wang
Hang Zhao
Feng Wang
Naiyan Wang
Zhaoxiang Zhang
ViT
27
255
0
13 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
28
3
0
10 Dec 2021
Fast Point Transformer
Chunghyun Park
Yoonwoo Jeong
Minsu Cho
Jaesik Park
3DPC
ViT
32
168
0
09 Dec 2021
PTTR: Relational 3D Point Cloud Object Tracking with Transformer
Changqing Zhou
Zhipeng Luo
Yueru Luo
Tianrui Liu
Liang Pan
Zhongang Cai
Haiyu Zhao
Shijian Lu
ViT
3DPC
27
94
0
06 Dec 2021
CTIN: Robust Contextual Transformer Network for Inertial Navigation
Bingbing Rao
Ehsan Kazemi
Yifan Ding
D. Shila
F. M. Tucker
Liqiang Wang
3DPC
37
32
0
03 Dec 2021
TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing
Bo Yan
Leilei Cao
Hongbin Wang
ViT
11
1
0
02 Dec 2021
Robust Partial-to-Partial Point Cloud Registration in a Full Range
Liang Pan
Zhongang Cai
Ziwei Liu
3DPC
28
22
0
30 Nov 2021
On the Integration of Self-Attention and Convolution
Xuran Pan
Chunjiang Ge
Rui Lu
S. Song
Guanfu Chen
Zeyi Huang
Gao Huang
SSL
41
287
0
29 Nov 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
25
6
0
26 Nov 2021
MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection
Zhenhong Sun
Ming Lin
Xiuyu Sun
Zhiyu Tan
Hao Li
Rong Jin
23
32
0
26 Nov 2021
PointMixer: MLP-Mixer for Point Cloud Understanding
Jaesung Choe
Chunghyun Park
François Rameau
Jaesik Park
In So Kweon
3DPC
39
98
0
22 Nov 2021
DuDoTrans: Dual-Domain Transformer Provides More Attention for Sinogram Restoration in Sparse-View CT Reconstruction
Ce Wang
Kun Shang
Haimiao Zhang
Qian Li
Yuan Hui
S. Kevin Zhou
ViT
MedIm
11
28
0
21 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
19
1,636
0
15 Nov 2021
Searching for TrioNet: Combining Convolution with Local and Global Self-Attention
Huaijin Pi
Huiyu Wang
Yingwei Li
Zizhang Li
Alan Yuille
ViT
21
3
0
15 Nov 2021
Full-attention based Neural Architecture Search using Context Auto-regression
Yuan Zhou
Haiyang Wang
Shuwei Huo
Boyu Wang
27
3
0
13 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
77
330
0
11 Nov 2021
Are Transformers More Robust Than CNNs?
Yutong Bai
Jieru Mei
Alan Yuille
Cihang Xie
ViT
AAML
192
257
0
10 Nov 2021
Direct Multi-view Multi-person 3D Pose Estimation
Tao Wang
Jianfeng Zhang
Yujun Cai
Shuicheng Yan
Jiashi Feng
3DH
34
89
0
07 Nov 2021
Sampling Equivariant Self-attention Networks for Object Detection in Aerial Images
Guo-Ye Yang
Xiang-Li Li
Ralph Robert Martin
Shimin Hu
3DPC
21
13
0
05 Nov 2021
Relational Self-Attention: What's Missing in Attention for Video Understanding
Manjin Kim
Heeseung Kwon
Chunyu Wang
Suha Kwak
Minsu Cho
ViT
27
28
0
02 Nov 2021
Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation
Hanz Cuevas-Velasquez
Antonio Javier Gallego
Robert B. Fisher
3DPC
22
1
0
30 Oct 2021
SOFT: Softmax-free Transformer with Linear Complexity
Jiachen Lu
Jinghan Yao
Junge Zhang
Martin Danelljan
Hang Xu
Weiguo Gao
Chunjing Xu
Thomas B. Schon
Li Zhang
18
161
0
22 Oct 2021
AIR-Nets: An Attention-Based Framework for Locally Conditioned Implicit Representations
Simon Giebenhain
Bastian Goldlücke
3DPC
26
15
0
22 Oct 2021
Multi-Stream Attention Learning for Monocular Vehicle Velocity and Inter-Vehicle Distance Estimation
Kuan-Chih Huang
Yu-Kai Huang
Winston H. Hsu
11
4
0
22 Oct 2021
Previous
1
2
3
4
5
6
7
Next