2004.13621
Exploring Self-attention for Image Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
28 April 2020
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
Papers citing "Exploring Self-attention for Image Recognition"
50 / 323 papers shown
Learning Informative Attention Weights for Person Re-Identification
Yancheng Wang
Nebojsa Jojic
Yingzhen Yang
493
0
0
24 Dec 2025
SDGraph: Multi-Level Sketch Representation Learning by Sparse-Dense Graph Architecture
Xi Cheng
Pingfa Feng
Zhichao Liao
Mingyu Fan
Long Zeng
205
0
0
14 Oct 2025
Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images
Danling Cao
CVBM
3DH
3DV
489
0
0
12 Sep 2025
CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision
Puskal Khadka
Rodrigue Rizk
Longwei Wang
KC Santosh
ViT
156
0
0
10 Sep 2025
MyGO: Make your Goals Obvious, Avoiding Semantic Confusion in Prostate Cancer Lesion Region Segmentation
Zhengcheng Lin
Zuobin Ying
Zhenyu Li
Zhenyu Liu
Jian Lu
Weiping Ding
163
0
0
23 Jul 2025
GASPnet: Global Agreement to Synchronize Phases
A. Alamia
Sabine Muzellec
Thomas Serre
Rufin VanRullen
274
0
0
22 Jul 2025
Ensemble-Based Survival Models with the Self-Attended Beran Estimator Predictions
Computational Mathematics and Modeling (CMM), 2025
Lev V. Utkin
Semen P. Khomets
Vlada A. Efremenko
A. Konstantinov
Natalya M. Verbova
180
1
0
09 Jun 2025
HyperPointFormer: Multimodal Fusion in 3D Space with Dual-Branch Cross-Attention Transformers
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE J-STARS), 2025
Aldino Rizaldy
R. Gloaguen
Fabian Ewald Fassnacht
Pedram Ghamisi
3DPC
263
0
0
29 May 2025
Multimodal Fusion of Glucose Monitoring and Food Imagery for Caloric Content Prediction
Adarsh Kumar
439
0
0
13 May 2025
SignX: Continuous Sign Recognition in Compact Pose-Rich Latent Space
Sen Fang
Chunyu Sui
Hongwei Yi
C. Neidle
Dimitris N. Metaxas
SLR
465
3
0
22 Apr 2025
MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework
Zhenkai Qin
Feng Zhu
Huan Zeng
Xunyi Nong
174
0
0
18 Apr 2025
Forward Learning with Differential Privacy
Mingqian Feng
Zeliang Zhang
Jinyang Jiang
Yijie Peng
Chenliang Xu
373
0
0
01 Apr 2025
Interpretable Deep Learning Framework for Improved Disease Classification in Medical Imaging
Jutika Borah
H. Singh
OOD
UQCV
426
0
0
14 Mar 2025
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
ACM Multimedia (MM), 2024
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
319
2
0
29 Nov 2024
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
Shitong Shao
Zikai Zhou
Tian Ye
Lichen Bai
Zhiqiang Xu
Bo Han
DiffM
495
7
0
16 Nov 2024
Exploring contextual modeling with linear complexity for point cloud segmentation
Yong Xien Chng
Xuchong Qiu
Yizeng Han
Yifan Pu
Jiewei Cao
Gao Huang
Mamba
333
1
0
28 Oct 2024
Learning to rumble: Automated elephant call classification, detection and endpointing using deep architectures
Christiaan M. Geldenhuys
Thomas R. Niesler
168
4
0
15 Oct 2024
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
British Machine Vision Conference (BMVC), 2024
Kovvuri Sai Gopal Reddy
Bodduluri Saran
A. M. Adityaja
Saurabh J. Shigwan
Nitin Kumar
Snehasis Mukherjee
321
2
0
08 Oct 2024
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
International Conference on Learning Representations (ICLR), 2024
Shitong Shao
Zikai Zhou
Lichen Bai
Haoyi Xiong
Bo Han
VGen
355
5
0
05 Oct 2024
Feature Importance in Pedestrian Intention Prediction: A Context-Aware Review
Mohsen Azarmi
Mahdi Rezaei
He Wang
Ali Arabian
257
4
0
11 Sep 2024
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation
Shehan Perera
Yunus Erzurumlu
Deepak Gulati
Alper Yilmaz
ViT
MedIm
263
17
0
04 Sep 2024
Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL
Mohammad Reshadati
254
1
0
04 Sep 2024
Panoptic Perception for Autonomous Driving: A Survey
Yunge Li
Lanyu Xu
322
6
0
27 Aug 2024
PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture
IEEE Transactions on Multimedia (IEEE TMM), 2024
Qiang Zheng
Chao Zhang
Jian Sun
443
3
0
10 Aug 2024
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
ViT
462
47
0
29 Jul 2024
Rethinking Attention Module Design for Point Cloud Analysis
International Conference on Pattern Recognition (ICPR), 2024
Chengzhi Wu
Kaige Wang
Zeyun Zhong
Hao Fu
Junwei Zheng
Kailai Li
Julius Pfrommer
Jürgen Beyerer
3DPC
371
3
0
27 Jul 2024
GMT: Effective Global Framework for Multi-Camera Multi-Target Tracking
Yihao Zhen
Tinghui Zhao
Qiang Wang
Baojie Fan
Yandong Tang
Huijie Fan
397
3
0
01 Jul 2024
ATAC-Net: Zoomed view works better for Anomaly Detection
Shaurya Gupta
Neil Gautam
Anurag Malyala
262
0
0
20 Jun 2024
Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses
Neural Information Processing Systems (NeurIPS), 2024
Seungwoo Yoo
Juil Koo
Kyeongmin Yeo
Minhyuk Sung
3DH
DRL
300
4
0
14 Jun 2024
A Multimodal Dangerous State Recognition and Early Warning System for Elderly with Intermittent Dementia
Liyun Deng
Lei Jin
Guangcheng Wang
Quan Shi
Han Wang
127
1
0
30 May 2024
Towards Natural Machine Unlearning
Zhengbao He
Tao Li
Xinwen Cheng
Zhehao Huang
Xiaolin Huang
MU
443
10
0
24 May 2024
Mesh Denoising Transformer
Wenbo Zhao
Xianming Liu
Deming Zhai
Junjun Jiang
Xiangyang Ji
AI4CE
206
1
0
10 May 2024
UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
Kovvuri Sai
Bodduluri Saran
A. M. Adityaja
Saurabh J. Shigwan
Nitin Kumar
214
1
0
09 May 2024
CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks
Nick Nikzad
Yongsheng Gao
Jun Zhou
272
2
0
09 May 2024
AFter: Attention-based Fusion Router for RGBT Tracking
IEEE Transactions on Image Processing (TIP), 2024
Andong Lu
Wanyu Wang
Chenglong Li
Jin Tang
Bin Luo
278
16
0
04 May 2024
Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network
Yusra Alkendi
Rana Azzam
Sajid Javed
Lakmal Seneviratne
Yahya Zweiri
ViT
255
6
0
16 Apr 2024
Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping
Hyeongjun Kwon
Jinhyun Jang
Jin-Hwa Kim
Kwonyoung Kim
Kwanghoon Sohn
372
11
0
01 Apr 2024
Surface Reconstruction from Point Clouds via Grid-based Intersection Prediction
Hui Tian
Kai Xu
3DPC
3DV
413
3
0
21 Mar 2024
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
Abu Zahid Bin Aziz
Mokshagna Sai Teja Karanam
Tushar Kataria
Shireen Y. Elhabian
ViT
MedIm
251
6
0
16 Mar 2024
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue
Zichao Dong
Bowen Pang
Xufeng Huang
Hang Ji
Xin Zhan
Junbo Chen
3DPC
325
2
0
08 Mar 2024
ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals to Identify Epileptic Seizures
S. Rukhsar
Anil Kumar Tiwari
309
14
0
05 Mar 2024
Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation
Dipesh Gyawali
Jian Zhang
B. Karki
ViT
3DPC
206
1
0
03 Mar 2024
Parameter-efficient Prompt Learning for 3D Point Cloud Understanding
Hongyu Sun
Yongcai Wang
Wang Chen
Haoran Deng
Deying Li
VPVLM
352
13
0
24 Feb 2024
PIP-Net: Pedestrian Intention Prediction in the Wild
Mohsen Azarmi
Mahdi Rezaei
He Wang
303
24
0
20 Feb 2024
PointMamba: A Simple State Space Model for Point Cloud Analysis
Dingkang Liang
Xin Zhou
Wei Xu
Xingkui Zhu
Zhikang Zou
Xiaoqing Ye
Xinyu Wang
Xiang Bai
581
242
0
16 Feb 2024
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey
Engineering Applications of Artificial Intelligence (EAAI), 2024
Haruna Yunusa
Shiyin Qin
Abdulrahman Hamman Adama Chukkol
Abdulganiyu Abdu Yusuf
Isah Bello
A. Lawan
ViT
323
54
0
05 Feb 2024
3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework
Fan Zhang
Shuyi Mao
Qing Li
Xiaojiang Peng
3DPC
3DH
236
2
0
14 Jan 2024
Self-Attention and Hybrid Features for Replay and Deep-Fake Audio Detection
Lian Huang
Chi-Man Pun
214
14
0
11 Jan 2024
CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks
British Machine Vision Conference (BMVC), 2024
S. Howard
P. Norreys
Andreas Döpp
264
5
0
09 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Computer Vision and Pattern Recognition (CVPR), 2024
Yiran Song
Qianyu Zhou
Hefei Ling
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
588
22
0
04 Jan 2024