Exploring Self-attention for Image Recognition

Computer Vision and Pattern Recognition (CVPR), 2020

28 April 2020

Papers citing "Exploring Self-attention for Image Recognition"

50 / 323 papers shown

Learning Informative Attention Weights for Person Re-Identification

Yancheng Wang

Nebojsa Jojic

Yingzhen Yang

496

24 Dec 2025

SDGraph: Multi-Level Sketch Representation Learning by Sparse-Dense Graph Architecture

216

14 Oct 2025

Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images

Danling Cao

CVBM 3DH 3DV

504

12 Sep 2025

CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision

156

10 Sep 2025

MyGO: Make your Goals Obvious, Avoiding Semantic Confusion in Prostate Cancer Lesion Region Segmentation

164

23 Jul 2025

GASPnet: Global Agreement to Synchronize Phases

281

22 Jul 2025

Ensemble-Based Survival Models with the Self-Attended Beran Estimator Predictions

Computational Mathematics and Modeling (CMM), 2025

184

09 Jun 2025

HyperPointFormer: Multimodal Fusion in 3D Space with Dual-Branch Cross-Attention Transformers

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE J-STARS), 2025

Aldino Rizaldy

R. Gloaguen

Fabian Ewald Fassnacht

Pedram Ghamisi

3DPC

267

29 May 2025

Multimodal Fusion of Glucose Monitoring and Food Imagery for Caloric Content Prediction

Adarsh Kumar

441

13 May 2025

SignX: Continuous Sign Recognition in Compact Pose-Rich Latent Space

Hongwei Yi

Hezhen Hu

Dimitris N. Metaxas

SLR

486

22 Apr 2025

MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework

182

18 Apr 2025

Forward Learning with Differential Privacy

382

01 Apr 2025

Interpretable Deep Learning Framework for Improved Disease Classification in Medical Imaging

Jutika Borah

H. Singh

OOD UQCV

459

14 Mar 2025

LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention

ACM Multimedia (MM), 2024

321

29 Nov 2024

Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer

500

16 Nov 2024

Exploring contextual modeling with linear complexity for point cloud segmentation

Gao Huang

346

28 Oct 2024

Learning to rumble: Automated elephant call classification, detection and endpointing using deep architectures

Christiaan M. Geldenhuys

Thomas R. Niesler

168

15 Oct 2024

UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters

British Machine Vision Conference (BMVC), 2024

Kovvuri Sai Gopal Reddy

321

08 Oct 2024

IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis

International Conference on Learning Representations (ICLR), 2024

Haoyi Xiong

360

05 Oct 2024

Feature Importance in Pedestrian Intention Prediction: A Context-Aware Review

257

11 Sep 2024

MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation

267

04 Sep 2024

Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL

Mohammad Reshadati

259

04 Sep 2024

Panoptic Perception for Autonomous Driving: A Survey

Yunge Li

Lanyu Xu

324

27 Aug 2024

PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture

IEEE transactions on multimedia (IEEE TMM), 2024

Qiang Zheng

Chao Zhang

Jian Sun

451

10 Aug 2024

Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images

IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024

467

29 Jul 2024

Rethinking Attention Module Design for Point Cloud Analysis

International Conference on Pattern Recognition (ICPR), 2024

393

27 Jul 2024

GMT: Effective Global Framework for Multi-Camera Multi-Target Tracking

411

01 Jul 2024

ATAC-Net: Zoomed view works better for Anomaly Detection

Shaurya Gupta

Neil Gautam

Anurag Malyala

265

20 Jun 2024

Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses

Neural Information Processing Systems (NeurIPS), 2024

307

14 Jun 2024

A Multimodal Dangerous State Recognition and Early Warning System for Elderly with Intermittent Dementia

Han Wang

135

30 May 2024

Towards Natural Machine Unlearning

446

24 May 2024

Mesh Denoising Transformer

Xianming Liu

208

10 May 2024

UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks

218

09 May 2024

CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks

Nick Nikzad

Yongsheng Gao

Jun Zhou

282

09 May 2024

AFter: Attention-based Fusion Router for RGBT Tracking

IEEE Transactions on Image Processing (TIP), 2024

Chenglong Li

280

04 May 2024

Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network

Rana Azzam

Lakmal Seneviratne

259

16 Apr 2024

Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping

374

01 Apr 2024

Surface Reconstruction from Point Clouds via Grid-based Intersection Prediction

Hui Tian

Kai Xu

3DPC 3DV

440

21 Mar 2024

EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration

Abu Zahid Bin Aziz

Mokshagna Sai Teja Karanam

Tushar Kataria

Shireen Y. Elhabian

ViT MedIm

266

16 Mar 2024

LVIC: Multi-modality segmentation by Lifting Visual Info as Cue

Xin Zhan

328

08 Mar 2024

ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals to Identify Epileptic Seizures

S. Rukhsar

Anil Kumar Tiwari

312

05 Mar 2024

Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation

213

03 Mar 2024

Parameter-efficient Prompt Learning for 3D Point Cloud Understanding

363

24 Feb 2024

PIP-Net: Pedestrian Intention Prediction in the Wild

Mohsen Azarmi

Mahdi Rezaei

He Wang

306

20 Feb 2024

PointMamba: A Simple State Space Model for Point Cloud Analysis

Dingkang Liang

Xiaoqing Ye

597

256

16 Feb 2024

Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey

Engineering applications of artificial intelligence (EAAI), 2024

Haruna Yunusa

Shiyin Qin

Abdulrahman Hamman Adama Chukkol

Abdulganiyu Abdu Yusuf

Isah Bello

A. Lawan

ViT

325

05 Feb 2024

3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework

Xiaojiang Peng

253

14 Jan 2024

Self-Attention and Hybrid Features for Replay and Deep-Fake Audio Detection

Lian Huang

Chi-Man Pun

220

11 Jan 2024

CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks

British Machine Vision Conference (BMVC), 2024

S. Howard

P. Norreys

Andreas Döpp

274

09 Jan 2024

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

Computer Vision and Pattern Recognition (CVPR), 2024

604

04 Jan 2024