ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.16302
  4. Cited By
Rethinking Spatial Dimensions of Vision Transformers

Rethinking Spatial Dimensions of Vision Transformers

30 March 2021
Byeongho Heo
Sangdoo Yun
Dongyoon Han
Sanghyuk Chun
Junsuk Choe
Seong Joon Oh
    ViT
ArXivPDFHTML

Papers citing "Rethinking Spatial Dimensions of Vision Transformers"

50 / 307 papers shown
Title
A 2D Semantic-Aware Position Encoding for Vision Transformers
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yuyao Zhang
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
31
0
0
14 May 2025
Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer
Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer
Sainath Dey
Mitul Goswami
Jashika Sethi
Prasant Kumar Pattnaik
ViT
30
0
0
07 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
148
0
0
06 May 2025
A Simple DropConnect Approach to Transfer-based Targeted Attack
A Simple DropConnect Approach to Transfer-based Targeted Attack
Tongrui Su
Qingbin Li
Shengyu Zhu
Wei Chen
Xueqi Cheng
AAML
69
0
0
24 Apr 2025
MSAD-Net: Multiscale and Spatial Attention-based Dense Network for Lung Cancer Classification
MSAD-Net: Multiscale and Spatial Attention-based Dense Network for Lung Cancer Classification
Santanu Roy
Shweta Singh
Palak Sahu
Ashvath Suresh
Debashish Das
30
0
0
20 Apr 2025
The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability
The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability
Jiani Liu
Zhiyuan Wang
Zeliang Zhang
Chao Huang
Susan Liang
Yunlong Tang
Chenliang Xu
AAML
39
0
0
15 Apr 2025
Novel Pooling-based VGG-Lite for Pneumonia and Covid-19 Detection from Imbalanced Chest X-Ray Datasets
Novel Pooling-based VGG-Lite for Pneumonia and Covid-19 Detection from Imbalanced Chest X-Ray Datasets
Santanu Roy
Ashvath Suresh
Palak Sahu
Tulika Rudra Gupta
29
0
0
10 Apr 2025
Revisiting Funnel Transformers for Modern LLM Architectures with Comprehensive Ablations in Training and Inference Configurations
Revisiting Funnel Transformers for Modern LLM Architectures with Comprehensive Ablations in Training and Inference Configurations
DongHyun Choi
Lucas Spangher
Chris Hidey
Peter Grabowski
Ramy Eskander
AI4CE
44
0
0
02 Apr 2025
Diffusion models applied to skin and oral cancer classification
Diffusion models applied to skin and oral cancer classification
José J. M. Uliana
Renato A. Krohling
DiffM
MedIm
52
0
0
28 Mar 2025
MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation
MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation
C. Shiranthika
Zahra Hafezi Kafshgari
Hadi Hadizadeh
Parvaneh Saeedi
FedML
45
0
0
26 Mar 2025
k-NN as a Simple and Effective Estimator of Transferability
k-NN as a Simple and Effective Estimator of Transferability
Moein Sorkhei
Christos Matsoukas
Johan Fredin Haslum
Kevin Smith
57
0
0
24 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
62
0
0
21 Mar 2025
Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement
Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement
Yuchen Ren
Zhengyu Zhao
Chenhao Lin
Bo Yang
Lu Zhou
Zhe Liu
Chao Shen
ViT
47
0
0
19 Mar 2025
Fibonacci-Net: A Lightweight CNN model for Automatic Brain Tumor Classification
Fibonacci-Net: A Lightweight CNN model for Automatic Brain Tumor Classification
Santanu Roy
Ashvath Suresh
Archit Gupta
Shubhi Tiwari
Palak Sahu
Prashant Adhikari
Yuvraj S. Shekhawat
53
0
0
18 Mar 2025
Multi-Granular Multimodal Clue Fusion for Meme Understanding
Multi-Granular Multimodal Clue Fusion for Meme Understanding
Li Zheng
Hao Fei
Ting Dai
Zuquan Peng
Fei Li
Huisheng Ma
Chong Teng
Donghong Ji
60
0
0
16 Mar 2025
Revisiting Medical Image Retrieval via Knowledge Consolidation
Yang Nan
Huichi Zhou
Xiaodan Xing
G. Papanastasiou
Lei Zhu
Zhifan Gao
Alejandro F Fangi
G. Yang
44
0
0
12 Mar 2025
Boosting the Local Invariance for Better Adversarial Transferability
Bohan Liu
Xiaosen Wang
AAML
65
0
0
08 Mar 2025
VRM: Knowledge Distillation via Virtual Relation Matching
VRM: Knowledge Distillation via Virtual Relation Matching
W. Zhang
Fei Xie
Weidong Cai
Chao Ma
76
0
0
28 Feb 2025
Enhancing Adversarial Transferability via Component-Wise Transformation
Enhancing Adversarial Transferability via Component-Wise Transformation
Hangyu Liu
Bo Peng
Pengxiang Ding
Donglin Wang
Donglin Wang
AAML
52
0
0
21 Jan 2025
Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted Transferability
Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted Transferability
Hui Zeng
Sanshuai Cui
Biwei Chen
Anjie Peng
AAML
39
0
0
31 Dec 2024
One Pixel is All I Need
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
143
0
0
14 Dec 2024
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature
  Extraction and Interaction with Low-Resolution Images
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images
Xiangyong Lu
Masanori Suganuma
Takayuki Okatani
72
0
0
03 Dec 2024
Improving Transferable Targeted Attacks with Feature Tuning Mixup
Improving Transferable Targeted Attacks with Feature Tuning Mixup
K. Liang
Xuelong Dai
Yanjie Li
Dong Wang
Bin Xiao
AAML
155
0
0
23 Nov 2024
D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification
Minhee Jang
Juheon Son
Thanaporn Viriyasaranon
Junho Kim
Jang-Hwan Choi
MedIm
31
0
0
17 Nov 2024
LoFi: Neural Local Fields for Scalable Image Reconstruction
LoFi: Neural Local Fields for Scalable Image Reconstruction
AmirEhsan Khorashadizadeh
T. Liaudat
Tianlin Liu
Jason D. McEwen
Ivan Dokmanić
SupR
41
1
0
07 Nov 2024
Self-Satisfied: An end-to-end framework for SAT generation and
  prediction
Self-Satisfied: An end-to-end framework for SAT generation and prediction
Christopher R. Serrano
Jonathan Gallagher
Kenji Yamada
Alexei Kopylov
Michael A. Warren
29
0
0
18 Oct 2024
S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
S4^44ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
Yongxiang Liu
Bowen Peng
Li Liu
Xuzhao Li
113
0
0
13 Oct 2024
On the Adversarial Transferability of Generalized "Skip Connections"
On the Adversarial Transferability of Generalized "Skip Connections"
Yisen Wang
Yichuan Mo
Dongxian Wu
Mingjie Li
Xingjun Ma
Zhouchen Lin
AAML
28
2
0
11 Oct 2024
Using Interleaved Ensemble Unlearning to Keep Backdoors at Bay for
  Finetuning Vision Transformers
Using Interleaved Ensemble Unlearning to Keep Backdoors at Bay for Finetuning Vision Transformers
Zeyu Michael Li
AAML
23
0
0
01 Oct 2024
Kendall's $τ$ Coefficient for Logits Distillation
Kendall's τττ Coefficient for Logits Distillation
Yuchen Guan
Runxi Cheng
Kang Liu
Chun Yuan
33
0
0
26 Sep 2024
Agglomerative Token Clustering
Agglomerative Token Clustering
Joakim Bruslund Haurum
Sergio Escalera
Graham W. Taylor
T. Moeslund
36
1
0
18 Sep 2024
Sparks of Artificial General Intelligence(AGI) in Semiconductor Material
  Science: Early Explorations into the Next Frontier of Generative AI-Assisted
  Electron Micrograph Analysis
Sparks of Artificial General Intelligence(AGI) in Semiconductor Material Science: Early Explorations into the Next Frontier of Generative AI-Assisted Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
33
0
0
17 Sep 2024
AD-Lite Net: A Lightweight and Concatenated CNN Model for Alzheimer's
  Detection from MRI Images
AD-Lite Net: A Lightweight and Concatenated CNN Model for Alzheimer's Detection from MRI Images
Santanu Roy
Archit Gupta
Shubhi Tiwari
Palak Sahu
MedIm
24
4
0
12 Sep 2024
MVTN: A Multiscale Video Transformer Network for Hand Gesture
  Recognition
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
ViT
38
1
0
05 Sep 2024
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language
  Instruction Tuning for Semiconductor Electron Micrograph Analysis
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
43
0
0
27 Aug 2024
Multi-Modal Instruction-Tuning Small-Scale Language-and-Vision Assistant
  for Semiconductor Electron Micrograph Analysis
Multi-Modal Instruction-Tuning Small-Scale Language-and-Vision Assistant for Semiconductor Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
38
1
0
27 Aug 2024
Hierarchical Network Fusion for Multi-Modal Electron Micrograph
  Representation Learning with Foundational Large Language Models
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
35
0
0
24 Aug 2024
Preliminary Investigations of a Multi-Faceted Robust and Synergistic
  Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision
  Transformers with Large Language and Multimodal Models
Preliminary Investigations of a Multi-Faceted Robust and Synergistic Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision Transformers with Large Language and Multimodal Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
32
0
0
24 Aug 2024
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning
  Small-Scale Language-and-Vision Assistant for Enterprise Adoption
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
41
0
0
23 Aug 2024
Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes
Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes
Sakhinana Sagar Srinivas
Rajat Kumar Sarkar
Sreeja Gangasani
Venkataramana Runkana
35
2
0
21 Aug 2024
Enhancing Adversarial Transferability with Adversarial Weight Tuning
Enhancing Adversarial Transferability with Adversarial Weight Tuning
Jiahao Chen
Zhou Feng
Rui Zeng
Yuwen Pu
Chunyi Zhou
Yi Jiang
Yuyou Gan
Jinbao Li
Shouling Ji
AAML
40
0
0
18 Aug 2024
Privacy-Preserving Split Learning with Vision Transformers using
  Patch-Wise Random and Noisy CutMix
Privacy-Preserving Split Learning with Vision Transformers using Patch-Wise Random and Noisy CutMix
Yang Jin
Sihun Baek
Lei Zhang
Hyelin Nam
Praneeth Vepakomma
Ramesh Raskar
Mehdi Bennis
Seong-Lyun Kim
36
2
0
02 Aug 2024
Depth-Wise Convolutions in Vision Transformers for Efficient Training on
  Small Datasets
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
Tianxiao Zhang
Wenju Xu
Bo Luo
Guanghui Wang
ViT
MDE
40
7
0
28 Jul 2024
Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for
  Vision Transformers
Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers
Zhengang Li
Alec Lu
Yanyue Xie
Zhenglun Kong
Mengshu Sun
...
Peiyan Dong
Caiwen Ding
Yanzhi Wang
Xue Lin
Zhenman Fang
34
5
0
25 Jul 2024
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Fudong Lin
Jiadong Lou
Xu Yuan
Nianfeng Tzeng
ViT
AAML
30
1
0
22 Jul 2024
DuoFormer: Leveraging Hierarchical Visual Representations by Local and
  Global Attention
DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention
Xiaoya Tang
Bodong Zhang
Beatrice S. Knudsen
Tolga Tasdizen
ViT
MedIm
50
1
0
18 Jul 2024
Graph Transformers: A Survey
Graph Transformers: A Survey
Ahsan Shehzad
Feng Xia
Shagufta Abid
Ciyuan Peng
Shuo Yu
Dongyu Zhang
Karin Verspoor
AI4CE
34
9
0
13 Jul 2024
Improving the Transferability of Adversarial Examples by Feature
  Augmentation
Improving the Transferability of Adversarial Examples by Feature Augmentation
Donghua Wang
Wen Yao
Tingsong Jiang
Xiaohu Zheng
Junqi Wu
Xiaoqian Chen
AAML
53
0
0
09 Jul 2024
ChangeViT: Unleashing Plain Vision Transformers for Change Detection
ChangeViT: Unleashing Plain Vision Transformers for Change Detection
Duowang Zhu
Xiaohu Huang
Haiyan Huang
Zhenfeng Shao
Q. Cheng
36
8
0
18 Jun 2024
Learning 1D Causal Visual Representation with De-focus Attention
  Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
Chenxin Tao
Xizhou Zhu
Shiqian Su
Lewei Lu
Changyao Tian
...
Gao Huang
Hongsheng Li
Yu Qiao
Jie Zhou
Jifeng Dai
70
1
0
06 Jun 2024
1234567
Next