Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.10497
Cited By
Intriguing Properties of Vision Transformers
21 May 2021
Muzammal Naseer
Kanchana Ranasinghe
Salman Khan
Munawar Hayat
F. Khan
Ming-Hsuan Yang
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Intriguing Properties of Vision Transformers"
50 / 119 papers shown
Title
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Kunpeng Qiu
Zhiqiang Gao
Zhiying Zhou
Mingjie Sun
Yongxin Guo
MedIm
34
0
0
09 May 2025
Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video
Sonia Joseph
Praneet Suresh
Lorenz Hufe
Edward Stevinson
Robert Graham
Yash Vadi
Danilo Bzdok
Sebastian Lapuschkin
Lee Sharkey
Blake A. Richards
72
0
0
28 Apr 2025
DG-DETR: Toward Domain Generalized Detection Transformer
Seongmin Hwang
Daeyoung Han
Moongu Jeon
ViT
65
0
0
28 Apr 2025
A multi-scale vision transformer-based multimodal GeoAI model for mapping Arctic permafrost thaw
Wenwen Li
Chia-Yu Hsu
Sizhe Wang
Zhining Gu
Yili Yang
Brendan M. Rogers
A. Liljedahl
59
0
0
23 Apr 2025
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
Rupayan Mallick
Sibo Dong
Nataniel Ruiz
Sarah Adel Bargal
DiffM
47
0
0
08 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
79
0
0
03 Apr 2025
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions
Giulia Marchiori Pietrosanti
Giulio Rossolini
Alessandro Biondi
Giorgio Buttazzo
AAML
80
0
0
02 Apr 2025
Discovering Influential Neuron Path in Vision Transformers
Yifan Wang
Yifei Liu
Yingdong Shi
C. Li
Anqi Pang
Sibei Yang
Jingyi Yu
Kan Ren
ViT
69
0
0
12 Mar 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
LM&Ro
44
0
0
10 Mar 2025
Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation
Tianyang Xu
Jiyong Rao
Xiaoning Song
Zhenhua Feng
Xiao Wu
ViT
62
1
0
25 Feb 2025
Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers
Yunshan Zhong
Yuyao Zhou
Yuxin Zhang
Shen Li
Yong Li
Fei Chao
Zhanpeng Zeng
Rongrong Ji
MQ
94
0
0
31 Dec 2024
Locality Alignment Improves Vision-Language Models
Ian Covert
Tony Sun
James Y. Zou
Tatsunori Hashimoto
VLM
67
3
0
14 Oct 2024
Token Turing Machines are Efficient Vision Models
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiravathukal
James C. Davis
Yung-Hsiang Lu
88
0
0
11 Sep 2024
SUMix: Mixup with Semantic and Uncertain Information
Huafeng Qin
Xin Jin
Hongyu Zhu
Hongchao Liao
M. El-Yacoubi
Xinbo Gao
UQCV
26
5
0
10 Jul 2024
Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling
Cristian Rodriguez-Opazo
Ehsan Abbasnejad
Damien Teney
Edison Marrese-Taylor
Hamed Damirchi
A. Hengel
VLM
37
1
0
27 May 2024
StarLKNet: Star Mixup with Large Kernel Networks for Palm Vein Identification
Xin Jin
Hongyu Zhu
M. El-Yacoubi
Hongchao Liao
Huafeng Qin
Yun Jiang
35
6
0
21 May 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
38
22
0
12 Apr 2024
BruSLeAttack: A Query-Efficient Score-Based Black-Box Sparse Adversarial Attack
Viet Vo
Ehsan Abbasnejad
D. Ranasinghe
AAML
33
5
0
08 Apr 2024
Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory
Sensen Gao
Xiaojun Jia
Xuhong Ren
Ivor Tsang
Qing-Wu Guo
AAML
38
14
0
19 Mar 2024
Transformer-CNN Fused Architecture for Enhanced Skin Lesion Segmentation
Siddharth Tiwari
MedIm
ViT
34
0
0
10 Jan 2024
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie-jin Yang
Yun Gu
42
1
0
13 Dec 2023
Improving Interpretation Faithfulness for Vision Transformers
Lijie Hu
Yixin Liu
Ninghao Liu
Mengdi Huai
Lichao Sun
Di Wang
24
5
0
29 Nov 2023
Rotation Invariant Transformer for Recognizing Object in UAVs
Shuo Chen
Mang Ye
Bo Du
ViT
30
18
0
05 Nov 2023
What Makes Pre-Trained Visual Representations Successful for Robust Manipulation?
Kaylee Burns
Zach Witzel
Jubayer Ibn Hamid
Tianhe Yu
Chelsea Finn
Karol Hausman
OOD
SSL
25
22
0
03 Nov 2023
Investigating the Robustness and Properties of Detection Transformers (DETR) Toward Difficult Images
Zhao Ning Zou
Yuhang Zhang
Robert Wijaya
18
0
0
12 Oct 2023
Leveraging the Power of Data Augmentation for Transformer-based Tracking
Jie Zhao
Johan Edstedt
M. Felsberg
D. Wang
Huchuan Lu
ViT
19
4
0
15 Sep 2023
AnyOKP: One-Shot and Instance-Aware Object Keypoint Extraction with Pretrained ViT
Fangbo Qin
Taogang Hou
Shan Lin
Kaiyuan Wang
Michael C. Yip
Shan Yu
19
0
0
15 Sep 2023
Progressive Attention Guidance for Whole Slide Vulvovaginal Candidiasis Screening
Jiangdong Cai
Honglin Xiong
Mao-Hong Cao
Luyan Liu
Lichi Zhang
Qian Wang
15
4
0
06 Sep 2023
Learning Diverse Features in Vision Transformers for Improved Generalization
A. Nicolicioiu
Andrei Liviu Nicolicioiu
B. Alexe
Damien Teney
29
3
0
30 Aug 2023
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models
Dong Lu
Zhiqiang Wang
Teng Wang
Weili Guan
Hongchang Gao
Feng Zheng
AAML
51
65
0
26 Jul 2023
Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network
Chull Hwan Song
Taebaek Hwang
Jooyoung Yoon
Shunghyun Choi
Y. Gu
29
0
0
25 Jul 2023
COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts
Xiaofeng Mao
YueFeng Chen
Yao Zhu
Da Chen
Hang Su
Rong Zhang
H. Xue
ObjD
OOD
33
18
0
24 Jul 2023
Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar
Runwei Guan
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Eng Gee Lim
Jeremy S. Smith
Yong 0001Yue
Yutao Yue
VOS
26
16
0
14 Jul 2023
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization
Yumeng Li
Dan Zhang
M. Keuper
Anna Khoreva
46
10
0
02 Jul 2023
Estimating Conditional Mutual Information for Dynamic Feature Selection
S. Gadgil
Ian Covert
Su-In Lee
24
3
0
05 Jun 2023
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Bernt Schiele
Dengxin Dai
ViT
17
30
0
22 May 2023
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses
Joana Cabral Costa
Tiago Roxo
Hugo Manuel Proença
Pedro R. M. Inácio
AAML
34
49
0
18 May 2023
On enhancing the robustness of Vision Transformers: Defensive Diffusion
Raza Imam
Muhammad Huzaifa
Mohammed El-Amine Azz
MedIm
DiffM
29
5
0
14 May 2023
Permutation Equivariance of Transformers and Its Applications
Hengyuan Xu
Liyao Xiang
Hang Ye
Dixi Yao
Pengzhi Chu
Baochun Li
17
13
0
16 Apr 2023
Towards Evaluating Explanations of Vision Transformers for Medical Imaging
Piotr Komorowski
Hubert Baniecki
P. Biecek
MedIm
33
27
0
12 Apr 2023
High Dynamic Range Imaging with Context-aware Transformer
Fangfang Zhou
Dan Zhang
Zhengming Fu
ViT
29
16
0
10 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
42
11
0
06 Apr 2023
Multimodal Hyperspectral Image Classification via Interconnected Fusion
Lu Huo
Jiahao Xia
Leijie Zhang
Haimin Zhang
Min Xu
17
2
0
02 Apr 2023
Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving
Zijian Zhu
Yichi Zhang
Hai Chen
Yinpeng Dong
Shu Zhao
Wenbo Ding
Jiachen Zhong
Shibao Zheng
AAML
3DPC
17
38
0
30 Mar 2023
Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery
Mingxuan Liu
Subhankar Roy
Zhun Zhong
N. Sebe
Elisa Ricci
CLL
SSL
32
10
0
28 Mar 2023
Texture Learning Domain Randomization for Domain Generalized Segmentation
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
19
18
0
21 Mar 2023
Active Visual Exploration Based on Attention-Map Entropy
Adam Pardyl
Grzegorz Rype'sć
Grzegorz Kurzejamski
Bartosz Zieliñski
Tomasz Trzciñski
17
5
0
11 Mar 2023
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
Steerable Equivariant Representation Learning
Sangnie Bhardwaj
Willie McClinton
Tongzhou Wang
Guillaume Lajoie
Chen Sun
Phillip Isola
Dilip Krishnan
OOD
LLMSV
26
5
0
22 Feb 2023
Learning Non-Local Spatial-Angular Correlation for Light Field Image Super-Resolution
Zhengyu Liang
Yingqian Wang
Longguang Wang
Jungang Yang
Shilin Zhou
Y. Guo
34
38
0
16 Feb 2023
1
2
3
Next