ResearchTrend.AI
arXiv: 2105.10497
Intriguing Properties of Vision Transformers

21 May 2021
Muzammal Naseer
Kanchana Ranasinghe
Salman Khan
Munawar Hayat
F. Khan
Ming-Hsuan Yang
    ViT

Papers citing "Intriguing Properties of Vision Transformers"

50 / 136 papers shown
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
Generic-to-Specific Distillation of Masked Autoencoders
Wei Huang
Zhiliang Peng
Li Dong
Furu Wei
Jianbin Jiao
QiXiang Ye
32
22
0
28 Feb 2023
Steerable Equivariant Representation Learning
Sangnie Bhardwaj
Willie McClinton
Tongzhou Wang
Guillaume Lajoie
Chen Sun
Phillip Isola
Dilip Krishnan
OOD
LLMSV
34
5
0
22 Feb 2023
Learning Non-Local Spatial-Angular Correlation for Light Field Image Super-Resolution
Zhengyu Liang
Yingqian Wang
Longguang Wang
Jungang Yang
Shilin Zhou
Y. Guo
42
38
0
16 Feb 2023
Inference Time Evidences of Adversarial Attacks for Forensic on Transformers
Hugo Lemarchant
Liang Li
Yiming Qian
Yuta Nakashima
Hajime Nagahara
ViT
AAML
43
0
0
31 Jan 2023
Discovering and Mitigating Visual Biases through Keyword Explanation
Younghyun Kim
Sangwoo Mo
Minkyu Kim
Kyungmin Lee
Jaeho Lee
Jinwoo Shin
40
32
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
37
2
0
25 Jan 2023
Comparing the Decision-Making Mechanisms by Transformers and CNNs via Explanation Methods
Ming-Xiu Jiang
Saeed Khorram
Li Fuxin
FAtt
22
9
0
13 Dec 2022
EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder
Xiaoshui Huang
Zhou Huang
Shengjia Li
Wentao Qu
Tong He
Yuenan Hou
Yifan Zuo
Wanli Ouyang
13
11
0
08 Dec 2022
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning
A. Piergiovanni
Weicheng Kuo
A. Angelova
ViT
36
54
0
06 Dec 2022
Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases
Mazda Moayeri
Wenxiao Wang
Sahil Singla
S. Feizi
69
14
0
05 Dec 2022
Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing
Nataniel Ruiz
Sarah Adel Bargal
Cihang Xie
Kate Saenko
Stan Sclaroff
ViT
36
5
0
29 Nov 2022
LUMix: Improving Mixup by Better Modelling Label Uncertainty
Shuyang Sun
Jieneng Chen
Ruifei He
Alan Yuille
Philip H. S. Torr
Song Bai
UQCV
NoLa
18
5
0
29 Nov 2022
FedTune: A Deep Dive into Efficient Federated Fine-Tuning with Pre-trained Transformers
Jinyu Chen
Wenchao Xu
Song Guo
Junxiao Wang
Jie Zhang
Yining Qi
FedML
28
32
0
15 Nov 2022
MARLIN: Masked Autoencoder for facial video Representation LearnINg
Zhixi Cai
Shreya Ghosh
Kalin Stefanov
Abhinav Dhall
Jianfei Cai
Hamid Rezatofighi
Reza Haffari
Munawar Hayat
ViT
CVBM
25
60
0
12 Nov 2022
ViT-CX: Causal Explanation of Vision Transformers
Weiyan Xie
Xiao-hui Li
Caleb Chen Cao
Nevin L. Zhang
ViT
26
17
0
06 Nov 2022
1st Place Solution of The Robust Vision Challenge 2022 Semantic Segmentation Track
Junfei Xiao
Zhichao Xu
Shiyi Lan
Zhiding Yu
Alan Yuille
Anima Anandkumar
19
5
0
23 Oct 2022
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification
Junfei Xiao
Yutong Bai
Alan Yuille
Zongwei Zhou
MedIm
ViT
37
59
0
23 Oct 2022
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
54
35
0
19 Oct 2022
Curved Representation Space of Vision Transformers
Juyeop Kim
Junha Park
Songkuk Kim
Jongseok Lee
ViT
35
6
0
11 Oct 2022
ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints
Yinpeng Dong
Shouwei Ruan
Hang Su
Cai Kang
Xingxing Wei
Junyi Zhu
AAML
30
50
0
08 Oct 2022
Relational Proxies: Emergent Relationships as Fine-Grained Discriminators
Abhra Chaudhuri
Massimiliano Mancini
Zeynep Akata
Anjan Dutta
23
4
0
05 Oct 2022
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Fengzhuo Zhang
Boyi Liu
Kaixin Wang
Vincent Y. F. Tan
Zhuoran Yang
Zhaoran Wang
OffRL
LRM
51
10
0
20 Sep 2022
Transformers in Remote Sensing: A Survey
Abdulaziz Amer Aleissaee
Amandeep Kumar
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
Guisong Xia
F. Khan
ViT
54
175
0
02 Sep 2022
Swin-transformer-yolov5 For Real-time Wine Grape Bunch Detection
Shenglian Lu
Xiaoyu Liu
Zixaun He
Wenbo Liu
Xin Zhang
Manoj Karkee
26
39
0
30 Aug 2022
Exploring Adversarial Robustness of Vision Transformers in the Spectral Perspective
Gihyun Kim
Juyeop Kim
Jong-Seok Lee
AAML
ViT
24
4
0
20 Aug 2022
Prompt Vision Transformer for Domain Generalization
Zangwei Zheng
Xiangyu Yue
Kai Wang
Yang You
VLM
VPVLM
MDE
30
51
0
18 Aug 2022
Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification
Faris Almalik
Mohammad Yaqub
Karthik Nandakumar
ViT
AAML
MedIm
26
33
0
04 Aug 2022
Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem
Zheng Wang
Wenjie Ruan
ViT
39
8
0
01 Aug 2022
Adaptive occlusion sensitivity analysis for visually explaining video recognition networks
Tomoki Uchiyama
Naoya Sogi
S. Iizuka
Koichiro Niinuma
Kazuhiro Fukui
24
2
0
26 Jul 2022
Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility
Rohit Gupta
Naveed Akhtar
Ajmal Saeed Mian
M. Shah
AAML
SSL
26
5
0
22 Jul 2022
Towards Efficient Adversarial Training on Vision Transformers
Boxi Wu
Jindong Gu
Zhifeng Li
Deng Cai
Xiaofei He
Wei Liu
ViT
AAML
43
37
0
21 Jul 2022
Assaying Out-Of-Distribution Generalization in Transfer Learning
F. Wenzel
Andrea Dittadi
Peter V. Gehler
Carl-Johann Simon-Gabriel
Max Horn
...
Chris Russell
Thomas Brox
Bernt Schiele
Bernhard Schölkopf
Francesco Locatello
OOD
OODD
AAML
57
71
0
19 Jul 2022
Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations
H. Malik
Shahina Kunhimon
Muzammal Naseer
Salman Khan
F. Khan
AAML
23
8
0
18 Jul 2022
Position Prediction as an Effective Pretraining Strategy
Shuangfei Zhai
Navdeep Jaitly
Jason Ramapuram
Dan Busbridge
Tatiana Likhomanenko
Joseph Y. Cheng
Walter A. Talbott
Chen Huang
Hanlin Goh
J. Susskind
ViT
46
23
0
15 Jul 2022
Large-scale Robustness Analysis of Video Action Recognition Models
Madeline Chantry Schiappa
Naman Biyani
Prudvi Kamtam
Shruti Vyas
Hamid Palangi
Vibhav Vineet
Y. S. Rawat
AAML
34
24
0
04 Jul 2022
Backdoor Attacks on Vision Transformers
Akshayvarun Subramanya
Aniruddha Saha
Soroush Abbasi Koohpayegani
Ajinkya Tejankar
Hamed Pirsiavash
ViT
AAML
12
16
0
16 Jun 2022
INDIGO: Intrinsic Multimodality for Domain Generalization
Puneet Mangla
Shivam Chandhok
Milan Aggarwal
V. Balasubramanian
Balaji Krishnamurthy
VLM
38
2
0
13 Jun 2022
SeATrans: Learning Segmentation-Assisted diagnosis model via Transformer
Junde Wu
Huihui Fang
Fangxin Shang
Dalu Yang
Zhao-Yang Wang
Jing Gao
Yehui Yang
Yanwu Xu
MedIm
ViT
17
19
0
12 Jun 2022
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Siyuan Li
Di Wu
Fang Wu
Lei Shang
Stan Z. Li
32
48
0
27 May 2022
Label-Efficient Self-Supervised Federated Learning for Tackling Data Heterogeneity in Medical Imaging
Rui Yan
Liangqiong Qu
Qingyue Wei
Shih-Cheng Huang
Liyue Shen
D. Rubin
Lei Xing
Yuyin Zhou
FedML
78
90
0
17 May 2022
Deeper Insights into the Robustness of ViTs towards Common Corruptions
Rui Tian
Zuxuan Wu
Qi Dai
Han Hu
Yu-Gang Jiang
ViT
AAML
21
4
0
26 Apr 2022
Revisiting the Adversarial Robustness-Accuracy Tradeoff in Robot Learning
Mathias Lechner
Alexander Amini
Daniela Rus
T. Henzinger
AAML
26
9
0
15 Apr 2022
ViTOL: Vision Transformer for Weakly Supervised Object Localization
Saurav Gupta
Sourav Lakhotia
Abhay Rawat
Rahul Tallamraju
WSOL
32
21
0
14 Apr 2022
Does Robustness on ImageNet Transfer to Downstream Tasks?
Yutaro Yamada
Mayu Otani
OOD
32
27
0
08 Apr 2022
Multi-Task Distributed Learning using Vision Transformer with Random Patch Permutation
Sangjoon Park
Jong Chul Ye
FedML
MedIm
42
19
0
07 Apr 2022
Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness
Giulio Lovisotto
Nicole Finnie
Mauricio Muñoz
Chaithanya Kumar Mummadi
J. H. Metzen
AAML
ViT
30
32
0
25 Mar 2022
Unsupervised Salient Object Detection with Spectral Cluster Voting
Gyungin Shin
Samuel Albanie
Weidi Xie
24
65
0
23 Mar 2022
Harnessing Hard Mixed Samples with Decoupled Regularizer
Zicheng Liu
Siyuan Li
Ge Wang
Cheng Tan
Lirong Wu
Stan Z. Li
59
18
0
21 Mar 2022
Stepwise Feature Fusion: Local Guides Global
Jinfeng Wang
Qiming Huang
Feilong Tang
Jia Meng
Jionglong Su
Sifan Song
ViT
MedIm
24
179
0
07 Mar 2022