76

[Re] Improving Interpretation Faithfulness for Vision Transformers

Main:13 Pages
12 Figures
Bibliography:2 Pages
9 Tables
Appendix:14 Pages
Abstract

This work aims to reproduce the results of Faithful Vision Transformers (FViTs) proposed byarXiv:2311.17983alongside interpretability methods for Vision Transformers fromarXiv:2012.09838and Xu (2022) et al. We investigate claims made byarXiv:2311.17983, namely that the usage of Diffusion Denoised Smoothing (DDS) improves interpretability robustness to (1) attacks in a segmentation task and (2) perturbation and attacks in a classification task. We also extend the original study by investigating the authors' claims that adding DDS to any interpretability method can improve its robustness under attack. This is tested on baseline methods and the recently proposed Attribution Rollout method. In addition, we measure the computational costs and environmental impact of obtaining an FViT through DDS. Our results broadly agree with the original study's findings, although minor discrepancies were found and discussed.

View on arXiv
Comments on this paper