Stability Guarantees for Feature Attributions with Multiplicative Smoothing

12 July 2023

Papers citing "Stability Guarantees for Feature Attributions with Multiplicative Smoothing"

Probabilistic Stability Guarantees for Feature Attributions Helen Jin Anton Xue Weiqiu You Surbhi Goel Eric Wong 27 0 0 18 Apr 2025
One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability Gabriel Kasmi Amandine Brunetto Thomas Fel Jayneel Parekh AAML FAtt 35 0 0 02 Oct 2024
Enhancing Model Interpretability with Local Attribution over Global Exploration Zhiyu Zhu Zhibo Jin Jiayu Zhang Huaming Chen FAtt 35 4 0 14 Aug 2024
SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks Alexander Robey Eric Wong Hamed Hassani George J. Pappas AAML 43 215 0 05 Oct 2023
Towards Faithful Model Explanation in NLP: A Survey Qing Lyu Marianna Apidianaki Chris Callison-Burch XAI 112 107 0 22 Sep 2022
The Solvability of Interpretability Evaluation Metrics Yilun Zhou J. Shah 70 8 0 18 May 2022
"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification Jasmijn Bastings Sebastian Ebert Polina Zablotskaia Anders Sandholm Katja Filippova 115 75 0 14 Nov 2021
Certified Patch Robustness via Smoothed Vision Transformers Hadi Salman Saachi Jain Eric Wong Aleksander Mądry AAML 70 58 0 11 Oct 2021
Adversarial Machine Learning at Scale Alexey Kurakin Ian Goodfellow Samy Bengio AAML 288 3,110 0 04 Nov 2016
ImageNet Large Scale Visual Recognition Challenge Olga Russakovsky Jia Deng Hao Su J. Krause S. Satheesh ... A. Karpathy A. Khosla Michael S. Bernstein Alexander C. Berg Li Fei-Fei VLM ObjD 296 39,198 0 01 Sep 2014