2307.05902
Cited By
Stability Guarantees for Feature Attributions with Multiplicative Smoothing
12 July 2023
Anton Xue
Rajeev Alur
Eric Wong
Papers citing
"Stability Guarantees for Feature Attributions with Multiplicative Smoothing"
10 / 10 papers shown
Title
Probabilistic Stability Guarantees for Feature Attributions
Helen Jin
Anton Xue
Weiqiu You
Surbhi Goel
Eric Wong
27
0
0
18 Apr 2025
One Wave to Explain Them All: A Unifying Perspective on Post-hoc Explainability
Gabriel Kasmi
Amandine Brunetto
Thomas Fel
Jayneel Parekh
AAML
FAtt
35
0
0
02 Oct 2024
Enhancing Model Interpretability with Local Attribution over Global Exploration
Zhiyu Zhu
Zhibo Jin
Jiayu Zhang
Huaming Chen
FAtt
35
4
0
14 Aug 2024
SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks
Alexander Robey
Eric Wong
Hamed Hassani
George J. Pappas
AAML
43
215
0
05 Oct 2023
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
112
107
0
22 Sep 2022
The Solvability of Interpretability Evaluation Metrics
Yilun Zhou
J. Shah
70
8
0
18 May 2022
"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification
Jasmijn Bastings
Sebastian Ebert
Polina Zablotskaia
Anders Sandholm
Katja Filippova
115
75
0
14 Nov 2021
Certified Patch Robustness via Smoothed Vision Transformers
Hadi Salman
Saachi Jain
Eric Wong
Aleksander Mądry
AAML
70
58
0
11 Oct 2021
Adversarial Machine Learning at Scale
Alexey Kurakin
Ian Goodfellow
Samy Bengio
AAML
288
3,110
0
04 Nov 2016
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
296
39,198
0
01 Sep 2014