Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.07538
Cited By
Towards Robust Interpretability with Self-Explaining Neural Networks
20 June 2018
David Alvarez-Melis
Tommi Jaakkola
MILM
XAI
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards Robust Interpretability with Self-Explaining Neural Networks"
50 / 507 papers shown
Title
Logic Rules as Explanations for Legal Case Retrieval
ZhongXiang Sun
Kepu Zhang
Weijie Yu
Haoyu Wang
Jun Xu
AILaw
ELM
41
6
0
03 Mar 2024
Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
Roy Xie
Orevaoghene Ahia
Yulia Tsvetkov
Antonios Anastasopoulos
40
4
0
27 Feb 2024
From Movements to Metrics: Evaluating Explainable AI Methods in Skeleton-Based Human Activity Recognition
Kimji N. Pellano
Inga Strümke
Espen Alexander F. Ihlen
40
7
0
20 Feb 2024
Explaining Probabilistic Models with Distributional Values
Luca Franceschi
Michele Donini
Cédric Archambeau
Matthias Seeger
FAtt
37
2
0
15 Feb 2024
Variational Shapley Network: A Probabilistic Approach to Self-Explaining Shapley values with Uncertainty Quantification
Mert Ketenci
Inigo Urteaga
Victor Alfonso Rodriguez
Noémie Elhadad
A. Perotte
FAtt
22
0
0
06 Feb 2024
Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs
He Zhao
V. Kitsios
Terry O'Kane
Edwin V. Bonilla
CML
24
1
0
06 Feb 2024
InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts
Vinitra Swamy
Syrielle Montariol
Julian Blackwell
Jibril Frej
Martin Jaggi
Tanja Kaser
44
3
0
05 Feb 2024
Focal Modulation Networks for Interpretable Sound Classification
Luca Della Libera
Cem Subakan
Mirco Ravanelli
33
2
0
05 Feb 2024
NormEnsembleXAI: Unveiling the Strengths and Weaknesses of XAI Ensemble Techniques
Weronika Hryniewska-Guzik
Bartosz Sawicki
P. Biecek
38
0
0
30 Jan 2024
Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition
Sangyu Han
Yearim Kim
Nojun Kwak
AAML
29
1
0
25 Jan 2024
A comprehensive study on fidelity metrics for XAI
Miquel Miró-Nicolau
Antoni Jaume-i-Capó
Gabriel Moyà Alcover
36
11
0
19 Jan 2024
DiConStruct: Causal Concept-based Explanations through Black-Box Distillation
Ricardo Moreira
Jacopo Bono
Mário Cardoso
Pedro Saleiro
Mário A. T. Figueiredo
P. Bizarro
CML
28
4
0
16 Jan 2024
MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment
Yequan Bie
Luyang Luo
Hao Chen
26
14
0
16 Jan 2024
Sanity Checks Revisited: An Exploration to Repair the Model Parameter Randomisation Test
Anna Hedström
Leander Weber
Sebastian Lapuschkin
Marina M.-C. Höhne
LRM
35
3
0
12 Jan 2024
A tree-based varying coefficient model
Henning Zakrisson
Mathias Lindholm
35
1
0
11 Jan 2024
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction
Wei Qian
Chenxu Zhao
Yangyi Li
Fenglong Ma
Chao Zhang
Mengdi Huai
UQCV
47
2
0
03 Jan 2024
3VL: Using Trees to Improve Vision-Language Models' Interpretability
Nir Yellinek
Leonid Karlinsky
Raja Giryes
CoGe
VLM
49
4
0
28 Dec 2023
Q-SENN: Quantized Self-Explaining Neural Networks
Thomas Norrenbrock
Marco Rudolph
Bodo Rosenhahn
FAtt
AAML
MILM
28
6
0
21 Dec 2023
Concept-based Explainable Artificial Intelligence: A Survey
Eleonora Poeta
Gabriele Ciravegna
Eliana Pastor
Tania Cerquitelli
Elena Baralis
LRM
XAI
24
42
0
20 Dec 2023
ALMANACS: A Simulatability Benchmark for Language Model Explainability
Edmund Mills
Shiye Su
Stuart J. Russell
Scott Emmons
51
7
0
20 Dec 2023
Prototypical Self-Explainable Models Without Re-training
Srishti Gautam
Ahcène Boubekki
Marina M.-C. Höhne
Michael C. Kampffmeyer
31
2
0
13 Dec 2023
Mixture of Gaussian-distributed Prototypes with Generative Modelling for Interpretable and Trustworthy Image Recognition
Chong Wang
Yuanhong Chen
Fengbei Liu
Yuyuan Liu
Davis J. McCarthy
Helen Frazer
Gustavo Carneiro
26
1
0
30 Nov 2023
Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement
Avani Gupta
Saurabh Saini
P. J. Narayanan
28
6
0
26 Nov 2023
The Disagreement Problem in Faithfulness Metrics
Brian Barr
Noah Fatsi
Leif Hancox-Li
Peter Richter
Daniel Proano
Caleb Mok
42
4
0
13 Nov 2023
Assessing Fidelity in XAI post-hoc techniques: A Comparative Study with Ground Truth Explanations Datasets
Miquel Miró-Nicolau
Antoni Jaume-i-Capó
Gabriel Moyà Alcover
XAI
42
11
0
03 Nov 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang
Prashanth Vijayaraghavan
Ehsan Degan
11
4
0
25 Oct 2023
On the stability, correctness and plausibility of visual explanation methods based on feature importance
Romain Xu-Darme
Jenny Benois-Pineau
R. Giot
Georges Quénot
Zakaria Chihani
M. Rousset
Alexey Zhukov
XAI
FAtt
22
1
0
25 Oct 2023
Sanity checks for patch visualisation in prototype-based image classification
Romain Xu-Darme
Georges Quénot
Zakaria Chihani
M. Rousset
19
6
0
25 Oct 2023
XTSC-Bench: Quantitative Benchmarking for Explainers on Time Series Classification
Jacqueline Höllig
Steffen Thoma
Florian Grimm
AI4TS
17
1
0
23 Oct 2023
Cross-Modal Conceptualization in Bottleneck Models
Danis Alukaev
S. Kiselev
Ilya S. Pershin
Bulat Ibragimov
Vladimir Ivanov
Alexey Kornaev
Ivan Titov
41
7
0
23 Oct 2023
Evaluating Large Language Models on Controlled Generation Tasks
Jiao Sun
Yufei Tian
Wangchunshu Zhou
Nan Xu
Qian Hu
Rahul Gupta
John Wieting
Nanyun Peng
Xuezhe Ma
LRM
ELM
40
61
0
23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization
Mohammad Reza Ghasemi Madani
Pasquale Minervini
35
4
0
22 Oct 2023
Explanation-based Training with Differentiable Insertion/Deletion Metric-aware Regularizers
Yuya Yoshikawa
Tomoharu Iwata
24
0
0
19 Oct 2023
A Framework for Interpretability in Machine Learning for Medical Imaging
Alan Q. Wang
Batuhan K. Karaman
Heejong Kim
Jacob Rosenthal
Rachit Saluja
Sean I. Young
M. Sabuncu
AI4CE
17
10
0
02 Oct 2023
Learning to Receive Help: Intervention-Aware Concept Embedding Models
M. Zarlenga
Katherine M. Collins
Krishnamurthy Dvijotham
Adrian Weller
Z. Shams
M. Jamnik
24
23
0
29 Sep 2023
Language Models as a Service: Overview of a New Paradigm and its Challenges
Emanuele La Malfa
Aleksandar Petrov
Simon Frieder
Christoph Weinhuber
Ryan Burnell
Raza Nazar
Anthony Cohn
Nigel Shadbolt
Michael Wooldridge
ALM
ELM
35
3
0
28 Sep 2023
Towards Faithful Neural Network Intrinsic Interpretation with Shapley Additive Self-Attribution
Ying Sun
Hengshu Zhu
Huixia Xiong
TDI
FAtt
MILM
25
1
0
27 Sep 2023
Provably Robust and Plausible Counterfactual Explanations for Neural Networks via Robust Optimisation
Junqi Jiang
Jianglin Lan
Francesco Leofante
Antonio Rago
Francesca Toni
OOD
35
9
0
22 Sep 2023
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer
Arkadiy Saakyan
Smaranda Muresan
23
3
0
15 Sep 2023
Learning by Self-Explaining
Wolfgang Stammer
Felix Friedrich
David Steinmann
Manuel Brack
Hikaru Shindo
Kristian Kersting
26
7
0
15 Sep 2023
How Faithful are Self-Explainable GNNs?
Marc Christiansen
Lea Villadsen
Zhiqiang Zhong
Stefano Teso
Davide Mottin
23
3
0
29 Aug 2023
Learning to Intervene on Concept Bottlenecks
David Steinmann
Wolfgang Stammer
Felix Friedrich
Kristian Kersting
17
19
0
25 Aug 2023
Fairness Explainability using Optimal Transport with Applications in Image Classification
Philipp Ratz
Franccois Hu
Arthur Charpentier
23
0
0
22 Aug 2023
Interpretable Graph Neural Networks for Tabular Data
Amr Alkhatib
Sofiane Ennadir
Henrik Bostrom
Michalis Vazirgiannis
LMTD
36
4
0
17 Aug 2023
Explainable AI for clinical risk prediction: a survey of concepts, methods, and modalities
Munib Mesinovic
Peter Watkinson
Ting Zhu
FaML
19
3
0
16 Aug 2023
Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations
Mikolaj Sacha
Bartosz Jura
Dawid Rymarczyk
Lukasz Struski
Jacek Tabor
Bartosz Zieliñski
32
14
0
16 Aug 2023
FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods
Robin Hesse
Simone Schaub-Meyer
Stefan Roth
AAML
37
32
0
11 Aug 2023
TrajPAC: Towards Robustness Verification of Pedestrian Trajectory Prediction Models
Liang Zhang
Nathaniel Xu
Pengfei Yang
Gao Jin
Cheng-Chao Huang
Lijun Zhang
28
8
0
11 Aug 2023
Precise Benchmarking of Explainable AI Attribution Methods
Rafael Brandt
Daan Raatjens
G. Gaydadjiev
XAI
27
4
0
06 Aug 2023
Two Approaches to Supervised Image Segmentation
Alexandre Benatti
L. D. F. Costa
38
2
0
19 Jul 2023
Previous
1
2
3
4
5
6
...
9
10
11
Next