Towards Robust Interpretability with Self-Explaining Neural Networks

20 June 2018

David Alvarez-Melis

Papers citing "Towards Robust Interpretability with Self-Explaining Neural Networks"

50 / 507 papers shown

Title
Logic Rules as Explanations for Legal Case Retrieval ZhongXiang Sun Kepu Zhang Weijie Yu Haoyu Wang Jun Xu AILaw ELM 41 6 0 03 Mar 2024
Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers Roy Xie Orevaoghene Ahia Yulia Tsvetkov Antonios Anastasopoulos 40 4 0 27 Feb 2024
From Movements to Metrics: Evaluating Explainable AI Methods in Skeleton-Based Human Activity Recognition Kimji N. Pellano Inga Strümke Espen Alexander F. Ihlen 40 7 0 20 Feb 2024
Explaining Probabilistic Models with Distributional Values Luca Franceschi Michele Donini Cédric Archambeau Matthias Seeger FAtt 37 2 0 15 Feb 2024
Variational Shapley Network: A Probabilistic Approach to Self-Explaining Shapley values with Uncertainty Quantification Mert Ketenci Inigo Urteaga Victor Alfonso Rodriguez Noémie Elhadad A. Perotte FAtt 22 0 0 06 Feb 2024
Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs He Zhao V. Kitsios Terry O'Kane Edwin V. Bonilla CML 24 1 0 06 Feb 2024
InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts Vinitra Swamy Syrielle Montariol Julian Blackwell Jibril Frej Martin Jaggi Tanja Kaser 44 3 0 05 Feb 2024
Focal Modulation Networks for Interpretable Sound Classification Luca Della Libera Cem Subakan Mirco Ravanelli 33 2 0 05 Feb 2024
NormEnsembleXAI: Unveiling the Strengths and Weaknesses of XAI Ensemble Techniques Weronika Hryniewska-Guzik Bartosz Sawicki P. Biecek 38 0 0 30 Jan 2024
Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition Sangyu Han Yearim Kim Nojun Kwak AAML 29 1 0 25 Jan 2024
A comprehensive study on fidelity metrics for XAI Miquel Miró-Nicolau Antoni Jaume-i-Capó Gabriel Moyà Alcover 36 11 0 19 Jan 2024
DiConStruct: Causal Concept-based Explanations through Black-Box Distillation Ricardo Moreira Jacopo Bono Mário Cardoso Pedro Saleiro Mário A. T. Figueiredo P. Bizarro CML 28 4 0 16 Jan 2024
MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment Yequan Bie Luyang Luo Hao Chen 26 14 0 16 Jan 2024
Sanity Checks Revisited: An Exploration to Repair the Model Parameter Randomisation Test Anna Hedström Leander Weber Sebastian Lapuschkin Marina M.-C. Höhne LRM 35 3 0 12 Jan 2024
A tree-based varying coefficient model Henning Zakrisson Mathias Lindholm 35 1 0 11 Jan 2024
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction Wei Qian Chenxu Zhao Yangyi Li Fenglong Ma Chao Zhang Mengdi Huai UQCV 47 2 0 03 Jan 2024
3VL: Using Trees to Improve Vision-Language Models' Interpretability Nir Yellinek Leonid Karlinsky Raja Giryes CoGe VLM 49 4 0 28 Dec 2023
Q-SENN: Quantized Self-Explaining Neural Networks Thomas Norrenbrock Marco Rudolph Bodo Rosenhahn FAtt AAML MILM 28 6 0 21 Dec 2023
Concept-based Explainable Artificial Intelligence: A Survey Eleonora Poeta Gabriele Ciravegna Eliana Pastor Tania Cerquitelli Elena Baralis LRM XAI 24 42 0 20 Dec 2023
ALMANACS: A Simulatability Benchmark for Language Model Explainability Edmund Mills Shiye Su Stuart J. Russell Scott Emmons 51 7 0 20 Dec 2023
Prototypical Self-Explainable Models Without Re-training Srishti Gautam Ahcène Boubekki Marina M.-C. Höhne Michael C. Kampffmeyer 31 2 0 13 Dec 2023
Mixture of Gaussian-distributed Prototypes with Generative Modelling for Interpretable and Trustworthy Image Recognition Chong Wang Yuanhong Chen Fengbei Liu Yuyuan Liu Davis J. McCarthy Helen Frazer Gustavo Carneiro 26 1 0 30 Nov 2023
Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement Avani Gupta Saurabh Saini P. J. Narayanan 28 6 0 26 Nov 2023
The Disagreement Problem in Faithfulness Metrics Brian Barr Noah Fatsi Leif Hancox-Li Peter Richter Daniel Proano Caleb Mok 42 4 0 13 Nov 2023
Assessing Fidelity in XAI post-hoc techniques: A Comparative Study with Ground Truth Explanations Datasets Miquel Miró-Nicolau Antoni Jaume-i-Capó Gabriel Moyà Alcover XAI 42 11 0 03 Nov 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction Yuqing Wang Prashanth Vijayaraghavan Ehsan Degan 11 4 0 25 Oct 2023
On the stability, correctness and plausibility of visual explanation methods based on feature importance Romain Xu-Darme Jenny Benois-Pineau R. Giot Georges Quénot Zakaria Chihani M. Rousset Alexey Zhukov XAI FAtt 22 1 0 25 Oct 2023
Sanity checks for patch visualisation in prototype-based image classification Romain Xu-Darme Georges Quénot Zakaria Chihani M. Rousset 19 6 0 25 Oct 2023
XTSC-Bench: Quantitative Benchmarking for Explainers on Time Series Classification Jacqueline Höllig Steffen Thoma Florian Grimm AI4TS 17 1 0 23 Oct 2023
Cross-Modal Conceptualization in Bottleneck Models Danis Alukaev S. Kiselev Ilya S. Pershin Bulat Ibragimov Vladimir Ivanov Alexey Kornaev Ivan Titov 41 7 0 23 Oct 2023
Evaluating Large Language Models on Controlled Generation Tasks Jiao Sun Yufei Tian Wangchunshu Zhou Nan Xu Qian Hu Rahul Gupta John Wieting Nanyun Peng Xuezhe Ma LRM ELM 40 61 0 23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization Mohammad Reza Ghasemi Madani Pasquale Minervini 35 4 0 22 Oct 2023
Explanation-based Training with Differentiable Insertion/Deletion Metric-aware Regularizers Yuya Yoshikawa Tomoharu Iwata 24 0 0 19 Oct 2023
A Framework for Interpretability in Machine Learning for Medical Imaging Alan Q. Wang Batuhan K. Karaman Heejong Kim Jacob Rosenthal Rachit Saluja Sean I. Young M. Sabuncu AI4CE 17 10 0 02 Oct 2023
Learning to Receive Help: Intervention-Aware Concept Embedding Models M. Zarlenga Katherine M. Collins Krishnamurthy Dvijotham Adrian Weller Z. Shams M. Jamnik 24 23 0 29 Sep 2023
Language Models as a Service: Overview of a New Paradigm and its Challenges Emanuele La Malfa Aleksandar Petrov Simon Frieder Christoph Weinhuber Ryan Burnell Raza Nazar Anthony Cohn Nigel Shadbolt Michael Wooldridge ALM ELM 35 3 0 28 Sep 2023
Towards Faithful Neural Network Intrinsic Interpretation with Shapley Additive Self-Attribution Ying Sun Hengshu Zhu Huixia Xiong TDI FAtt MILM 25 1 0 27 Sep 2023
Provably Robust and Plausible Counterfactual Explanations for Neural Networks via Robust Optimisation Junqi Jiang Jianglin Lan Francesco Leofante Antonio Rago Francesca Toni OOD 35 9 0 22 Sep 2023
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer Arkadiy Saakyan Smaranda Muresan 23 3 0 15 Sep 2023
Learning by Self-Explaining Wolfgang Stammer Felix Friedrich David Steinmann Manuel Brack Hikaru Shindo Kristian Kersting 26 7 0 15 Sep 2023
How Faithful are Self-Explainable GNNs? Marc Christiansen Lea Villadsen Zhiqiang Zhong Stefano Teso Davide Mottin 23 3 0 29 Aug 2023
Learning to Intervene on Concept Bottlenecks David Steinmann Wolfgang Stammer Felix Friedrich Kristian Kersting 17 19 0 25 Aug 2023
Fairness Explainability using Optimal Transport with Applications in Image Classification Philipp Ratz Franccois Hu Arthur Charpentier 23 0 0 22 Aug 2023
Interpretable Graph Neural Networks for Tabular Data Amr Alkhatib Sofiane Ennadir Henrik Bostrom Michalis Vazirgiannis LMTD 36 4 0 17 Aug 2023
Explainable AI for clinical risk prediction: a survey of concepts, methods, and modalities Munib Mesinovic Peter Watkinson Ting Zhu FaML 19 3 0 16 Aug 2023
Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations Mikolaj Sacha Bartosz Jura Dawid Rymarczyk Lukasz Struski Jacek Tabor Bartosz Zieliñski 32 14 0 16 Aug 2023
FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods Robin Hesse Simone Schaub-Meyer Stefan Roth AAML 37 32 0 11 Aug 2023
TrajPAC: Towards Robustness Verification of Pedestrian Trajectory Prediction Models Liang Zhang Nathaniel Xu Pengfei Yang Gao Jin Cheng-Chao Huang Lijun Zhang 28 8 0 11 Aug 2023
Precise Benchmarking of Explainable AI Attribution Methods Rafael Brandt Daan Raatjens G. Gaydadjiev XAI 27 4 0 06 Aug 2023
Two Approaches to Supervised Image Segmentation Alexandre Benatti L. D. F. Costa 38 2 0 19 Jul 2023