v1v2 (latest)

Axiomatic Attribution for Deep Networks

4 March 2017

Ankur Taly

Papers citing "Axiomatic Attribution for Deep Networks"

50 / 2,871 papers shown

Title
Protecting Publicly Available Data With Machine Learning Shortcuts Nicolas Müller Maximilian Burgert Pascal Debus Jennifer Williams Philip Sperl Konstantin Böttinger 65 0 0 30 Oct 2023
TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery Jialin Chen Rex Ying AI4TS 59 24 0 30 Oct 2023
D4Explainer: In-Distribution GNN Explanations via Discrete Denoising Diffusion Jialin Chen Shirley Wu Abhijit Gupta Rex Ying DiffM 57 5 0 30 Oct 2023
This Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations Chiyu Ma Brandon Zhao Chaofan Chen Cynthia Rudin 82 29 0 28 Oct 2023
Visual Explanations via Iterated Integrated Attributions Oren Barkan Yehonatan Elisha Yuval Asher Amit Eshel Noam Koenigstein FAtt XAI 47 18 0 28 Oct 2023
Sample based Explanations via Generalized Representers Che-Ping Tsai Chih-Kuan Yeh Pradeep Ravikumar FAtt 95 9 0 27 Oct 2023
Understanding Parameter Saliency via Extreme Value Theory Shuo Wang Issei Sato AAML FAtt 36 0 0 27 Oct 2023
A Comprehensive and Reliable Feature Attribution Method: Double-sided Remove and Reconstruct (DoRaR) Dong Qin G. Amariucai Daji Qiao Yong Guan Shen Fu 134 5 0 27 Oct 2023
A Survey on Transferability of Adversarial Examples across Deep Neural Networks Jindong Gu Xiaojun Jia Pau de Jorge Wenqain Yu Xinwei Liu ... Anjun Hu Ashkan Khakzar Zhijiang Li Xiaochun Cao Philip Torr AAML 120 31 0 26 Oct 2023
SoK: Pitfalls in Evaluating Black-Box Attacks Fnu Suya Anshuman Suri Tingwei Zhang Jingtao Hong Yuan Tian David Evans AAML 104 6 0 26 Oct 2023
This Reads Like That: Deep Learning for Interpretable Natural Language Processing Claudio Fanconi Moritz Vandenhirtz Severin Husmann Julia E. Vogt FAtt 70 2 0 25 Oct 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction Yuqing Wang Prashanth Vijayaraghavan Ehsan Degan 63 4 0 25 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models Oren Barkan Yuval Asher Amit Eshel Yehonatan Elisha Noam Koenigstein 75 5 0 25 Oct 2023
On the stability, correctness and plausibility of visual explanation methods based on feature importance Romain Xu-Darme Jenny Benois-Pineau R. Giot Georges Quénot Zakaria Chihani M. Rousset Alexey Zhukov XAI FAtt 78 1 0 25 Oct 2023
Sanity checks for patch visualisation in prototype-based image classification Romain Xu-Darme Georges Quénot Zakaria Chihani M. Rousset 58 6 0 25 Oct 2023
Corrupting Neuron Explanations of Deep Visual Features Divyansh Srivastava Tuomas P. Oikarinen Tsui-Wei Weng FAtt AAML 44 2 0 25 Oct 2023
Instance-wise Linearization of Neural Network for Model Interpretation Zhimin Li Shusen Liu B. Kailkhura Timo Bremer Valerio Pascucci MILM FAtt 64 0 0 25 Oct 2023
Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups Weiqiu You Helen Qu Marco Gatti Bhuvnesh Jain Eric Wong FAtt FaML 97 3 0 25 Oct 2023
Contrastive Learning-based Sentence Encoders Implicitly Weight Informative Words Hiroto Kurita Goro Kobayashi Sho Yokoi Kentaro Inui 64 4 0 24 Oct 2023
Climate Change Impact on Agricultural Land Suitability: An Interpretable Machine Learning-Based Eurasia Case Study Valeriy Shevchenko Daria Taniushkina Aleksander Lukashevich Aleksandr Bulkin Roland Grinis Kirill Kovalev Veronika Narozhnaia Nazar Sotiriadi Alexander Krenke Yury Maximov AI4CE 43 7 0 24 Oct 2023
Deep Integrated Explanations Oren Barkan Yehonatan Elisha Jonathan Weill Yuval Asher Amit Eshel Noam Koenigstein FAtt 107 7 0 23 Oct 2023
XTSC-Bench: Quantitative Benchmarking for Explainers on Time Series Classification Jacqueline Höllig Steffen Thoma Florian Grimm AI4TS 62 1 0 23 Oct 2023
Cross-Modal Conceptualization in Bottleneck Models Danis Alukaev S. Kiselev Ilya Pershin Bulat Ibragimov Vladimir Ivanov Alexey Kornaev Ivan Titov 78 7 0 23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization Mohammad Reza Ghasemi Madani Pasquale Minervini 91 4 0 22 Oct 2023
Preference Elicitation with Soft Attributes in Interactive Recommendation Erdem Biyik Fan Yao Yinlam Chow Alex Haig Chih-Wei Hsu Mohammad Ghavamzadeh Craig Boutilier 135 4 0 22 Oct 2023
Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making Yanrui Du Sendong Zhao Hao Wang Yuhan Chen Rui Bai Zewen Qiang Muzhen Cai Bing Qin 64 0 0 20 Oct 2023
Does Your Model Think Like an Engineer? Explainable AI for Bearing Fault Detection with Deep Learning Thomas Decker Michael Lebacher Volker Tresp 31 13 0 19 Oct 2023
Transformer-based Entity Legal Form Classification Alexander Arimond Mauro Molteni Dominik Jany Zornitsa Manolova Damian Borth Andreas G. F. Hoepner MedIm AILaw 54 1 0 19 Oct 2023
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation Chongyu Fan Jiancheng Liu Yihua Zhang Eric Wong Dennis Wei Sijia Liu MU 143 150 0 19 Oct 2023
MARVEL: Multi-Agent Reinforcement-Learning for Large-Scale Variable Speed Limits Yuhang Zhang Marcos Quiñones-Grueiro Zhiyao Zhang Yanbing Wang William Barbour Gautam Biswas Dan Work 48 5 0 18 Oct 2023
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation Giuseppe Attanasio Flor Miriam Plaza del Arco Debora Nozza Anne Lauscher 72 19 0 18 Oct 2023
From Neural Activations to Concepts: A Survey on Explaining Concepts in Neural Networks Jae Hee Lee Sergio Lanza Stefan Wermter 73 10 0 18 Oct 2023
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification Shanshan Xu Santosh T.Y.S.S O. Ichim Isabella Risini Barbara Plank Matthias Grabmair AILaw 116 12 0 18 Oct 2023
VECHR: A Dataset for Explainable and Robust Classification of Vulnerability Type in the European Court of Human Rights Shanshan Xu Leon Staufer Santosh T.Y.S.S O. Ichim Corina Heri Matthias Grabmair 50 0 0 17 Oct 2023
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations Shiyuan Huang Siddarth Mamidanna Shreedhar Jangam Yilun Zhou Leilani H. Gilpin LRM MILM ELM 116 77 0 17 Oct 2023
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation S. Nigam Aniket Deroy Noel Shallum Ayush Kumar Mishra Anup Roy Shubham Kumar Mishra Arnab Bhattacharya Saptarshi Ghosh Kripabandhu Ghosh AILaw ELM 80 11 0 17 Oct 2023
Learning optimal integration of spatial and temporal information in noisy chemotaxis Albert Alonso J. B. Kirkegaard 54 4 0 16 Oct 2023
DANAA: Towards transferable attacks with double adversarial neuron attribution Zhibo Jin Zhiyu Zhu Xinyi Wang Jiayu Zhang Jun Shen Huaming Chen AAML 66 10 0 16 Oct 2023
Transparent Anomaly Detection via Concept-based Explanations Laya Rafiee Sevyeri Ivaxi Sheth Farhood Farahnak Samira Ebrahimi Kahou S. Enger 60 4 0 16 Oct 2023
LICO: Explainable Models with Language-Image Consistency Yiming Lei Zilong Li Yangyang Li Junping Zhang Hongming Shan VLM FAtt 53 7 0 15 Oct 2023
Assessing the Reliability of Large Language Model Knowledge Weixuan Wang Barry Haddow Alexandra Birch Wei Peng KELM HILM 106 15 0 15 Oct 2023
Notes on Applicability of Explainable AI Methods to Machine Learning Models Using Features Extracted by Persistent Homology Naofumi Hama 89 0 0 15 Oct 2023
Interpretable Diffusion via Information Decomposition Xianghao Kong Ollie Liu Han Li Dani Yogatama Greg Ver Steeg 107 22 0 12 Oct 2023
Faithfulness Measurable Masked Language Models Andreas Madsen Siva Reddy Sarath Chandar 85 3 0 11 Oct 2023
Human-Centered Evaluation of XAI Methods Karam Dawoud Wojciech Samek Peter Eisert Sebastian Lapuschkin Sebastian Bosse 66 4 0 11 Oct 2023
NeuroInspect: Interpretable Neuron-based Debugging Framework through Class-conditional Visualizations Yeong-Joon Ju Ji-Hoon Park Seong-Whan Lee AAML 46 0 0 11 Oct 2023
Comparing Styles across Languages: A Cross-Cultural Exploration of Politeness Shreya Havaldar Matthew Pressimone Eric Wong Lyle Ungar 125 2 0 11 Oct 2023
Evaluating Explanation Methods for Vision-and-Language Navigation Guanqi Chen Lei Yang Guanhua Chen Jia Pan XAI 65 1 0 10 Oct 2023
AttributionLab: Faithfulness of Feature Attribution Under Controllable Environments Yang Zhang Yawei Li Hannah Brown Mina Rezaei Bernd Bischl Philip Torr Ashkan Khakzar Kenji Kawaguchi OOD 81 2 0 10 Oct 2023
Interpreting CLIP's Image Representation via Text-Based Decomposition Yossi Gandelsman Alexei A. Efros Jacob Steinhardt VLM 98 101 0 09 Oct 2023