Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.01365
Cited By
v1
v2 (latest)
Axiomatic Attribution for Deep Networks
4 March 2017
Mukund Sundararajan
Ankur Taly
Qiqi Yan
OOD
FAtt
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Axiomatic Attribution for Deep Networks"
50 / 2,871 papers shown
Title
FW-Shapley: Real-time Estimation of Weighted Shapley Values
Pranoy Panda
Siddharth Tandon
V. Balasubramanian
TDI
155
1
0
09 Mar 2025
Interpretable High-order Knowledge Graph Neural Network for Predicting Synthetic Lethality in Human Cancers
Xuexin Chen
Ruichu Cai
Zhengting Huang
Zijian Li
Jie Zheng
Min Wu
104
0
0
08 Mar 2025
MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model
Miguel Contreras
Jessica Sena
Andrea Davidson
Jiaqing Zhang
T. Ozrazgat-Baslanti
...
Jeremy A. Balch
Tyler J. Loftus
Subhash Nerella
A. Bihorac
Parisa Rashidi
151
0
0
08 Mar 2025
Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations
Eren Erogullari
Sebastian Lapuschkin
Wojciech Samek
Frederik Pahde
LLMSV
CoGe
96
0
0
07 Mar 2025
Towards Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients
Niklas Penzel
Joachim Denzler
FAtt
92
0
0
07 Mar 2025
A Unified Framework with Novel Metrics for Evaluating the Effectiveness of XAI Techniques in LLMs
Melkamu Mersha
Mesay Gemeda Yigezu
Hassan Shakil
Ali Al shami
SangHyun Byun
Jugal Kalita
222
2
0
06 Mar 2025
Enhancing Network Security Management in Water Systems using FM-based Attack Attribution
Aleksandar Avdalovic
Joseph Khoury
Ahmad Taha
E. Bou-Harb
AAML
77
1
0
03 Mar 2025
Riemannian Integrated Gradients: A Geometric View of Explainable AI
Federico Costanza
Lachlan Simpson
88
0
0
02 Mar 2025
Foundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction
Wenrui Fan
L. M. Riza Rizky
Jiayang Zhang
Chen Chen
Haiping Lu
Kevin Teh
Dinesh Selvarajah
Shuo Zhou
92
0
0
28 Feb 2025
Enhancing Explainability with Multimodal Context Representations for Smarter Robots
Anargh Viswanath
Lokesh Veeramacheneni
Hendrik Buschmeier
66
0
0
28 Feb 2025
Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps
Lukasz Sztukiewicz
Ignacy Stepka
Michał Wiliński
Jerzy Stefanowski
146
0
0
28 Feb 2025
FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients
Leming Shen
Qiang Yang
Kaiyan Cui
Yuanqing Zheng
Xiao-Yong Wei
Jianwei Liu
Jinsong Han
FedML
286
11
0
28 Feb 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock
Timo Kaiser
Sovan Biswas
R. Manuvinakurike
Bodo Rosenhahn
158
0
0
27 Feb 2025
Interpreting CLIP with Hierarchical Sparse Autoencoders
Vladimir Zaigrajew
Hubert Baniecki
P. Biecek
260
1
0
27 Feb 2025
Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models
Itay Benou
Tammy Riklin-Raviv
170
1
0
27 Feb 2025
Models That Are Interpretable But Not Transparent
Chudi Zhong
Panyu Chen
Cynthia Rudin
AAML
105
0
0
26 Feb 2025
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Zihao Li
Ruixiang Tang
Lu Cheng
Shuaiqiang Wang
D. Yin
Jundong Li
154
0
0
25 Feb 2025
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
Qianli Ma
Dongrui Liu
Qian Chen
Linfeng Zhang
Jing Shao
MoMe
456
2
0
24 Feb 2025
Class-Dependent Perturbation Effects in Evaluating Time Series Attributions
Gregor Baer
Isel Grau
Chao Zhang
Pieter Van Gorp
AAML
122
1
0
24 Feb 2025
Interpretable Retinal Disease Prediction Using Biology-Informed Heterogeneous Graph Representations
Laurin Lux
Alexander H. Berger
Maria Romeo Tricas
Alaa E. Fayed
Siyang Song
Linus Kreitner
Jonas Weidner
Fernando Navarro
Daniel Rueckert
Johannes C. Paetzold
86
2
0
23 Feb 2025
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
Tue Cao
Nhat X. Hoang
Hieu H. Pham
P. Nguyen
My T. Thai
247
1
0
22 Feb 2025
SALTY: Explainable Artificial Intelligence Guided Structural Analysis for Hardware Trojan Detection
Tanzim Mahfuz
Pravin Gaikwad
Tasneem Suha
Swarup Bhunia
Prabuddha Chakraborty
73
0
0
21 Feb 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
L. Arras
Bruno Puri
Patrick Kahardipraja
Sebastian Lapuschkin
Wojciech Samek
98
3
0
21 Feb 2025
Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness
Weisong Sun
Yuchen Chen
Mengzhe Yuan
Chunrong Fang
Zhenpeng Chen
Chong Wang
Yang Liu
Baowen Xu
Zhenyu Chen
AAML
86
1
0
20 Feb 2025
SPEX: Scaling Feature Interaction Explanations for LLMs
Justin Singh Kang
Landon Butler
Abhineet Agarwal
Yigit Efe Erginbas
Ramtin Pedarsani
Kannan Ramchandran
Bin Yu
VLM
LRM
167
2
0
20 Feb 2025
Revisiting the Generalization Problem of Low-level Vision Models Through the Lens of Image Deraining
Jinfan Hu
Zhiyuan You
Jinjin Gu
Kaiwen Zhu
Tianfan Xue
Chao Dong
121
0
0
18 Feb 2025
From Abstract to Actionable: Pairwise Shapley Values for Explainable AI
Jiaxin Xu
Hung Chau
Angela Burden
TDI
119
0
0
18 Feb 2025
Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models
Thomas Fel
Ekdeep Singh Lubana
Jacob S. Prince
M. Kowal
Victor Boutin
Isabel Papadimitriou
Binxu Wang
Martin Wattenberg
Demba Ba
Talia Konkle
76
8
0
18 Feb 2025
Error-controlled non-additive interaction discovery in machine learning models
Winston Chen
Yifan Jiang
William Stafford Noble
Yang Young Lu
136
1
0
17 Feb 2025
Mechanistic Unveiling of Transformer Circuits: Self-Influence as a Key to Model Reasoning
Lefei Zhang
Lijie Hu
Di Wang
LRM
205
5
0
17 Feb 2025
Suboptimal Shapley Value Explanations
Xiaolei Lu
FAtt
97
0
0
17 Feb 2025
Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks
Jon Vadillo
Roberto Santana
J. A. Lozano
Marta Z. Kwiatkowska
BDL
AAML
149
0
0
17 Feb 2025
Using the Path of Least Resistance to Explain Deep Networks
Sina Salek
Joseph Enguehard
FAtt
73
0
0
17 Feb 2025
Time-series attribution maps with regularized contrastive learning
Steffen Schneider
Rodrigo González Laiz
Anastasiia Filippova
Markus Frey
Mackenzie W. Mathis
BDL
FAtt
CML
AI4TS
114
1
0
17 Feb 2025
Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability
Zhiyu Zhu
Zhibo Jin
Jiayu Zhang
Nan Yang
Jiahao Huang
Jianlong Zhou
Fang Chen
102
0
0
16 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
Behrooz Azarkhalili
Maxwell Libbrecht
81
0
0
14 Feb 2025
Recent Advances in Malware Detection: Graph Learning and Explainability
Hossein Shokouhinejad
Roozbeh Razavi-Far
Hesamodin Mohammadian
Mahdi Rabbani
Samuel Ansong
Griffin Higgins
Ali Ghorbani
AAML
143
2
0
14 Feb 2025
Towards Transparent and Accurate Plasma State Monitoring at JET
Andrin Bürli
Alessandro Pau
Thomas Koller
Olivier Sauter
JET Contributors
92
2
0
14 Feb 2025
Applying Deep Learning to Ads Conversion Prediction in Last Mile Delivery Marketplace
Di Li
Xiaochang Miao
Huiyu Song
Chao Chu
Hao Xu
Mandar Rahurkar
54
0
0
14 Feb 2025
Brain-Inspired Exploration of Functional Networks and Key Neurons in Large Language Models
Yiheng Liu
Xiaohui Gao
Haiyang Sun
Bao Ge
Tianming Liu
Junwei Han
X. Hu
88
2
0
13 Feb 2025
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities
Ding Hu
Pengxiang Hua
Zhen Huang
261
0
0
09 Feb 2025
Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment
Harrish Thasarathan
Julian Forsyth
Thomas Fel
M. Kowal
Konstantinos G. Derpanis
146
10
0
06 Feb 2025
Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis
Haowen Bai
Zixiang Zhao
Jiangshe Zhang
Baisong Jiang
Lilun Deng
Yukun Cui
Shuang Xu
Chunxia Zhang
146
4
0
03 Feb 2025
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation
Xingchen Wan
Han Zhou
Ruoxi Sun
Hootan Nakhost
Ke Jiang
Sercan Ö. Arık
ReLM
OffRL
LRM
83
4
0
01 Feb 2025
Sparse Autoencoder Insights on Voice Embeddings
Daniel Pluth
Yu Zhou
Vijay K. Gurbani
69
0
0
31 Jan 2025
CueTip: An Interactive and Explainable Physics-aware Pool Assistant
Sean Memery
Kevin Denamganai
Jiaxin Zhang
Zehai Tu
Yiwen Guo
Kartic Subr
LRM
101
0
0
30 Jan 2025
Fake News Detection After LLM Laundering: Measurement and Explanation
Rupak Kumar Das
Jonathan Dodge
196
1
0
29 Jan 2025
Extending Information Bottleneck Attribution to Video Sequences
Veronika Solopova
Lucas Schmidt
Dorothea Kolossa
80
0
0
28 Jan 2025
AI-Driven Predictive Analytics Approach for Early Prognosis of Chronic Kidney Disease Using Ensemble Learning and Explainable AI
K. M. T. Jawad
Anusha Verma
Fathi H. Amsaad
Lamia Ashraf
75
0
0
28 Jan 2025
B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Shreyash Arya
Sukrut Rao
Moritz Bohle
Bernt Schiele
187
3
0
28 Jan 2025
Previous
1
2
3
4
5
...
56
57
58
Next