v1v2 (latest)

Axiomatic Attribution for Deep Networks

4 March 2017

Ankur Taly

Papers citing "Axiomatic Attribution for Deep Networks"

50 / 2,871 papers shown

Title
FW-Shapley: Real-time Estimation of Weighted Shapley Values Pranoy Panda Siddharth Tandon V. Balasubramanian TDI 155 1 0 09 Mar 2025
Interpretable High-order Knowledge Graph Neural Network for Predicting Synthetic Lethality in Human Cancers Xuexin Chen Ruichu Cai Zhengting Huang Zijian Li Jie Zheng Min Wu 104 0 0 08 Mar 2025
MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model Miguel Contreras Jessica Sena Andrea Davidson Jiaqing Zhang T. Ozrazgat-Baslanti ... Jeremy A. Balch Tyler J. Loftus Subhash Nerella A. Bihorac Parisa Rashidi 151 0 0 08 Mar 2025
Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations Eren Erogullari Sebastian Lapuschkin Wojciech Samek Frederik Pahde LLMSV CoGe 96 0 0 07 Mar 2025
Towards Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients Niklas Penzel Joachim Denzler FAtt 92 0 0 07 Mar 2025
A Unified Framework with Novel Metrics for Evaluating the Effectiveness of XAI Techniques in LLMs Melkamu Mersha Mesay Gemeda Yigezu Hassan Shakil Ali Al shami SangHyun Byun Jugal Kalita 222 2 0 06 Mar 2025
Enhancing Network Security Management in Water Systems using FM-based Attack Attribution Aleksandar Avdalovic Joseph Khoury Ahmad Taha E. Bou-Harb AAML 77 1 0 03 Mar 2025
Riemannian Integrated Gradients: A Geometric View of Explainable AI Federico Costanza Lachlan Simpson 88 0 0 02 Mar 2025
Foundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction Wenrui Fan L. M. Riza Rizky Jiayang Zhang Chen Chen Haiping Lu Kevin Teh Dinesh Selvarajah Shuo Zhou 92 0 0 28 Feb 2025
Enhancing Explainability with Multimodal Context Representations for Smarter Robots Anargh Viswanath Lokesh Veeramacheneni Hendrik Buschmeier 66 0 0 28 Feb 2025
Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps Lukasz Sztukiewicz Ignacy Stepka Michał Wiliński Jerzy Stefanowski 146 0 0 28 Feb 2025
FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients Leming Shen Qiang Yang Kaiyan Cui Yuanqing Zheng Xiao-Yong Wei Jianwei Liu Jinsong Han FedML 286 11 0 28 Feb 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification Thomas Norrenbrock Timo Kaiser Sovan Biswas R. Manuvinakurike Bodo Rosenhahn 158 0 0 27 Feb 2025
Interpreting CLIP with Hierarchical Sparse Autoencoders Vladimir Zaigrajew Hubert Baniecki P. Biecek 260 1 0 27 Feb 2025
Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models Itay Benou Tammy Riklin-Raviv 170 1 0 27 Feb 2025
Models That Are Interpretable But Not Transparent Chudi Zhong Panyu Chen Cynthia Rudin AAML 105 0 0 26 Feb 2025
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models Zihao Li Ruixiang Tang Lu Cheng Shuaiqiang Wang D. Yin Jundong Li 154 0 0 25 Feb 2025
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint Qianli Ma Dongrui Liu Qian Chen Linfeng Zhang Jing Shao MoMe 456 2 0 24 Feb 2025
Class-Dependent Perturbation Effects in Evaluating Time Series Attributions Gregor Baer Isel Grau Chao Zhang Pieter Van Gorp AAML 122 1 0 24 Feb 2025
Interpretable Retinal Disease Prediction Using Biology-Informed Heterogeneous Graph Representations Laurin Lux Alexander H. Berger Maria Romeo Tricas Alaa E. Fayed Siyang Song Linus Kreitner Jonas Weidner Fernando Navarro Daniel Rueckert Johannes C. Paetzold 86 2 0 23 Feb 2025
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions Tue Cao Nhat X. Hoang Hieu H. Pham P. Nguyen My T. Thai 247 1 0 22 Feb 2025
SALTY: Explainable Artificial Intelligence Guided Structural Analysis for Hardware Trojan Detection Tanzim Mahfuz Pravin Gaikwad Tasneem Suha Swarup Bhunia Prabuddha Chakraborty 73 0 0 21 Feb 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models L. Arras Bruno Puri Patrick Kahardipraja Sebastian Lapuschkin Wojciech Samek 98 3 0 21 Feb 2025
Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness Weisong Sun Yuchen Chen Mengzhe Yuan Chunrong Fang Zhenpeng Chen Chong Wang Yang Liu Baowen Xu Zhenyu Chen AAML 86 1 0 20 Feb 2025
SPEX: Scaling Feature Interaction Explanations for LLMs Justin Singh Kang Landon Butler Abhineet Agarwal Yigit Efe Erginbas Ramtin Pedarsani Kannan Ramchandran Bin Yu VLM LRM 167 2 0 20 Feb 2025
Revisiting the Generalization Problem of Low-level Vision Models Through the Lens of Image Deraining Jinfan Hu Zhiyuan You Jinjin Gu Kaiwen Zhu Tianfan Xue Chao Dong 121 0 0 18 Feb 2025
From Abstract to Actionable: Pairwise Shapley Values for Explainable AI Jiaxin Xu Hung Chau Angela Burden TDI 119 0 0 18 Feb 2025
Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models Thomas Fel Ekdeep Singh Lubana Jacob S. Prince M. Kowal Victor Boutin Isabel Papadimitriou Binxu Wang Martin Wattenberg Demba Ba Talia Konkle 76 8 0 18 Feb 2025
Error-controlled non-additive interaction discovery in machine learning models Winston Chen Yifan Jiang William Stafford Noble Yang Young Lu 136 1 0 17 Feb 2025
Mechanistic Unveiling of Transformer Circuits: Self-Influence as a Key to Model Reasoning Lefei Zhang Lijie Hu Di Wang LRM 205 5 0 17 Feb 2025
Suboptimal Shapley Value Explanations Xiaolei Lu FAtt 97 0 0 17 Feb 2025
Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks Jon Vadillo Roberto Santana J. A. Lozano Marta Z. Kwiatkowska BDL AAML 149 0 0 17 Feb 2025
Using the Path of Least Resistance to Explain Deep Networks Sina Salek Joseph Enguehard FAtt 73 0 0 17 Feb 2025
Time-series attribution maps with regularized contrastive learning Steffen Schneider Rodrigo González Laiz Anastasiia Filippova Markus Frey Mackenzie W. Mathis BDL FAtt CML AI4TS 114 1 0 17 Feb 2025
Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability Zhiyu Zhu Zhibo Jin Jiayu Zhang Nan Yang Jiahao Huang Jianlong Zhou Fang Chen 102 0 0 16 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow Behrooz Azarkhalili Maxwell Libbrecht 81 0 0 14 Feb 2025
Recent Advances in Malware Detection: Graph Learning and Explainability Hossein Shokouhinejad Roozbeh Razavi-Far Hesamodin Mohammadian Mahdi Rabbani Samuel Ansong Griffin Higgins Ali Ghorbani AAML 143 2 0 14 Feb 2025
Towards Transparent and Accurate Plasma State Monitoring at JET Andrin Bürli Alessandro Pau Thomas Koller Olivier Sauter JET Contributors 92 2 0 14 Feb 2025
Applying Deep Learning to Ads Conversion Prediction in Last Mile Delivery Marketplace Di Li Xiaochang Miao Huiyu Song Chao Chu Hao Xu Mandar Rahurkar 54 0 0 14 Feb 2025
Brain-Inspired Exploration of Functional Networks and Key Neurons in Large Language Models Yiheng Liu Xiaohui Gao Haiyang Sun Bao Ge Tianming Liu Junwei Han X. Hu 88 2 0 13 Feb 2025
Survey on Recent Progress of AI for Chemistry: Methods, Applications, and Opportunities Ding Hu Pengxiang Hua Zhen Huang 261 0 0 09 Feb 2025
Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment Harrish Thasarathan Julian Forsyth Thomas Fel M. Kowal Konstantinos G. Derpanis 146 10 0 06 Feb 2025
Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis Haowen Bai Zixiang Zhao Jiangshe Zhang Baisong Jiang Lilun Deng Yukun Cui Shuang Xu Chunxia Zhang 146 4 0 03 Feb 2025
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation Xingchen Wan Han Zhou Ruoxi Sun Hootan Nakhost Ke Jiang Sercan Ö. Arık ReLM OffRL LRM 83 4 0 01 Feb 2025
Sparse Autoencoder Insights on Voice Embeddings Daniel Pluth Yu Zhou Vijay K. Gurbani 69 0 0 31 Jan 2025
CueTip: An Interactive and Explainable Physics-aware Pool Assistant Sean Memery Kevin Denamganai Jiaxin Zhang Zehai Tu Yiwen Guo Kartic Subr LRM 101 0 0 30 Jan 2025
Fake News Detection After LLM Laundering: Measurement and Explanation Rupak Kumar Das Jonathan Dodge 196 1 0 29 Jan 2025
Extending Information Bottleneck Attribution to Video Sequences Veronika Solopova Lucas Schmidt Dorothea Kolossa 80 0 0 28 Jan 2025
AI-Driven Predictive Analytics Approach for Early Prognosis of Chronic Kidney Disease Using Ensemble Learning and Explainable AI K. M. T. Jawad Anusha Verma Fathi H. Amsaad Lamia Ashraf 75 0 0 28 Jan 2025
B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable Shreyash Arya Sukrut Rao Moritz Bohle Bernt Schiele 187 3 0 28 Jan 2025