A Unified Approach to Interpreting Model Predictions

22 May 2017

Papers citing "A Unified Approach to Interpreting Model Predictions"

50 / 1,823 papers shown

Title
Machine Learning For An Explainable Cost Prediction of Medical Insurance U. Orji Elochukwu A. Ukwandu 42 31 0 23 Nov 2023
Towards Auditing Large Language Models: Improving Text-based Stereotype Detection Wu Zekun Sahan Bulathwela Adriano Soares Koshiyama 38 13 0 23 Nov 2023
You Only Explain Once David A. Kelly Hana Chockler Daniel Kroening Nathan Blake Aditi Ramaswamy Melane Navaratnarajah Aaditya Shivakumar 77 2 0 23 Nov 2023
A Cross Attention Approach to Diagnostic Explainability using Clinical Practice Guidelines for Depression Sumit Dalal Deepa Tilwani Kaushik Roy Manas Gaur Sarika Jain V. Shalin Amit P. Sheth 52 6 0 23 Nov 2023
Labeling Neural Representations with Inverse Recognition Kirill Bykov Laura Kopf Shinichi Nakajima Marius Kloft Marina M.-C. Höhne BDL 81 16 0 22 Nov 2023
Explaining high-dimensional text classifiers Odelia Melamed Rich Caruana 37 0 0 22 Nov 2023
Pruning-Based Extraction of Descriptions from Probabilistic Circuits Sieben Bocklandt Vincent Derkinderen Koen Vanderstraeten Wouter Pijpops Kurt Jaspers Wannes Meert 40 0 0 22 Nov 2023
Improving performance of heart rate time series classification by grouping subjects Michael Beekhuizen Arman Naseri David Tax Ivo van der Bilt Marcel J. T. Reinders 17 0 0 22 Nov 2023
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue Aron Molnar Jaap Jumelet Mario Giulianelli Arabella J. Sinclair 54 2 0 21 Nov 2023
Neural Network Pruning by Gradient Descent Zhang Zhang Ruyi Tao Jiang Zhang 42 4 0 21 Nov 2023
InterPrompt: Interpretable Prompting for Interrelated Interpersonal Risk Factors in Reddit Posts Msvpj Sathvik Surjodeep Sarkar Chandni Saxena Sunghwan Sohn Muskan Garg 14 1 0 21 Nov 2023
Unifying Corroborative and Contributive Attributions in Large Language Models Theodora Worledge Judy Hanwen Shen Nicole Meister Caleb Winston Carlos Guestrin TDI 59 10 0 20 Nov 2023
Explaining Deep Learning Models for Age-related Gait Classification based on time series acceleration Xiaoping Zheng Bert Otten M. Reneman Claudine JC. Lamoth 32 4 0 20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents Zhuosheng Zhang Yao Yao Aston Zhang Xiangru Tang Xinbei Ma ... Yiming Wang Mark B. Gerstein Rui Wang Gongshen Liu Hai Zhao LLMAG LM&Ro LRM 73 56 0 20 Nov 2023
On the Relationship Between Interpretability and Explainability in Machine Learning Benjamin Leblanc Pascal Germain FaML 67 0 0 20 Nov 2023
Designing Interpretable ML System to Enhance Trust in Healthcare: A Systematic Review to Proposed Responsible Clinician-AI-Collaboration Framework Elham Nasarian R. Alizadehsani U. Acharya Kwok-Leung Tsui 48 47 0 18 Nov 2023
RecExplainer: Aligning Large Language Models for Explaining Recommendation Models Yuxuan Lei Jianxun Lian Jing Yao Xu Huang Defu Lian Xing Xie LRM 58 7 0 18 Nov 2023
A novel post-hoc explanation comparison metric and applications Shreyan Mitra Leilani H. Gilpin FAtt 41 0 0 17 Nov 2023
Using Cooperative Game Theory to Prune Neural Networks M. Diaz-Ortiz Benjamin Kempinski Daphne Cornelisse Yoram Bachrach Tal Kachman 60 2 0 17 Nov 2023
Inherently Interpretable Time Series Classification via Multiple Instance Learning Joseph Early Gavin K. C. Cheung Kurt Cutajar Hanting Xie Jas Kandola Niall Twomey AI4TS 48 11 0 16 Nov 2023
Language Models (Mostly) Do Not Consider Emotion Triggers When Predicting Emotion Smriti Singh Cornelia Caragea Junyi Jessy Li 45 3 0 16 Nov 2023
LymphoML: An interpretable artificial intelligence-based method identifies morphologic features that correlate with lymphoma subtype V. Shankar Xiaoli Yang Vrishab Krishna Brent Tan Oscar Silva ... Edward L Briercheck D. Weinstock Y. Natkunam S. Fernandez-Pol Pranav Rajpurkar 20 5 0 16 Nov 2023
Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects -- A Survey Ashok Urlana Pruthwik Mishra Tathagato Roy Rahul Mishra 54 9 0 15 Nov 2023
Model Agnostic Explainable Selective Regression via Uncertainty Estimation Andrea Pugnana Carlos Mougan Dan Saattrup Nielsen 71 0 0 15 Nov 2023
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games Kokil Jaidka Hansin Ahuja Lynnette Ng 97 7 0 15 Nov 2023
Explainable History Distillation by Marked Temporal Point Process Sishun Liu Ke Deng Yan Wang Xiuzhen Zhang 40 0 0 13 Nov 2023
Predicting the First Response Latency of Maintainers and Contributors in Pull Requests SayedHassan Khatoonabadi Ahmad Abdellatif D. Costa Emad Shihab VLM 42 3 0 13 Nov 2023
The Disagreement Problem in Faithfulness Metrics Brian Barr Noah Fatsi Leif Hancox-Li Peter Richter Daniel Proano Caleb Mok 52 4 0 13 Nov 2023
On Measuring Faithfulness or Self-consistency of Natural Language Explanations Letitia Parcalabescu Anette Frank LRM 84 24 0 13 Nov 2023
A Voting Approach for Explainable Classification with Rule Learning Albert Nössig Tobias Hell Georg Moser FAtt 14 3 0 13 Nov 2023
Explaining black boxes with a SMILE: Statistical Model-agnostic Interpretability with Local Explanations Koorosh Aslansefat Mojgan Hashemian M. Walker Mohammed Naveed Akram Ioannis Sorokos Y. Papadopoulos FAtt AAML 40 2 0 13 Nov 2023
To Transformers and Beyond: Large Language Models for the Genome Micaela Elisa Consens Cameron Dufault Michael Wainberg Duncan Forster Mehran Karimzadeh Hani Goodarzi Fabian J. Theis Alan Moses Bo Wang LM&MA MedIm 36 30 0 13 Nov 2023
AGRAMPLIFIER: Defending Federated Learning Against Poisoning Attacks Through Local Update Amplification Zirui Gong Liyue Shen Yanjun Zhang Leo Yu Zhang Jingwei Wang Guangdong Bai Yong Xiang AAML 53 7 0 13 Nov 2023
Assessing the Interpretability of Programmatic Policies with Large Language Models Zahra Bashir Michael Bowling Levi H. S. Lelis ELM 94 3 0 12 Nov 2023
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives Rojina Kashefi Leili Barekatain Mohammad Sabokrou Fatemeh Aghaeipoor ViT 56 9 0 12 Nov 2023
A Saliency-based Clustering Framework for Identifying Aberrant Predictions A. Tersol Montserrat Alexander R. Loftus Yael Daihes 65 0 0 11 Nov 2023
Greedy PIG: Adaptive Integrated Gradients Kyriakos Axiotis Sami Abu-El-Haija Lin Chen Matthew Fahrbach Gang Fu FAtt 49 0 0 10 Nov 2023
Robust Adversarial Attacks Detection for Deep Learning based Relative Pose Estimation for Space Rendezvous Ziwei Wang Nabil Aouf Jose Pizarro Christophe Honvault AAML 43 0 0 10 Nov 2023
Pioneering EEG Motor Imagery Classification Through Counterfactual Analysis Kang Yin Hye-Bin Shin Hee-Dong Kim Seong-Whan Lee 13 0 0 10 Nov 2023
Deep Natural Language Feature Learning for Interpretable Prediction Felipe Urrutia Cristian Buc Valentin Barriere 48 2 0 09 Nov 2023
Taxonomy for Resident Space Objects in LEO: A Deep Learning Approach Marta Guimarães Cláudia Soares Chiara Manfletti 6 1 0 09 Nov 2023
Accelerated Shapley Value Approximation for Data Evaluation Lauren Watson Zeno Kujawa R. Andreeva Hao-Tsung Yang Tariq Elahi Rik Sarkar FAtt FedML TDI 44 2 0 09 Nov 2023
ABIGX: A Unified Framework for eXplainable Fault Detection and Classification Yue Zhuo Jinchuan Qian Zhihuan Song Zhiqiang Ge 33 1 0 09 Nov 2023
SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training Rui Xu Wenkang Qin Peixiang Huang Hao Wang Lin Luo FAtt AAML 48 2 0 09 Nov 2023
DEMASQ: Unmasking the ChatGPT Wordsmith Kavita Kumari Alessandro Pegoraro Hossein Fereidooni Ahmad-Reza Sadeghi DeLMO 37 4 0 08 Nov 2023
Interpreting Pretrained Language Models via Concept Bottlenecks Zhen Tan Lu Cheng Song Wang Yuan Bo Wenlin Yao Huan Liu LRM 52 23 0 08 Nov 2023
The PetShop Dataset -- Finding Causes of Performance Issues across Microservices Michaela Hardt William Orchard Patrick Blobaum S. Kasiviswanathan Elke Kirschbaum AI4TS 35 2 0 08 Nov 2023
Explained anomaly detection in text reviews: Can subjective scenarios be correctly evaluated? David Novoa-Paradela O. Fontenla-Romero Bertha Guijarro-Berdiñas 33 0 0 08 Nov 2023
Investigating the Nature of Disagreements on Mid-Scale Ratings: A Case Study on the Abstractness-Concreteness Continuum Urban Knuplevs Diego Frassinelli Sabine Schulte im Walde 35 1 0 08 Nov 2023
Explainable AI for Earth Observation: Current Methods, Open Challenges, and Opportunities G. Taşkın E. Aptoula Alp Ertürk 51 2 0 08 Nov 2023