Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07874
Cited By
A Unified Approach to Interpreting Model Predictions
22 May 2017
Scott M. Lundberg
Su-In Lee
FAtt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Unified Approach to Interpreting Model Predictions"
50 / 1,823 papers shown
Title
Machine Learning For An Explainable Cost Prediction of Medical Insurance
U. Orji
Elochukwu A. Ukwandu
42
31
0
23 Nov 2023
Towards Auditing Large Language Models: Improving Text-based Stereotype Detection
Wu Zekun
Sahan Bulathwela
Adriano Soares Koshiyama
38
13
0
23 Nov 2023
You Only Explain Once
David A. Kelly
Hana Chockler
Daniel Kroening
Nathan Blake
Aditi Ramaswamy
Melane Navaratnarajah
Aaditya Shivakumar
77
2
0
23 Nov 2023
A Cross Attention Approach to Diagnostic Explainability using Clinical Practice Guidelines for Depression
Sumit Dalal
Deepa Tilwani
Kaushik Roy
Manas Gaur
Sarika Jain
V. Shalin
Amit P. Sheth
52
6
0
23 Nov 2023
Labeling Neural Representations with Inverse Recognition
Kirill Bykov
Laura Kopf
Shinichi Nakajima
Marius Kloft
Marina M.-C. Höhne
BDL
81
16
0
22 Nov 2023
Explaining high-dimensional text classifiers
Odelia Melamed
Rich Caruana
37
0
0
22 Nov 2023
Pruning-Based Extraction of Descriptions from Probabilistic Circuits
Sieben Bocklandt
Vincent Derkinderen
Koen Vanderstraeten
Wouter Pijpops
Kurt Jaspers
Wannes Meert
40
0
0
22 Nov 2023
Improving performance of heart rate time series classification by grouping subjects
Michael Beekhuizen
Arman Naseri
David Tax
Ivo van der Bilt
Marcel J. T. Reinders
17
0
0
22 Nov 2023
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue
Aron Molnar
Jaap Jumelet
Mario Giulianelli
Arabella J. Sinclair
54
2
0
21 Nov 2023
Neural Network Pruning by Gradient Descent
Zhang Zhang
Ruyi Tao
Jiang Zhang
42
4
0
21 Nov 2023
InterPrompt: Interpretable Prompting for Interrelated Interpersonal Risk Factors in Reddit Posts
Msvpj Sathvik
Surjodeep Sarkar
Chandni Saxena
Sunghwan Sohn
Muskan Garg
14
1
0
21 Nov 2023
Unifying Corroborative and Contributive Attributions in Large Language Models
Theodora Worledge
Judy Hanwen Shen
Nicole Meister
Caleb Winston
Carlos Guestrin
TDI
59
10
0
20 Nov 2023
Explaining Deep Learning Models for Age-related Gait Classification based on time series acceleration
Xiaoping Zheng
Bert Otten
M. Reneman
Claudine JC. Lamoth
32
4
0
20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
73
56
0
20 Nov 2023
On the Relationship Between Interpretability and Explainability in Machine Learning
Benjamin Leblanc
Pascal Germain
FaML
67
0
0
20 Nov 2023
Designing Interpretable ML System to Enhance Trust in Healthcare: A Systematic Review to Proposed Responsible Clinician-AI-Collaboration Framework
Elham Nasarian
R. Alizadehsani
U. Acharya
Kwok-Leung Tsui
48
47
0
18 Nov 2023
RecExplainer: Aligning Large Language Models for Explaining Recommendation Models
Yuxuan Lei
Jianxun Lian
Jing Yao
Xu Huang
Defu Lian
Xing Xie
LRM
58
7
0
18 Nov 2023
A novel post-hoc explanation comparison metric and applications
Shreyan Mitra
Leilani H. Gilpin
FAtt
41
0
0
17 Nov 2023
Using Cooperative Game Theory to Prune Neural Networks
M. Diaz-Ortiz
Benjamin Kempinski
Daphne Cornelisse
Yoram Bachrach
Tal Kachman
60
2
0
17 Nov 2023
Inherently Interpretable Time Series Classification via Multiple Instance Learning
Joseph Early
Gavin K. C. Cheung
Kurt Cutajar
Hanting Xie
Jas Kandola
Niall Twomey
AI4TS
48
11
0
16 Nov 2023
Language Models (Mostly) Do Not Consider Emotion Triggers When Predicting Emotion
Smriti Singh
Cornelia Caragea
Junyi Jessy Li
45
3
0
16 Nov 2023
LymphoML: An interpretable artificial intelligence-based method identifies morphologic features that correlate with lymphoma subtype
V. Shankar
Xiaoli Yang
Vrishab Krishna
Brent Tan
Oscar Silva
...
Edward L Briercheck
D. Weinstock
Y. Natkunam
S. Fernandez-Pol
Pranav Rajpurkar
20
5
0
16 Nov 2023
Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects -- A Survey
Ashok Urlana
Pruthwik Mishra
Tathagato Roy
Rahul Mishra
54
9
0
15 Nov 2023
Model Agnostic Explainable Selective Regression via Uncertainty Estimation
Andrea Pugnana
Carlos Mougan
Dan Saattrup Nielsen
71
0
0
15 Nov 2023
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games
Kokil Jaidka
Hansin Ahuja
Lynnette Ng
97
7
0
15 Nov 2023
Explainable History Distillation by Marked Temporal Point Process
Sishun Liu
Ke Deng
Yan Wang
Xiuzhen Zhang
40
0
0
13 Nov 2023
Predicting the First Response Latency of Maintainers and Contributors in Pull Requests
SayedHassan Khatoonabadi
Ahmad Abdellatif
D. Costa
Emad Shihab
VLM
42
3
0
13 Nov 2023
The Disagreement Problem in Faithfulness Metrics
Brian Barr
Noah Fatsi
Leif Hancox-Li
Peter Richter
Daniel Proano
Caleb Mok
52
4
0
13 Nov 2023
On Measuring Faithfulness or Self-consistency of Natural Language Explanations
Letitia Parcalabescu
Anette Frank
LRM
84
24
0
13 Nov 2023
A Voting Approach for Explainable Classification with Rule Learning
Albert Nössig
Tobias Hell
Georg Moser
FAtt
14
3
0
13 Nov 2023
Explaining black boxes with a SMILE: Statistical Model-agnostic Interpretability with Local Explanations
Koorosh Aslansefat
Mojgan Hashemian
M. Walker
Mohammed Naveed Akram
Ioannis Sorokos
Y. Papadopoulos
FAtt
AAML
40
2
0
13 Nov 2023
To Transformers and Beyond: Large Language Models for the Genome
Micaela Elisa Consens
Cameron Dufault
Michael Wainberg
Duncan Forster
Mehran Karimzadeh
Hani Goodarzi
Fabian J. Theis
Alan Moses
Bo Wang
LM&MA
MedIm
36
30
0
13 Nov 2023
AGRAMPLIFIER: Defending Federated Learning Against Poisoning Attacks Through Local Update Amplification
Zirui Gong
Liyue Shen
Yanjun Zhang
Leo Yu Zhang
Jingwei Wang
Guangdong Bai
Yong Xiang
AAML
53
7
0
13 Nov 2023
Assessing the Interpretability of Programmatic Policies with Large Language Models
Zahra Bashir
Michael Bowling
Levi H. S. Lelis
ELM
94
3
0
12 Nov 2023
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives
Rojina Kashefi
Leili Barekatain
Mohammad Sabokrou
Fatemeh Aghaeipoor
ViT
56
9
0
12 Nov 2023
A Saliency-based Clustering Framework for Identifying Aberrant Predictions
A. Tersol Montserrat
Alexander R. Loftus
Yael Daihes
65
0
0
11 Nov 2023
Greedy PIG: Adaptive Integrated Gradients
Kyriakos Axiotis
Sami Abu-El-Haija
Lin Chen
Matthew Fahrbach
Gang Fu
FAtt
49
0
0
10 Nov 2023
Robust Adversarial Attacks Detection for Deep Learning based Relative Pose Estimation for Space Rendezvous
Ziwei Wang
Nabil Aouf
Jose Pizarro
Christophe Honvault
AAML
43
0
0
10 Nov 2023
Pioneering EEG Motor Imagery Classification Through Counterfactual Analysis
Kang Yin
Hye-Bin Shin
Hee-Dong Kim
Seong-Whan Lee
13
0
0
10 Nov 2023
Deep Natural Language Feature Learning for Interpretable Prediction
Felipe Urrutia
Cristian Buc
Valentin Barriere
48
2
0
09 Nov 2023
Taxonomy for Resident Space Objects in LEO: A Deep Learning Approach
Marta Guimarães
Cláudia Soares
Chiara Manfletti
6
1
0
09 Nov 2023
Accelerated Shapley Value Approximation for Data Evaluation
Lauren Watson
Zeno Kujawa
R. Andreeva
Hao-Tsung Yang
Tariq Elahi
Rik Sarkar
FAtt
FedML
TDI
44
2
0
09 Nov 2023
ABIGX: A Unified Framework for eXplainable Fault Detection and Classification
Yue Zhuo
Jinchuan Qian
Zhihuan Song
Zhiqiang Ge
33
1
0
09 Nov 2023
SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training
Rui Xu
Wenkang Qin
Peixiang Huang
Hao Wang
Lin Luo
FAtt
AAML
48
2
0
09 Nov 2023
DEMASQ: Unmasking the ChatGPT Wordsmith
Kavita Kumari
Alessandro Pegoraro
Hossein Fereidooni
Ahmad-Reza Sadeghi
DeLMO
37
4
0
08 Nov 2023
Interpreting Pretrained Language Models via Concept Bottlenecks
Zhen Tan
Lu Cheng
Song Wang
Yuan Bo
Wenlin Yao
Huan Liu
LRM
52
23
0
08 Nov 2023
The PetShop Dataset -- Finding Causes of Performance Issues across Microservices
Michaela Hardt
William Orchard
Patrick Blobaum
S. Kasiviswanathan
Elke Kirschbaum
AI4TS
35
2
0
08 Nov 2023
Explained anomaly detection in text reviews: Can subjective scenarios be correctly evaluated?
David Novoa-Paradela
O. Fontenla-Romero
Bertha Guijarro-Berdiñas
33
0
0
08 Nov 2023
Investigating the Nature of Disagreements on Mid-Scale Ratings: A Case Study on the Abstractness-Concreteness Continuum
Urban Knuplevs
Diego Frassinelli
Sabine Schulte im Walde
35
1
0
08 Nov 2023
Explainable AI for Earth Observation: Current Methods, Open Challenges, and Opportunities
G. Taşkın
E. Aptoula
Alp Ertürk
51
2
0
08 Nov 2023
Previous
1
2
3
...
31
32
33
...
35
36
37
Next