Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.05796
Cited By
Network Dissection: Quantifying Interpretability of Deep Visual Representations
19 April 2017
David Bau
Bolei Zhou
A. Khosla
A. Oliva
Antonio Torralba
MILM
FAtt
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Network Dissection: Quantifying Interpretability of Deep Visual Representations"
50 / 787 papers shown
Title
CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models
Teodor Chiaburu
Frank Haußer
Felix Bießmann
75
4
0
23 Apr 2024
Automatic Discovery of Visual Circuits
Achyuta Rajaram
Neil Chowdhury
Antonio Torralba
Jacob Andreas
Sarah Schwettmann
GNN
70
4
0
22 Apr 2024
A Multimodal Automated Interpretability Agent
Tamar Rott Shaham
Sarah Schwettmann
Franklin Wang
Achyuta Rajaram
Evan Hernandez
Jacob Andreas
Antonio Torralba
221
28
0
22 Apr 2024
Uncovering Safety Risks of Large Language Models through Concept Activation Vector
Zhihao Xu
Ruixuan Huang
Changyu Chen
Shuai Wang
Xiting Wang
LLMSV
101
27
0
18 Apr 2024
Toward Understanding the Disagreement Problem in Neural Network Feature Attribution
Niklas Koenen
Marvin N. Wright
FAtt
81
5
0
17 Apr 2024
Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes
Ivica Obadic
Alex Levering
Lars Pennig
Dario Augusto Borges Oliveira
Diego Marcos
Xiaoxiang Zhu
72
0
0
15 Apr 2024
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Binghua Li
Jie Mao
Zhe Sun
Chao Li
Qibin Zhao
Toshihisa Tanaka
38
0
0
13 Apr 2024
Knowledge graphs for empirical concept retrieval
Lenka Tětková
Teresa Scheidt
Maria Mandrup Fogh
Ellen Marie Gaunby Jorgensen
F. Nielsen
Lars Kai Hansen
64
2
0
10 Apr 2024
Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models
M. Kowal
Richard P. Wildes
Konstantinos G. Derpanis
GNN
106
8
0
02 Apr 2024
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Chenshuang Zhang
Fei Pan
Junmo Kim
In So Kweon
Chengzhi Mao
85
11
1
27 Mar 2024
Multi-scale Unified Network for Image Classification
Wenzhuo Liu
Fei Zhu
Cheng-Lin Liu
CVBM
96
0
0
27 Mar 2024
The Anatomy of Adversarial Attacks: Concept-based XAI Dissection
Georgii Mikriukov
Gesina Schwalbe
Franz Motzkus
Korinna Bade
AAML
77
1
0
25 Mar 2024
Ensemble Adversarial Defense via Integration of Multiple Dispersed Low Curvature Models
Kaikang Zhao
Xi Chen
Wei Huang
Liuxin Ding
Xianglong Kong
Fan Zhang
AAML
75
1
0
25 Mar 2024
Interpretable Modeling of Deep Reinforcement Learning Driven Scheduling
Boyang Li
Zhiling Lan
M. Papka
OffRL
49
0
0
24 Mar 2024
Insights into the Lottery Ticket Hypothesis and Iterative Magnitude Pruning
Tausifa Jan Saleem
Ramanjit Ahuja
Surendra Prasad
Brejesh Lall
95
0
0
22 Mar 2024
A survey on Concept-based Approaches For Model Improvement
Avani Gupta
P. J. Narayanan
LRM
79
5
0
21 Mar 2024
What Does Evaluation of Explainable Artificial Intelligence Actually Tell Us? A Case for Compositional and Contextual Validation of XAI Building Blocks
Kacper Sokol
Julia E. Vogt
86
12
0
19 Mar 2024
DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation
Qilong Zhao
Yifei Zhang
Mengdan Zhu
Siyi Gu
Yuyang Gao
Xiaofeng Yang
Liang Zhao
MedIm
114
2
0
16 Mar 2024
Interpretable Machine Learning for Survival Analysis
Sophie Hanna Langbein
Mateusz Krzyzinski
Mikolaj Spytek
Hubert Baniecki
P. Biecek
Marvin N. Wright
85
2
0
15 Mar 2024
HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers
Francesco Dibitonto
Fabio Garcea
Andre' Panisson
Alan Perotti
Lia Morra
AAML
52
0
0
13 Mar 2024
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Michael Toker
Hadas Orgad
Mor Ventura
Dana Arad
Yonatan Belinkov
DiffM
92
13
0
09 Mar 2024
On the Origins of Linear Representations in Large Language Models
Yibo Jiang
Goutham Rajendran
Pradeep Ravikumar
Bryon Aragam
Victor Veitch
113
32
0
06 Mar 2024
Resilience of Entropy Model in Distributed Neural Networks
Milin Zhang
Mohammad Abdi
Shahriar Rifat
Francesco Restuccia
AAML
83
0
0
01 Mar 2024
WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by Interpretation of Neuron Concepts
Yong Hyun Ahn
Hyeon Bae Kim
Seong Tae Kim
67
6
0
29 Feb 2024
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)
Usha Bhalla
Alexander X. Oesterling
Suraj Srinivas
Flavio du Pin Calmon
Himabindu Lakkaraju
125
44
0
16 Feb 2024
Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion
Edgar Heinert
Matthias Rottmann
Kira Maag
Karsten Kahl
65
6
0
14 Feb 2024
Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models
Goutham Rajendran
Simon Buchholz
Bryon Aragam
Bernhard Schölkopf
Pradeep Ravikumar
AI4CE
175
23
0
14 Feb 2024
Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review
Anton Kuznietsov
Bálint Gyevnár
Cheng Wang
Steven Peters
Stefano V. Albrecht
XAI
86
35
0
08 Feb 2024
Towards Generating Informative Textual Description for Neurons in Language Models
Shrayani Mondal
Rishabh Garodia
Arbaaz Qureshi
Taesung Lee
Youngja Park
MILM
56
0
0
30 Jan 2024
Defining and Extracting generalizable interaction primitives from DNNs
Lu Chen
Siyu Lou
Benhao Huang
Quanshi Zhang
96
12
0
29 Jan 2024
Knowledge-Aware Neuron Interpretation for Scene Classification
Yong Guan
Freddy Lecue
Jiaoyan Chen
Ru Li
Jeff Z. Pan
51
1
0
29 Jan 2024
GOAt: Explaining Graph Neural Networks via Graph Output Attribution
Shengyao Lu
Keith G. Mills
Jiao He
Bang Liu
Di Niu
FAtt
88
9
0
26 Jan 2024
Unveiling the Unseen: Identifiable Clusters in Trained Depthwise Convolutional Kernels
Z. Babaiee
Peyman M. Kiasari
Daniela Rus
Radu Grosu
70
1
0
25 Jan 2024
Interactive Mars Image Content-Based Search with Interpretable Machine Learning
Bhavan Kumar Vasu
Steven Lu
Emily Dunkel
K. Wagstaff
Kevin Grimes
Michael McAuley
44
0
0
19 Jan 2024
Understanding Video Transformers via Universal Concept Discovery
M. Kowal
Achal Dave
Rares Andrei Ambrus
Adrien Gaidon
Konstantinos G. Derpanis
P. Tokmakov
ViT
128
12
0
19 Jan 2024
Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions
Namitha Padmanabhan
M. Gwilliam
Pulkit Kumar
Shishira R. Maiya
Max Ehrlich
Abhinav Shrivastava
100
2
1
18 Jan 2024
XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection
Tobias Clement
Truong Thanh Hung Nguyen
Mohamed Abdelaal
Hung Cao
29
1
0
18 Jan 2024
Manipulating Feature Visualizations with Gradient Slingshots
Dilyara Bareeva
Marina M.-C. Höhne
Alexander Warnecke
Lukas Pirch
Klaus-Robert Müller
Konrad Rieck
Sebastian Lapuschkin
Kirill Bykov
AAML
76
6
0
11 Jan 2024
Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective
Haoyi Xiong
Xuhong Li
Xiaofei Zhang
Jiamin Chen
Xinhao Sun
Yuchen Li
Zeyi Sun
Jundong Li
XAI
140
9
0
09 Jan 2024
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning
Alfirsa Damasyifa Fauzulhaq
Wahyu Parwitayasa
Joseph A. Sugihdharma
M. F. Ridhani
N. Yudistira
74
0
0
05 Jan 2024
Fast gradient-free activation maximization for neurons in spiking neural networks
N. Pospelov
Andrei Chertkov
Maxim Beketov
Ivan Oseledets
Konstantin Anokhin
62
2
0
28 Dec 2023
Understanding Distributed Representations of Concepts in Deep Neural Networks without Supervision
Wonjoon Chang
Dahee Kwon
Jaesik Choi
64
1
0
28 Dec 2023
Q-SENN: Quantized Self-Explaining Neural Networks
Thomas Norrenbrock
Marco Rudolph
Bodo Rosenhahn
FAtt
AAML
MILM
104
7
0
21 Dec 2023
Concept-based Explainable Artificial Intelligence: A Survey
Eleonora Poeta
Gabriele Ciravegna
Eliana Pastor
Tania Cerquitelli
Elena Baralis
LRM
XAI
110
56
0
20 Dec 2023
Successor Heads: Recurring, Interpretable Attention Heads In The Wild
Rhys Gould
Euan Ong
George Ogden
Arthur Conmy
LRM
44
52
0
14 Dec 2023
Estimation of Concept Explanations Should be Uncertainty Aware
Vihari Piratla
Juyeon Heo
Katherine M. Collins
Sukriti Singh
Adrian Weller
69
1
0
13 Dec 2023
FM-G-CAM: A Holistic Approach for Explainable AI in Computer Vision
Ravidu Suien Rammuni Silva
Jordan J. Bird
FAtt
54
1
0
10 Dec 2023
Artificial Neural Nets and the Representation of Human Concepts
Timo Freiesleben
NAI
74
1
0
08 Dec 2023
Conceptualizing the Relationship between AI Explanations and User Agency
Iyadunni Adenuga
Jonathan Dodge
64
2
0
05 Dec 2023
TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models
Aditya Chinchure
Pushkar Shukla
Gaurav Bhatt
Kiri Salij
K. Hosanagar
Leonid Sigal
Matthew Turk
92
29
0
03 Dec 2023
Previous
1
2
3
4
5
6
...
14
15
16
Next