Network Dissection: Quantifying Interpretability of Deep Visual Representations

19 April 2017

Antonio Torralba

Papers citing "Network Dissection: Quantifying Interpretability of Deep Visual Representations"

50 / 787 papers shown

CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models Teodor Chiaburu Frank Haußer Felix Bießmann 75 4 0 23 Apr 2024
Automatic Discovery of Visual Circuits Achyuta Rajaram Neil Chowdhury Antonio Torralba Jacob Andreas Sarah Schwettmann GNN 70 4 0 22 Apr 2024
A Multimodal Automated Interpretability Agent Tamar Rott Shaham Sarah Schwettmann Franklin Wang Achyuta Rajaram Evan Hernandez Jacob Andreas Antonio Torralba 221 28 0 22 Apr 2024
Uncovering Safety Risks of Large Language Models through Concept Activation Vector Zhihao Xu Ruixuan Huang Changyu Chen Shuai Wang Xiting Wang LLMSV 101 27 0 18 Apr 2024
Toward Understanding the Disagreement Problem in Neural Network Feature Attribution Niklas Koenen Marvin N. Wright FAtt 81 5 0 17 Apr 2024
Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes Ivica Obadic Alex Levering Lars Pennig Dario Augusto Borges Oliveira Diego Marcos Xiaoxiang Zhu 72 0 0 15 Apr 2024
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification Binghua Li Jie Mao Zhe Sun Chao Li Qibin Zhao Toshihisa Tanaka 38 0 0 13 Apr 2024
Knowledge graphs for empirical concept retrieval Lenka Tětková Teresa Scheidt Maria Mandrup Fogh Ellen Marie Gaunby Jorgensen F. Nielsen Lars Kai Hansen 64 2 0 10 Apr 2024
Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models M. Kowal Richard P. Wildes Konstantinos G. Derpanis GNN 106 8 0 02 Apr 2024
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang Fei Pan Junmo Kim In So Kweon Chengzhi Mao 85 11 1 27 Mar 2024
Multi-scale Unified Network for Image Classification Wenzhuo Liu Fei Zhu Cheng-Lin Liu CVBM 96 0 0 27 Mar 2024
The Anatomy of Adversarial Attacks: Concept-based XAI Dissection Georgii Mikriukov Gesina Schwalbe Franz Motzkus Korinna Bade AAML 77 1 0 25 Mar 2024
Ensemble Adversarial Defense via Integration of Multiple Dispersed Low Curvature Models Kaikang Zhao Xi Chen Wei Huang Liuxin Ding Xianglong Kong Fan Zhang AAML 75 1 0 25 Mar 2024
Interpretable Modeling of Deep Reinforcement Learning Driven Scheduling Boyang Li Zhiling Lan M. Papka OffRL 49 0 0 24 Mar 2024
Insights into the Lottery Ticket Hypothesis and Iterative Magnitude Pruning Tausifa Jan Saleem Ramanjit Ahuja Surendra Prasad Brejesh Lall 95 0 0 22 Mar 2024
A survey on Concept-based Approaches For Model Improvement Avani Gupta P. J. Narayanan LRM 79 5 0 21 Mar 2024
What Does Evaluation of Explainable Artificial Intelligence Actually Tell Us? A Case for Compositional and Contextual Validation of XAI Building Blocks Kacper Sokol Julia E. Vogt 86 12 0 19 Mar 2024
DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation Qilong Zhao Yifei Zhang Mengdan Zhu Siyi Gu Yuyang Gao Xiaofeng Yang Liang Zhao MedIm 114 2 0 16 Mar 2024
Interpretable Machine Learning for Survival Analysis Sophie Hanna Langbein Mateusz Krzyzinski Mikolaj Spytek Hubert Baniecki P. Biecek Marvin N. Wright 85 2 0 15 Mar 2024
HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers Francesco Dibitonto Fabio Garcea André Panisson Alan Perotti Lia Morra AAML 52 0 0 13 Mar 2024
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines Michael Toker Hadas Orgad Mor Ventura Dana Arad Yonatan Belinkov DiffM 92 13 0 09 Mar 2024
On the Origins of Linear Representations in Large Language Models Yibo Jiang Goutham Rajendran Pradeep Ravikumar Bryon Aragam Victor Veitch 113 32 0 06 Mar 2024
Resilience of Entropy Model in Distributed Neural Networks Milin Zhang Mohammad Abdi Shahriar Rifat Francesco Restuccia AAML 83 0 0 01 Mar 2024
WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by Interpretation of Neuron Concepts Yong Hyun Ahn Hyeon Bae Kim Seong Tae Kim 67 6 0 29 Feb 2024
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) Usha Bhalla Alexander X. Oesterling Suraj Srinivas Flavio du Pin Calmon Himabindu Lakkaraju 125 44 0 16 Feb 2024
Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion Edgar Heinert Matthias Rottmann Kira Maag Karsten Kahl 65 6 0 14 Feb 2024
Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models Goutham Rajendran Simon Buchholz Bryon Aragam Bernhard Schölkopf Pradeep Ravikumar AI4CE 175 23 0 14 Feb 2024
Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review Anton Kuznietsov Bálint Gyevnár Cheng Wang Steven Peters Stefano V. Albrecht XAI 86 35 0 08 Feb 2024
Towards Generating Informative Textual Description for Neurons in Language Models Shrayani Mondal Rishabh Garodia Arbaaz Qureshi Taesung Lee Youngja Park MILM 56 0 0 30 Jan 2024
Defining and Extracting generalizable interaction primitives from DNNs Lu Chen Siyu Lou Benhao Huang Quanshi Zhang 96 12 0 29 Jan 2024
Knowledge-Aware Neuron Interpretation for Scene Classification Yong Guan Freddy Lecue Jiaoyan Chen Ru Li Jeff Z. Pan 51 1 0 29 Jan 2024
GOAt: Explaining Graph Neural Networks via Graph Output Attribution Shengyao Lu Keith G. Mills Jiao He Bang Liu Di Niu FAtt 88 9 0 26 Jan 2024
Unveiling the Unseen: Identifiable Clusters in Trained Depthwise Convolutional Kernels Z. Babaiee Peyman M. Kiasari Daniela Rus Radu Grosu 72 1 0 25 Jan 2024
Interactive Mars Image Content-Based Search with Interpretable Machine Learning Bhavan Kumar Vasu Steven Lu Emily Dunkel K. Wagstaff Kevin Grimes Michael McAuley 49 0 0 19 Jan 2024
Understanding Video Transformers via Universal Concept Discovery M. Kowal Achal Dave Rares Andrei Ambrus Adrien Gaidon Konstantinos G. Derpanis P. Tokmakov ViT 128 12 0 19 Jan 2024
Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions Namitha Padmanabhan M. Gwilliam Pulkit Kumar Shishira R. Maiya Max Ehrlich Abhinav Shrivastava 100 2 1 18 Jan 2024
XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection Tobias Clement Truong Thanh Hung Nguyen Mohamed Abdelaal Hung Cao 29 1 0 18 Jan 2024
Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva Marina M.-C. Höhne Alexander Warnecke Lukas Pirch Klaus-Robert Müller Konrad Rieck Sebastian Lapuschkin Kirill Bykov AAML 76 6 0 11 Jan 2024
Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective Haoyi Xiong Xuhong Li Xiaofei Zhang Jiamin Chen Xinhao Sun Yuchen Li Zeyi Sun Jundong Li XAI 140 9 0 09 Jan 2024
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning Alfirsa Damasyifa Fauzulhaq Wahyu Parwitayasa Joseph A. Sugihdharma M. F. Ridhani N. Yudistira 74 0 0 05 Jan 2024
Fast gradient-free activation maximization for neurons in spiking neural networks N. Pospelov Andrei Chertkov Maxim Beketov Ivan Oseledets Konstantin Anokhin 62 2 0 28 Dec 2023
Understanding Distributed Representations of Concepts in Deep Neural Networks without Supervision Wonjoon Chang Dahee Kwon Jaesik Choi 64 1 0 28 Dec 2023
Q-SENN: Quantized Self-Explaining Neural Networks Thomas Norrenbrock Marco Rudolph Bodo Rosenhahn FAtt AAML MILM 104 7 0 21 Dec 2023
Concept-based Explainable Artificial Intelligence: A Survey Eleonora Poeta Gabriele Ciravegna Eliana Pastor Tania Cerquitelli Elena Baralis LRM XAI 110 56 0 20 Dec 2023
Successor Heads: Recurring, Interpretable Attention Heads In The Wild Rhys Gould Euan Ong George Ogden Arthur Conmy LRM 44 52 0 14 Dec 2023
Estimation of Concept Explanations Should be Uncertainty Aware Vihari Piratla Juyeon Heo Katherine M. Collins Sukriti Singh Adrian Weller 69 1 0 13 Dec 2023
FM-G-CAM: A Holistic Approach for Explainable AI in Computer Vision Ravidu Suien Rammuni Silva Jordan J. Bird FAtt 59 1 0 10 Dec 2023
Artificial Neural Nets and the Representation of Human Concepts Timo Freiesleben NAI 74 1 0 08 Dec 2023
Conceptualizing the Relationship between AI Explanations and User Agency Iyadunni Adenuga Jonathan Dodge 64 2 0 05 Dec 2023
TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models Aditya Chinchure Pushkar Shukla Gaurav Bhatt Kiri Salij K. Hosanagar Leonid Sigal Matthew Turk 92 29 0 03 Dec 2023