Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.02078
Cited By
v1
v2 (latest)
Visualizing and Understanding Recurrent Networks
5 June 2015
A. Karpathy
Justin Johnson
Li Fei-Fei
HAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Visualizing and Understanding Recurrent Networks"
50 / 458 papers shown
Title
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization
Or Shafran
Atticus Geiger
Mor Geva
MILM
122
0
0
12 Jun 2025
DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition
Marius Bock
Michael Moeller
Kristof Van Laerhoven
HAI
BDL
29
0
0
27 May 2025
FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks
Laines Schmalwasser
Niklas Penzel
Joachim Denzler
Julia Niebling
66
0
0
23 May 2025
A Model Zoo of Vision Transformers
Damian Falk
Léo Meynent
Florence Pfammatter
Konstantin Schurholt
Damian Borth
266
1
0
14 Apr 2025
In Shift and In Variance: Assessing the Robustness of HAR Deep Learning Models against Variability
Azhar Ali Khaked
Nobuyuki Oishi
Daniel Roggen
Paula Lago
109
0
0
14 Mar 2025
Discovering Influential Neuron Path in Vision Transformers
Yifan Wang
Yifei Liu
Yingdong Shi
Chong Li
Anqi Pang
Sibei Yang
Jingyi Yu
Kan Ren
ViT
261
0
0
12 Mar 2025
Superscopes: Amplifying Internal Feature Representations for Language Model Interpretation
Jonathan Jacobi
Gal Niv
LRM
ReLM
153
0
0
03 Mar 2025
An End-to-End Homomorphically Encrypted Neural Network
Marcos Florencio
Luiz Alencar
Bianca Lima
SyDa
147
0
0
22 Feb 2025
Can Input Attributions Explain Inductive Reasoning in In-Context Learning?
Mengyu Ye
Tatsuki Kuribayashi
Goro Kobayashi
Jun Suzuki
LRM
177
0
0
20 Dec 2024
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future
Shilin Sun
Wenbin An
Feng Tian
Fang Nan
Qidong Liu
Jing Liu
N. Shah
Ping Chen
170
6
0
18 Dec 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning
Ilya Kaufman
Omri Azencot
AI4TS
72
3
0
17 Oct 2024
Mechanistic?
Naomi Saphra
Sarah Wiegreffe
AI4CE
83
13
0
07 Oct 2024
Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents
Emanuela Boros
Maud Ehrmann
83
0
0
25 Sep 2024
G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis
Lutao Jiang
Lin Wang
70
0
0
09 Sep 2024
Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments
Angie Boggust
Venkatesh Sivaraman
Yannick Assogba
Donghao Ren
Dominik Moritz
Fred Hohman
VLM
89
3
0
06 Aug 2024
Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space
Eduardo Sánchez-Karhunen
Jose F. Quesada-Moreno
Miguel A. Gutiérrez-Naranjo
28
0
0
05 Aug 2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
134
25
0
02 Aug 2024
Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++
Anh The Nguyen
T. H. Le
Muhammad Ali Babar
117
4
0
24 Jul 2024
Towards More Trustworthy and Interpretable LLMs for Code through Syntax-Grounded Explanations
David Nader-Palacio
Daniel Rodríguez-Cárdenas
Alejandro Velasco
Dipin Khati
Kevin Moran
Denys Poshyvanyk
100
6
0
12 Jul 2024
Confidence Regulation Neurons in Language Models
Alessandro Stolfo
Ben Wu
Wes Gurnee
Yonatan Belinkov
Xingyi Song
Mrinmaya Sachan
Neel Nanda
85
20
0
24 Jun 2024
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP
Marius Mosbach
Vagrant Gautam
Tomás Vergara-Browne
Dietrich Klakow
Mor Geva
AI4CE
87
10
0
18 Jun 2024
Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training
Jiancheng Xie
Lou C. Kohler Voinov
Noga Mudrik
Zhengchao Wan
Adam Charles
GNN
59
0
0
04 Jun 2024
Interpretability Needs a New Paradigm
Andreas Madsen
Himabindu Lakkaraju
Siva Reddy
Sarath Chandar
74
3
0
08 May 2024
On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System
Mohamed Roshdi
Julian Petzold
Mostafa Wahby
Hussein Ebrahim
Mladen Berekovic
Heiko Hamann
71
0
0
26 Apr 2024
A Multimodal Automated Interpretability Agent
Tamar Rott Shaham
Sarah Schwettmann
Franklin Wang
Achyuta Rajaram
Evan Hernandez
Jacob Andreas
Antonio Torralba
223
28
0
22 Apr 2024
Deep Neural Networks via Complex Network Theory: a Perspective
Emanuele La Malfa
G. Malfa
Giuseppe Nicosia
Vito Latora
GNN
69
3
0
17 Apr 2024
JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models
Yingchaojie Feng
Zhizhang Chen
Zhining Kang
Sijia Wang
Haoyu Tian
Wei Zhang
Minfeng Zhu
Wei Chen
120
4
0
12 Apr 2024
Multi-Objective Evolutionary Neural Architecture Search for Recurrent Neural Networks
Reinhard Booysen
Anna Sergeevna Bosman
70
1
0
17 Mar 2024
Word Importance Explains How Prompts Affect Language Model Outputs
Stefan Hackmann
Haniyeh Mahmoudian
Mark Steadman
Michael Schmidt
AAML
266
6
0
05 Mar 2024
Value Prediction for Spatiotemporal Gait Data Using Deep Learning
Ryan Cavanagh
Jelena Trajkovic
Wenlu Zhang
I-Hung Khoo
Vennila Krishnan
CVBM
102
0
0
29 Feb 2024
Acoustic characterization of speech rhythm: going beyond metrics with recurrent neural networks
Franccois Deloche
Laurent Bonnasse-Gahot
Judit Gervain
48
0
0
22 Jan 2024
Are self-explanations from Large Language Models faithful?
Andreas Madsen
Sarath Chandar
Siva Reddy
LRM
110
36
0
15 Jan 2024
Part-of-Speech Tagger for Bodo Language using Deep Learning approach
Dhrubajyoti Pathak
Sanjib Narzary
Sukumar Nandi
Bidisha Som
36
1
0
06 Jan 2024
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning
Alfirsa Damasyifa Fauzulhaq
Wahyu Parwitayasa
Joseph A. Sugihdharma
M. F. Ridhani
N. Yudistira
84
0
0
05 Jan 2024
Knowledge Graph Enhanced Aspect-Level Sentiment Analysis
Kavita Sharma
Ritu Patel
Sunita Iyer
168
0
0
02 Dec 2023
Temporal Action Localization for Inertial-based Human Activity Recognition
Marius Bock
Michael Moeller
Kristof Van Laerhoven
65
0
0
27 Nov 2023
Automated Natural Language Explanation of Deep Visual Neurons with Large Models
Chenxu Zhao
Wei Qian
Yucheng Shi
Mengdi Huai
Ninghao Liu
59
3
0
16 Oct 2023
Neurons in Large Language Models: Dead, N-gram, Positional
Elena Voita
Javier Ferrando
Christoforos Nalmpantis
MILM
169
56
0
09 Sep 2023
Characterizing Learning Curves During Language Model Pre-Training: Learning, Forgetting, and Stability
Tyler A. Chang
Zhuowen Tu
Benjamin Bergen
61
13
0
29 Aug 2023
Cerberus: A Deep Learning Hybrid Model for Lithium-Ion Battery Aging Estimation and Prediction Based on Relaxation Voltage Curves
Yue Xiang
Bo Jiang
Haifeng Dai
28
0
0
15 Aug 2023
A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment
Ying Zhao
Yu Bowen
Binyuan Hui
Haiyang Yu
Fei Huang
Yongbin Li
N. Zhang
139
25
0
10 Aug 2023
Evaluating and Explaining Large Language Models for Code Using Syntactic Structures
David Nader-Palacio
Alejandro Velasco
Daniel Rodríguez-Cárdenas
Kevin Moran
Denys Poshyvanyk
87
9
0
07 Aug 2023
Wider and Deeper LLM Networks are Fairer LLM Evaluators
Xinghua Zhang
Yu Bowen
Haiyang Yu
Yangyu Lv
Tingwen Liu
Fei Huang
Hongbo Xu
Yongbin Li
ALM
149
90
0
03 Aug 2023
Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?
Ari Holtzman
Peter West
Luke Zettlemoyer
AI4CE
110
15
0
31 Jul 2023
FSLens: A Visual Analytics Approach to Evaluating and Optimizing the Spatial Layout of Fire Stations
Long-fei Chen
He Wang
Ouyang Yang
Yang Zhou
Naiyu Wang
Quan Li
67
7
0
23 Jul 2023
Unveiling Vulnerabilities in Interpretable Deep Learning Systems with Query-Efficient Black-box Attacks
Eldor Abdukhamidov
Mohammed Abuhamad
Simon S. Woo
Eric Chan-Tin
Tamer Abuhmed
AAML
53
3
0
21 Jul 2023
Visual Analytics For Machine Learning: A Data Perspective Survey
Junpeng Wang
Shixia Liu
Wei Zhang
HAI
93
20
0
15 Jul 2023
Microbial Genetic Algorithm-based Black-box Attack against Interpretable Deep Learning Systems
Eldor Abdukhamidov
Mohammed Abuhamad
Simon S. Woo
Eric Chan-Tin
Tamer Abuhmed
AAML
59
1
0
13 Jul 2023
Examining the Causal Effect of First Names on Language Models: The Case of Social Commonsense Reasoning
Sullam Jeoung
Jana Diesner
H. Kilicoglu
LRM
45
5
0
01 Jun 2023
NeuroX Library for Neuron Analysis of Deep NLP Models
Fahim Dalvi
Hassan Sajjad
Nadir Durrani
84
11
0
26 May 2023
1
2
3
4
...
8
9
10
Next