Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.02078
Cited By
v1
v2 (latest)
Visualizing and Understanding Recurrent Networks
5 June 2015
A. Karpathy
Justin Johnson
Li Fei-Fei
HAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Visualizing and Understanding Recurrent Networks"
50 / 458 papers shown
Title
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Alessandro Stolfo
Yonatan Belinkov
Mrinmaya Sachan
MILM
KELM
LRM
113
54
0
24 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Basel Mousi
Nadir Durrani
Fahim Dalvi
103
13
0
22 May 2023
Explaining black box text modules in natural language with language models
Chandan Singh
Aliyah R. Hsu
Richard Antonello
Shailee Jain
Alexander G. Huth
Bin Yu
Jianfeng Gao
MILM
87
58
0
17 May 2023
WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition
Marius Bock
Hilde Kuehne
Kristof Van Laerhoven
Michael Moeller
EgoV
159
28
0
11 Apr 2023
Exposing the Functionalities of Neurons for Gated Recurrent Unit Based Sequence-to-Sequence Model
Yi-Ting Lee
Da-Yi Wu
Chih-Chun Yang
Shou-De Lin
MILM
107
0
0
27 Mar 2023
DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction
Zhijie Wang
Yuheng Huang
Basel Alomair
Lei Ma
Tianyi Zhang
HAI
107
7
0
02 Mar 2023
Toward a Theory of Causation for Interpreting Neural Code Models
David Nader-Palacio
Alejandro Velasco
Nathan Cooper
Á. Rodríguez
Kevin Moran
Denys Poshyvanyk
102
17
0
07 Feb 2023
Evaluating Neuron Interpretation Methods of NLP Models
Yimin Fan
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
82
8
0
30 Jan 2023
Explaining Deep Learning Hidden Neuron Activations using Concept Induction
Abhilekha Dalal
Md Kamruzzaman Sarker
Adrita Barua
Pascal Hitzler
FAtt
32
2
0
23 Jan 2023
Hierarchical Explanations for Video Action Recognition
Sadaf Gulshad
Teng Long
Nanne van Noord
FAtt
102
6
0
01 Jan 2023
Towards Scalable Physically Consistent Neural Networks: an Application to Data-driven Multi-zone Thermal Building Models
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
AI4CE
121
30
0
23 Dec 2022
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
Cheng Wang
Carolin (Haas) Lawrence
Mathias Niepert
81
3
0
10 Dec 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILM
MoE
90
52
0
14 Nov 2022
Machine learning-based approach for online fault Diagnosis of Discrete Event System
R. Saddem
D. Baptiste
18
3
0
24 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings
Filip Klubicka
John D. Kelleher
81
4
0
21 Oct 2022
On the Explainability of Natural Language Processing Deep Models
Julia El Zini
M. Awad
67
88
0
13 Oct 2022
Feature Importance for Time Series Data: Improving KernelSHAP
M. Villani
J. Lockhart
Daniele Magazzeni
FAtt
AI4TS
69
7
0
05 Oct 2022
Mining Duplicate Questions of Stack Overflow
Mihir Kale
Anirudha Rayasam
R. Parik
Pranav Dheram
30
7
0
04 Oct 2022
Model Zoos: A Dataset of Diverse Populations of Neural Network Models
Konstantin Schurholt
Diyar Taskiran
Boris Knyazev
Xavier Giró-i-Nieto
Damian Borth
141
30
0
29 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
254
121
0
22 Sep 2022
Policy Optimization with Sparse Global Contrastive Explanations
Jiayu Yao
S. Parbhoo
Weiwei Pan
Finale Doshi-Velez
OffRL
58
2
0
13 Jul 2022
Analyzing Encoded Concepts in Transformer Language Models
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
Firoj Alam
A. Khan
Jia Xu
80
47
0
27 Jun 2022
Discovering Salient Neurons in Deep NLP Models
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
KELM
MILM
114
16
0
27 Jun 2022
A Unified Understanding of Deep NLP Models for Text Classification
Zhuguo Li
Xiting Wang
Weikai Yang
Jing Wu
Zhengyan Zhang
Zhiyuan Liu
Maosong Sun
Hui Zhang
Shixia Liu
VLM
61
32
0
19 Jun 2022
The Road to Explainability is Paved with Bias: Measuring the Fairness of Explanations
Aparna Balagopalan
Haoran Zhang
Kimia Hamidieh
Thomas Hartvigsen
Frank Rudzicz
Marzyeh Ghassemi
89
80
0
06 May 2022
Implicit N-grams Induced by Recurrence
Xiaobing Sun
Wei Lu
59
3
0
05 May 2022
DeepBayes -- an estimator for parameter estimation in stochastic nonlinear dynamical models
Anubhab Ghosh
M. Abdalmoaty
Saikat Chatterjee
H. Hjalmarsson
BDL
35
3
0
04 May 2022
Visualizing and Explaining Language Models
Adrian M. P. Braşoveanu
Razvan Andonie
MILM
VLM
121
5
0
30 Apr 2022
A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference
Kerem Zaman
Yonatan Belinkov
101
8
0
11 Apr 2022
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
Estelle Aflalo
Meng Du
Shao-Yen Tseng
Yongfei Liu
Chenfei Wu
Nan Duan
Vasudev Lal
110
47
0
30 Mar 2022
Explainability in Graph Neural Networks: An Experimental Survey
Peibo Li
Yixing Yang
Maurice Pagnucco
Yang Song
71
31
0
17 Mar 2022
Neural Network Training with Asymmetric Crosspoint Elements
M. Onen
Tayfun Gokmen
T. Todorov
T. Nowicki
Jesús A. del Alamo
J. Rozen
W. Haensch
Seyoung Kim
100
21
0
31 Jan 2022
Extracting Finite Automata from RNNs Using State Merging
William Merrill
Nikolaos Tsilivis
85
15
0
28 Jan 2022
Attention cannot be an Explanation
Arjun Reddy Akula
Song-Chun Zhu
FAtt
XAI
126
6
0
26 Jan 2022
Natural Language Descriptions of Deep Visual Features
Evan Hernandez
Sarah Schwettmann
David Bau
Teona Bagashvili
Antonio Torralba
Jacob Andreas
MILM
322
126
0
26 Jan 2022
A Latent-Variable Model for Intrinsic Probing
Karolina Stañczak
Lucas Torroba Hennigen
Adina Williams
Ryan Cotterell
Isabelle Augenstein
121
4
0
20 Jan 2022
Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Cheng Wu
87
11
0
06 Dec 2021
Controlling Conditional Language Models without Catastrophic Forgetting
Tomasz Korbak
Hady ElSahar
Germán Kruszewski
Marc Dymetman
CLL
AI4CE
119
35
0
01 Dec 2021
Data-Based Models for Hurricane Evolution Prediction: A Deep Learning Approach
Rikhi Bose
A. Pintar
E. Simiu
62
0
0
30 Oct 2021
Hyper-Representations: Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction
Konstantin Schurholt
Dimche Kostadinov
Damian Borth
SSL
126
15
0
28 Oct 2021
Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators
S. Lowe
Robert C. Earle
Jason dÉon
Thomas Trappenberg
Sageev Oore
69
2
0
22 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
99
47
0
20 Oct 2021
GenNI: Human-AI Collaboration for Data-Backed Text Generation
Hendrik Strobelt
J. Kinley
Robert Krueger
Johanna Beyer
Hanspeter Pfister
Alexander M. Rush
90
23
0
19 Oct 2021
Distinguishing rule- and exemplar-based generalization in learning systems
Ishita Dasgupta
Erin Grant
Thomas Griffiths
88
16
0
08 Oct 2021
Short-term traffic prediction using physics-aware neural networks
M. Pereira
Annika Lang
Balázs Kulcsár
83
22
0
21 Sep 2021
CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models
Arjun Reddy Akula
Keze Wang
Changsong Liu
Sari Saba-Sadiya
Hongjing Lu
S. Todorovic
J. Chai
Song-Chun Zhu
115
49
0
03 Sep 2021
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools
Nils Feldhus
Robert Schwarzenberg
Sebastian Möller
123
14
0
31 Aug 2021
Neuron-level Interpretation of Deep NLP Models: A Survey
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
MILM
AI4CE
133
85
0
30 Aug 2021
A Learning-Based Fast Uplink Grant for Massive IoT via Support Vector Machines and Long Short-Term Memory
Eslam Eldeeb
M. Shehab
Hirley Alves
40
27
0
02 Aug 2021
Improving Deep Learning for HAR with shallow LSTMs
Marius Bock
Alexander Hoelzemann
Michael Moeller
Kristof Van Laerhoven
HAI
AI4TS
81
60
0
02 Aug 2021
Previous
1
2
3
4
5
...
8
9
10
Next