v1v2 (latest)

Visualizing and Understanding Recurrent Networks

5 June 2015

Li Fei-Fei

Papers citing "Visualizing and Understanding Recurrent Networks"

50 / 458 papers shown

Title
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis Alessandro Stolfo Yonatan Belinkov Mrinmaya Sachan MILM KELM LRM 113 54 0 24 May 2023
Can LLMs facilitate interpretation of pre-trained language models? Basel Mousi Nadir Durrani Fahim Dalvi 103 13 0 22 May 2023
Explaining black box text modules in natural language with language models Chandan Singh Aliyah R. Hsu Richard Antonello Shailee Jain Alexander G. Huth Bin Yu Jianfeng Gao MILM 87 58 0 17 May 2023
WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition Marius Bock Hilde Kuehne Kristof Van Laerhoven Michael Moeller EgoV 159 28 0 11 Apr 2023
Exposing the Functionalities of Neurons for Gated Recurrent Unit Based Sequence-to-Sequence Model Yi-Ting Lee Da-Yi Wu Chih-Chun Yang Shou-De Lin MILM 107 0 0 27 Mar 2023
DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction Zhijie Wang Yuheng Huang Basel Alomair Lei Ma Tianyi Zhang HAI 107 7 0 02 Mar 2023
Toward a Theory of Causation for Interpreting Neural Code Models David Nader-Palacio Alejandro Velasco Nathan Cooper Á. Rodríguez Kevin Moran Denys Poshyvanyk 102 17 0 07 Feb 2023
Evaluating Neuron Interpretation Methods of NLP Models Yimin Fan Fahim Dalvi Nadir Durrani Hassan Sajjad 82 8 0 30 Jan 2023
Explaining Deep Learning Hidden Neuron Activations using Concept Induction Abhilekha Dalal Md Kamruzzaman Sarker Adrita Barua Pascal Hitzler FAtt 32 2 0 23 Jan 2023
Hierarchical Explanations for Video Action Recognition Sadaf Gulshad Teng Long Nanne van Noord FAtt 102 6 0 01 Jan 2023
Towards Scalable Physically Consistent Neural Networks: an Application to Data-driven Multi-zone Thermal Building Models L. D. Natale B. Svetozarevic Philipp Heer Colin N. Jones AI4CE 121 30 0 23 Dec 2022
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions Cheng Wang Carolin (Haas) Lawrence Mathias Niepert 81 3 0 10 Dec 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models Xiaozhi Wang Kaiyue Wen Zhengyan Zhang Lei Hou Zhiyuan Liu Juanzi Li MILM MoE 90 52 0 14 Nov 2022
Machine learning-based approach for online fault Diagnosis of Discrete Event System R. Saddem D. Baptiste 18 3 0 24 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings Filip Klubicka John D. Kelleher 81 4 0 21 Oct 2022
On the Explainability of Natural Language Processing Deep Models Julia El Zini M. Awad 67 88 0 13 Oct 2022
Feature Importance for Time Series Data: Improving KernelSHAP M. Villani J. Lockhart Daniele Magazzeni FAtt AI4TS 69 7 0 05 Oct 2022
Mining Duplicate Questions of Stack Overflow Mihir Kale Anirudha Rayasam R. Parik Pranav Dheram 30 7 0 04 Oct 2022
Model Zoos: A Dataset of Diverse Populations of Neural Network Models Konstantin Schurholt Diyar Taskiran Boris Knyazev Xavier Giró-i-Nieto Damian Borth 141 30 0 29 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey Qing Lyu Marianna Apidianaki Chris Callison-Burch XAI 254 121 0 22 Sep 2022
Policy Optimization with Sparse Global Contrastive Explanations Jiayu Yao S. Parbhoo Weiwei Pan Finale Doshi-Velez OffRL 58 2 0 13 Jul 2022
Analyzing Encoded Concepts in Transformer Language Models Hassan Sajjad Nadir Durrani Fahim Dalvi Firoj Alam A. Khan Jia Xu 80 47 0 27 Jun 2022
Discovering Salient Neurons in Deep NLP Models Nadir Durrani Fahim Dalvi Hassan Sajjad KELM MILM 114 16 0 27 Jun 2022
A Unified Understanding of Deep NLP Models for Text Classification Zhuguo Li Xiting Wang Weikai Yang Jing Wu Zhengyan Zhang Zhiyuan Liu Maosong Sun Hui Zhang Shixia Liu VLM 61 32 0 19 Jun 2022
The Road to Explainability is Paved with Bias: Measuring the Fairness of Explanations Aparna Balagopalan Haoran Zhang Kimia Hamidieh Thomas Hartvigsen Frank Rudzicz Marzyeh Ghassemi 89 80 0 06 May 2022
Implicit N-grams Induced by Recurrence Xiaobing Sun Wei Lu 59 3 0 05 May 2022
DeepBayes -- an estimator for parameter estimation in stochastic nonlinear dynamical models Anubhab Ghosh M. Abdalmoaty Saikat Chatterjee H. Hjalmarsson BDL 35 3 0 04 May 2022
Visualizing and Explaining Language Models Adrian M. P. Braşoveanu Razvan Andonie MILM VLM 121 5 0 30 Apr 2022
A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference Kerem Zaman Yonatan Belinkov 101 8 0 11 Apr 2022
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers Estelle Aflalo Meng Du Shao-Yen Tseng Yongfei Liu Chenfei Wu Nan Duan Vasudev Lal 110 47 0 30 Mar 2022
Explainability in Graph Neural Networks: An Experimental Survey Peibo Li Yixing Yang Maurice Pagnucco Yang Song 71 31 0 17 Mar 2022
Neural Network Training with Asymmetric Crosspoint Elements M. Onen Tayfun Gokmen T. Todorov T. Nowicki Jesús A. del Alamo J. Rozen W. Haensch Seyoung Kim 100 21 0 31 Jan 2022
Extracting Finite Automata from RNNs Using State Merging William Merrill Nikolaos Tsilivis 85 15 0 28 Jan 2022
Attention cannot be an Explanation Arjun Reddy Akula Song-Chun Zhu FAtt XAI 126 6 0 26 Jan 2022
Natural Language Descriptions of Deep Visual Features Evan Hernandez Sarah Schwettmann David Bau Teona Bagashvili Antonio Torralba Jacob Andreas MILM 322 126 0 26 Jan 2022
A Latent-Variable Model for Intrinsic Probing Karolina Stañczak Lucas Torroba Hennigen Adina Williams Ryan Cotterell Isabelle Augenstein 121 4 0 20 Jan 2022
Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning Wenjie Shi Gao Huang Shiji Song Cheng Wu 87 11 0 06 Dec 2021
Controlling Conditional Language Models without Catastrophic Forgetting Tomasz Korbak Hady ElSahar Germán Kruszewski Marc Dymetman CLL AI4CE 119 35 0 01 Dec 2021
Data-Based Models for Hurricane Evolution Prediction: A Deep Learning Approach Rikhi Bose A. Pintar E. Simiu 62 0 0 30 Oct 2021
Hyper-Representations: Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction Konstantin Schurholt Dimche Kostadinov Damian Borth SSL 126 15 0 28 Oct 2021
Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators S. Lowe Robert C. Earle Jason dÉon Thomas Trappenberg Sageev Oore 69 2 0 22 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review Xiaofei Sun Diyi Yang Xiaoya Li Tianwei Zhang Yuxian Meng Han Qiu Guoyin Wang Eduard H. Hovy Jiwei Li 99 47 0 20 Oct 2021
GenNI: Human-AI Collaboration for Data-Backed Text Generation Hendrik Strobelt J. Kinley Robert Krueger Johanna Beyer Hanspeter Pfister Alexander M. Rush 90 23 0 19 Oct 2021
Distinguishing rule- and exemplar-based generalization in learning systems Ishita Dasgupta Erin Grant Thomas Griffiths 88 16 0 08 Oct 2021
Short-term traffic prediction using physics-aware neural networks M. Pereira Annika Lang Balázs Kulcsár 83 22 0 21 Sep 2021
CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models Arjun Reddy Akula Keze Wang Changsong Liu Sari Saba-Sadiya Hongjing Lu S. Todorovic J. Chai Song-Chun Zhu 115 49 0 03 Sep 2021
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools Nils Feldhus Robert Schwarzenberg Sebastian Möller 123 14 0 31 Aug 2021
Neuron-level Interpretation of Deep NLP Models: A Survey Hassan Sajjad Nadir Durrani Fahim Dalvi MILM AI4CE 133 85 0 30 Aug 2021
A Learning-Based Fast Uplink Grant for Massive IoT via Support Vector Machines and Long Short-Term Memory Eslam Eldeeb M. Shehab Hirley Alves 40 27 0 02 Aug 2021
Improving Deep Learning for HAR with shallow LSTMs Marius Bock Alexander Hoelzemann Michael Moeller Kristof Van Laerhoven HAI AI4TS 81 60 0 02 Aug 2021