Visualizing and Understanding Recurrent Networks

5 June 2015

Li Fei-Fei

Papers citing "Visualizing and Understanding Recurrent Networks"

50 / 458 papers shown

Title
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization Or Shafran Atticus Geiger Mor Geva MILM 122 0 0 12 Jun 2025
DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition Marius Bock Michael Moeller Kristof Van Laerhoven HAI BDL 29 0 0 27 May 2025
FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks Laines Schmalwasser Niklas Penzel Joachim Denzler Julia Niebling 66 0 0 23 May 2025
A Model Zoo of Vision Transformers Damian Falk Léo Meynent Florence Pfammatter Konstantin Schurholt Damian Borth 266 1 0 14 Apr 2025
In Shift and In Variance: Assessing the Robustness of HAR Deep Learning Models against Variability Azhar Ali Khaked Nobuyuki Oishi Daniel Roggen Paula Lago 109 0 0 14 Mar 2025
Discovering Influential Neuron Path in Vision Transformers Yifan Wang Yifei Liu Yingdong Shi Chong Li Anqi Pang Sibei Yang Jingyi Yu Kan Ren ViT 261 0 0 12 Mar 2025
Superscopes: Amplifying Internal Feature Representations for Language Model Interpretation Jonathan Jacobi Gal Niv LRM ReLM 153 0 0 03 Mar 2025
An End-to-End Homomorphically Encrypted Neural Network Marcos Florencio Luiz Alencar Bianca Lima SyDa 147 0 0 22 Feb 2025
Can Input Attributions Explain Inductive Reasoning in In-Context Learning? Mengyu Ye Tatsuki Kuribayashi Goro Kobayashi Jun Suzuki LRM 177 0 0 20 Dec 2024
A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future Shilin Sun Wenbin An Feng Tian Fang Nan Qidong Liu Jing Liu N. Shah Ping Chen 170 6 0 18 Dec 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning Ilya Kaufman Omri Azencot AI4TS 72 3 0 17 Oct 2024
Mechanistic? Naomi Saphra Sarah Wiegreffe AI4CE 83 13 0 07 Oct 2024
Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents Emanuela Boros Maud Ehrmann 83 0 0 25 Sep 2024
G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis Lutao Jiang Lin Wang 70 0 0 09 Sep 2024
Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust Venkatesh Sivaraman Yannick Assogba Donghao Ren Dominik Moritz Fred Hohman VLM 89 3 0 06 Aug 2024
Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space Eduardo Sánchez-Karhunen Jose F. Quesada-Moreno Miguel A. Gutiérrez-Naranjo 28 0 0 05 Aug 2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability Aaron Mueller Jannik Brinkmann Millicent Li Samuel Marks Koyena Pal ... Arnab Sen Sharma Jiuding Sun Eric Todd David Bau Yonatan Belinkov CML 134 25 0 02 Aug 2024
Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++ Anh The Nguyen T. H. Le Muhammad Ali Babar 117 4 0 24 Jul 2024
Towards More Trustworthy and Interpretable LLMs for Code through Syntax-Grounded Explanations David Nader-Palacio Daniel Rodríguez-Cárdenas Alejandro Velasco Dipin Khati Kevin Moran Denys Poshyvanyk 100 6 0 12 Jul 2024
Confidence Regulation Neurons in Language Models Alessandro Stolfo Ben Wu Wes Gurnee Yonatan Belinkov Xingyi Song Mrinmaya Sachan Neel Nanda 85 20 0 24 Jun 2024
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP Marius Mosbach Vagrant Gautam Tomás Vergara-Browne Dietrich Klakow Mor Geva AI4CE 87 10 0 18 Jun 2024
Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training Jiancheng Xie Lou C. Kohler Voinov Noga Mudrik Zhengchao Wan Adam Charles GNN 59 0 0 04 Jun 2024
Interpretability Needs a New Paradigm Andreas Madsen Himabindu Lakkaraju Siva Reddy Sarath Chandar 74 3 0 08 May 2024
On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System Mohamed Roshdi Julian Petzold Mostafa Wahby Hussein Ebrahim Mladen Berekovic Heiko Hamann 71 0 0 26 Apr 2024
A Multimodal Automated Interpretability Agent Tamar Rott Shaham Sarah Schwettmann Franklin Wang Achyuta Rajaram Evan Hernandez Jacob Andreas Antonio Torralba 223 28 0 22 Apr 2024
Deep Neural Networks via Complex Network Theory: a Perspective Emanuele La Malfa G. Malfa Giuseppe Nicosia Vito Latora GNN 69 3 0 17 Apr 2024
JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models Yingchaojie Feng Zhizhang Chen Zhining Kang Sijia Wang Haoyu Tian Wei Zhang Minfeng Zhu Wei Chen 120 4 0 12 Apr 2024
Multi-Objective Evolutionary Neural Architecture Search for Recurrent Neural Networks Reinhard Booysen Anna Sergeevna Bosman 70 1 0 17 Mar 2024
Word Importance Explains How Prompts Affect Language Model Outputs Stefan Hackmann Haniyeh Mahmoudian Mark Steadman Michael Schmidt AAML 266 6 0 05 Mar 2024
Value Prediction for Spatiotemporal Gait Data Using Deep Learning Ryan Cavanagh Jelena Trajkovic Wenlu Zhang I-Hung Khoo Vennila Krishnan CVBM 102 0 0 29 Feb 2024
Acoustic characterization of speech rhythm: going beyond metrics with recurrent neural networks François Deloche Laurent Bonnasse-Gahot Judit Gervain 48 0 0 22 Jan 2024
Are self-explanations from Large Language Models faithful? Andreas Madsen Sarath Chandar Siva Reddy LRM 110 36 0 15 Jan 2024
Part-of-Speech Tagger for Bodo Language using Deep Learning approach Dhrubajyoti Pathak Sanjib Narzary Sukumar Nandi Bidisha Som 36 1 0 06 Jan 2024
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning Alfirsa Damasyifa Fauzulhaq Wahyu Parwitayasa Joseph A. Sugihdharma M. F. Ridhani N. Yudistira 84 0 0 05 Jan 2024
Knowledge Graph Enhanced Aspect-Level Sentiment Analysis Kavita Sharma Ritu Patel Sunita Iyer 168 0 0 02 Dec 2023
Temporal Action Localization for Inertial-based Human Activity Recognition Marius Bock Michael Moeller Kristof Van Laerhoven 65 0 0 27 Nov 2023
Automated Natural Language Explanation of Deep Visual Neurons with Large Models Chenxu Zhao Wei Qian Yucheng Shi Mengdi Huai Ninghao Liu 59 3 0 16 Oct 2023
Neurons in Large Language Models: Dead, N-gram, Positional Elena Voita Javier Ferrando Christoforos Nalmpantis MILM 169 56 0 09 Sep 2023
Characterizing Learning Curves During Language Model Pre-Training: Learning, Forgetting, and Stability Tyler A. Chang Zhuowen Tu Benjamin Bergen 61 13 0 29 Aug 2023
Cerberus: A Deep Learning Hybrid Model for Lithium-Ion Battery Aging Estimation and Prediction Based on Relaxation Voltage Curves Yue Xiang Bo Jiang Haifeng Dai 28 0 0 15 Aug 2023
A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment Ying Zhao Yu Bowen Binyuan Hui Haiyang Yu Fei Huang Yongbin Li N. Zhang 139 25 0 10 Aug 2023
Evaluating and Explaining Large Language Models for Code Using Syntactic Structures David Nader-Palacio Alejandro Velasco Daniel Rodríguez-Cárdenas Kevin Moran Denys Poshyvanyk 87 9 0 07 Aug 2023
Wider and Deeper LLM Networks are Fairer LLM Evaluators Xinghua Zhang Yu Bowen Haiyang Yu Yangyu Lv Tingwen Liu Fei Huang Hongbo Xu Yongbin Li ALM 149 90 0 03 Aug 2023
Generative Models as a Complex Systems Science: How can we make sense of large language model behavior? Ari Holtzman Peter West Luke Zettlemoyer AI4CE 110 15 0 31 Jul 2023
FSLens: A Visual Analytics Approach to Evaluating and Optimizing the Spatial Layout of Fire Stations Long-fei Chen He Wang Ouyang Yang Yang Zhou Naiyu Wang Quan Li 67 7 0 23 Jul 2023
Unveiling Vulnerabilities in Interpretable Deep Learning Systems with Query-Efficient Black-box Attacks Eldor Abdukhamidov Mohammed Abuhamad Simon S. Woo Eric Chan-Tin Tamer Abuhmed AAML 53 3 0 21 Jul 2023
Visual Analytics For Machine Learning: A Data Perspective Survey Junpeng Wang Shixia Liu Wei Zhang HAI 93 20 0 15 Jul 2023
Microbial Genetic Algorithm-based Black-box Attack against Interpretable Deep Learning Systems Eldor Abdukhamidov Mohammed Abuhamad Simon S. Woo Eric Chan-Tin Tamer Abuhmed AAML 59 1 0 13 Jul 2023
Examining the Causal Effect of First Names on Language Models: The Case of Social Commonsense Reasoning Sullam Jeoung Jana Diesner H. Kilicoglu LRM 45 5 0 01 Jun 2023
NeuroX Library for Neuron Analysis of Deep NLP Models Fahim Dalvi Hassan Sajjad Nadir Durrani 84 11 0 26 May 2023