NeuroX Library for Neuron Analysis of Deep NLP Models

26 May 2023

Papers citing "NeuroX Library for Neuron Analysis of Deep NLP Models"

33 / 33 papers shown

Title
Knowledge Circuits in Pretrained Transformers Yunzhi Yao Ningyu Zhang Zekun Xi Meng Wang Ziwen Xu Shumin Deng Huajun Chen KELM 152 24 0 28 May 2024
NxPlain: Web-based Tool for Discovery of Latent Concepts Fahim Dalvi Nadir Durrani Hassan Sajjad Tamim Jaban Musab Husaini Ummar Abbas 47 1 0 06 Mar 2023
Evaluating Neuron Interpretation Methods of NLP Models Yimin Fan Fahim Dalvi Nadir Durrani Hassan Sajjad 65 8 0 30 Jan 2023
ConceptX: A Framework for Latent Concept Analysis Firoj Alam Fahim Dalvi Nadir Durrani Hassan Sajjad A. Khan Jia Xu 57 5 0 12 Nov 2022
Discovering Salient Neurons in Deep NLP Models Nadir Durrani Fahim Dalvi Hassan Sajjad KELM MILM 78 16 0 27 Jun 2022
LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models Mor Geva Avi Caciularu Guy Dar Paul Roit Shoval Sadde Micah Shlain Bar Tamir Yoav Goldberg KELM 72 28 0 26 Apr 2022
On the Pitfalls of Analyzing Individual Neurons in Language Models Omer Antverg Yonatan Belinkov MILM 74 53 0 14 Oct 2021
Neuron-level Interpretation of Deep NLP Models: A Survey Hassan Sajjad Nadir Durrani Fahim Dalvi MILM AI4CE 78 85 0 30 Aug 2021
How transfer learning impacts linguistic knowledge in deep NLP models? Nadir Durrani Hassan Sajjad Fahim Dalvi 35 51 0 31 May 2021
Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation Shuhao Gu Yang Feng Wanying Xie CLL AI4CE 54 28 0 25 Mar 2021
diagNNose: A Library for Neural Activation Analysis Jaap Jumelet AI4CE 42 9 0 13 Nov 2020
Intrinsic Probing through Dimension Selection Lucas Torroba Hennigen Adina Williams Ryan Cotterell 56 58 0 06 Oct 2020
Analyzing Individual Neurons in Pre-trained Language Models Nadir Durrani Hassan Sajjad Fahim Dalvi Yonatan Belinkov MILM 60 104 0 06 Oct 2020
Captum: A unified and generic model interpretability library for PyTorch Narine Kokhlikyan Vivek Miglani Miguel Martin Edward Wang B. Alsallakh ... Alexander Melnikov Natalia Kliushkina Carlos Araya Siqi Yan Orion Reblitz-Richardson FAtt 144 846 0 16 Sep 2020
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models Ian Tenney James Wexler Jasmijn Bastings Tolga Bolukbasi Andy Coenen ... Ellen Jiang Mahima Pushkarna Carey Radebaugh Emily Reif Ann Yuan VLM 121 195 0 12 Aug 2020
Compositional Explanations of Neurons Jesse Mu Jacob Andreas FAtt CoGe MILM 69 178 0 24 Jun 2020
Finding Experts in Transformer Models Xavier Suau Luca Zappella N. Apostoloff 50 31 0 15 May 2020
A Primer in BERTology: What we know about how BERT works Anna Rogers Olga Kovaleva Anna Rumshisky OffRL 94 1,498 0 27 Feb 2020
Explaining Explanations: Axiomatic Feature Interactions for Deep Networks Joseph D. Janizek Pascal Sturmfels Su-In Lee FAtt 76 148 0 10 Feb 2020
Designing and Interpreting Probes with Control Tasks John Hewitt Percy Liang 81 537 0 08 Sep 2019
The What-If Tool: Interactive Probing of Machine Learning Models James Wexler Mahima Pushkarna Tolga Bolukbasi Martin Wattenberg F. Viégas Jimbo Wilson VLM 79 495 0 09 Jul 2019
BERT Rediscovers the Classical NLP Pipeline Ian Tenney Dipanjan Das Ellie Pavlick MILM SSeg 140 1,478 0 15 May 2019
Linguistic Knowledge and Transferability of Contextual Representations Nelson F. Liu Matt Gardner Yonatan Belinkov Matthew E. Peters Noah A. Smith 135 735 0 21 Mar 2019
The emergence of number and syntax units in LSTM language models Yair Lakretz Germán Kruszewski T. Desbordes Dieuwke Hupkes S. Dehaene Marco Baroni 53 171 0 18 Mar 2019
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks Fahim Dalvi Avery Nortonsmith A. Bau Yonatan Belinkov Hassan Sajjad Nadir Durrani James R. Glass 54 52 0 21 Dec 2018
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models Fahim Dalvi Nadir Durrani Hassan Sajjad Yonatan Belinkov A. Bau James R. Glass MILM 64 192 0 21 Dec 2018
Identifying and Controlling Important Neurons in Neural Machine Translation A. Bau Yonatan Belinkov Hassan Sajjad Nadir Durrani Fahim Dalvi James R. Glass MILM 75 184 0 03 Nov 2018
How Important Is a Neuron? Kedar Dhamdhere Mukund Sundararajan Qiqi Yan FAtt GNN 60 130 0 30 May 2018
Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure Dieuwke Hupkes Sara Veldhoen Willem H. Zuidema 76 280 0 28 Nov 2017
A Unified Approach to Interpreting Model Predictions Scott M. Lundberg Su-In Lee FAtt 1.1K 22,018 0 22 May 2017
What do Neural Machine Translation Models Learn about Morphology? Yonatan Belinkov Nadir Durrani Fahim Dalvi Hassan Sajjad James R. Glass 103 415 0 11 Apr 2017
Representation of linguistic form and function in recurrent neural networks Ákos Kádár Grzegorz Chrupała Afra Alishahi 65 162 0 29 Feb 2016
Visualizing and Understanding Recurrent Networks A. Karpathy Justin Johnson Li Fei-Fei HAI 130 1,101 0 05 Jun 2015