Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.17073
Cited By
NeuroX Library for Neuron Analysis of Deep NLP Models
26 May 2023
Fahim Dalvi
Hassan Sajjad
Nadir Durrani
Re-assign community
ArXiv (abs)
PDF
HTML
Github (102★)
Papers citing
"NeuroX Library for Neuron Analysis of Deep NLP Models"
33 / 33 papers shown
Title
Knowledge Circuits in Pretrained Transformers
Yunzhi Yao
Ningyu Zhang
Zekun Xi
Meng Wang
Ziwen Xu
Shumin Deng
Huajun Chen
KELM
152
24
0
28 May 2024
NxPlain: Web-based Tool for Discovery of Latent Concepts
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
Tamim Jaban
Musab Husaini
Ummar Abbas
47
1
0
06 Mar 2023
Evaluating Neuron Interpretation Methods of NLP Models
Yimin Fan
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
65
8
0
30 Jan 2023
ConceptX: A Framework for Latent Concept Analysis
Firoj Alam
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
A. Khan
Jia Xu
57
5
0
12 Nov 2022
Discovering Salient Neurons in Deep NLP Models
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
KELM
MILM
78
16
0
27 Jun 2022
LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models
Mor Geva
Avi Caciularu
Guy Dar
Paul Roit
Shoval Sadde
Micah Shlain
Bar Tamir
Yoav Goldberg
KELM
72
28
0
26 Apr 2022
On the Pitfalls of Analyzing Individual Neurons in Language Models
Omer Antverg
Yonatan Belinkov
MILM
74
53
0
14 Oct 2021
Neuron-level Interpretation of Deep NLP Models: A Survey
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
MILM
AI4CE
78
85
0
30 Aug 2021
How transfer learning impacts linguistic knowledge in deep NLP models?
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
35
51
0
31 May 2021
Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation
Shuhao Gu
Yang Feng
Wanying Xie
CLL
AI4CE
54
28
0
25 Mar 2021
diagNNose: A Library for Neural Activation Analysis
Jaap Jumelet
AI4CE
42
9
0
13 Nov 2020
Intrinsic Probing through Dimension Selection
Lucas Torroba Hennigen
Adina Williams
Ryan Cotterell
56
58
0
06 Oct 2020
Analyzing Individual Neurons in Pre-trained Language Models
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
Yonatan Belinkov
MILM
60
104
0
06 Oct 2020
Captum: A unified and generic model interpretability library for PyTorch
Narine Kokhlikyan
Vivek Miglani
Miguel Martin
Edward Wang
B. Alsallakh
...
Alexander Melnikov
Natalia Kliushkina
Carlos Araya
Siqi Yan
Orion Reblitz-Richardson
FAtt
144
846
0
16 Sep 2020
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models
Ian Tenney
James Wexler
Jasmijn Bastings
Tolga Bolukbasi
Andy Coenen
...
Ellen Jiang
Mahima Pushkarna
Carey Radebaugh
Emily Reif
Ann Yuan
VLM
121
195
0
12 Aug 2020
Compositional Explanations of Neurons
Jesse Mu
Jacob Andreas
FAtt
CoGe
MILM
69
178
0
24 Jun 2020
Finding Experts in Transformer Models
Xavier Suau
Luca Zappella
N. Apostoloff
50
31
0
15 May 2020
A Primer in BERTology: What we know about how BERT works
Anna Rogers
Olga Kovaleva
Anna Rumshisky
OffRL
94
1,498
0
27 Feb 2020
Explaining Explanations: Axiomatic Feature Interactions for Deep Networks
Joseph D. Janizek
Pascal Sturmfels
Su-In Lee
FAtt
76
148
0
10 Feb 2020
Designing and Interpreting Probes with Control Tasks
John Hewitt
Percy Liang
81
537
0
08 Sep 2019
The What-If Tool: Interactive Probing of Machine Learning Models
James Wexler
Mahima Pushkarna
Tolga Bolukbasi
Martin Wattenberg
F. Viégas
Jimbo Wilson
VLM
79
495
0
09 Jul 2019
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
140
1,478
0
15 May 2019
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
135
735
0
21 Mar 2019
The emergence of number and syntax units in LSTM language models
Yair Lakretz
Germán Kruszewski
T. Desbordes
Dieuwke Hupkes
S. Dehaene
Marco Baroni
53
171
0
18 Mar 2019
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks
Fahim Dalvi
Avery Nortonsmith
A. Bau
Yonatan Belinkov
Hassan Sajjad
Nadir Durrani
James R. Glass
54
52
0
21 Dec 2018
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
Yonatan Belinkov
A. Bau
James R. Glass
MILM
64
192
0
21 Dec 2018
Identifying and Controlling Important Neurons in Neural Machine Translation
A. Bau
Yonatan Belinkov
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
James R. Glass
MILM
75
184
0
03 Nov 2018
How Important Is a Neuron?
Kedar Dhamdhere
Mukund Sundararajan
Qiqi Yan
FAtt
GNN
60
130
0
30 May 2018
Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure
Dieuwke Hupkes
Sara Veldhoen
Willem H. Zuidema
76
280
0
28 Nov 2017
A Unified Approach to Interpreting Model Predictions
Scott M. Lundberg
Su-In Lee
FAtt
1.1K
22,018
0
22 May 2017
What do Neural Machine Translation Models Learn about Morphology?
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
103
415
0
11 Apr 2017
Representation of linguistic form and function in recurrent neural networks
Ákos Kádár
Grzegorz Chrupała
Afra Alishahi
65
162
0
29 Feb 2016
Visualizing and Understanding Recurrent Networks
A. Karpathy
Justin Johnson
Li Fei-Fei
HAI
130
1,101
0
05 Jun 2015
1