v1v2 (latest)

Discovering Salient Neurons in Deep NLP Models

27 June 2022

Papers citing "Discovering Salient Neurons in Deep NLP Models"

34 / 34 papers shown

Title
On the Transformation of Latent Space in Fine-Tuned NLP Models Nadir Durrani Hassan Sajjad Fahim Dalvi Firoj Alam 97 19 0 23 Oct 2022
On the Pitfalls of Analyzing Individual Neurons in Language Models Omer Antverg Yonatan Belinkov MILM 64 53 0 14 Oct 2021
German's Next Language Model Branden Chan Stefan Schweter Timo Möller 90 273 0 21 Oct 2020
FIND: Human-in-the-Loop Debugging Deep Text Classifiers Piyawat Lertvittayakumjorn Lucia Specia Francesca Toni 43 54 0 10 Oct 2020
Intrinsic Probing through Dimension Selection Lucas Torroba Hennigen Adina Williams Ryan Cotterell 54 58 0 06 Oct 2020
CausaLM: Causal Model Explanation Through Counterfactual Language Models Amir Feder Nadav Oved Uri Shalit Roi Reichart CML LRM 95 161 0 27 May 2020
Finding Experts in Transformer Models Xavier Suau Luca Zappella N. Apostoloff 48 31 0 15 May 2020
Under the Hood of Neural Networks: Characterizing Learned Representations by Functional Neuron Populations and Network Ablations Richard Meyes Constantin Waubert de Puiseau Andres Felipe Posada-Moreno Tobias Meisen AI4CE 68 22 0 02 Apr 2020
Are Pre-trained Language Models Aware of Phrases? Simple but Strong Baselines for Grammar Induction Taeuk Kim Jihun Choi Daniel Edmiston Sang-goo Lee 62 90 0 30 Jan 2020
FlauBERT: Unsupervised Language Model Pre-training for French Hang Le Loïc Vial Jibril Frej Vincent Segonne Maximin Coavoux Benjamin Lecouteux A. Allauzen Benoît Crabbé Laurent Besacier D. Schwab AI4CE 88 400 0 11 Dec 2019
Unsupervised Cross-lingual Representation Learning at Scale Alexis Conneau Kartikay Khandelwal Naman Goyal Vishrav Chaudhary Guillaume Wenzek Francisco Guzmán Edouard Grave Myle Ott Luke Zettlemoyer Veselin Stoyanov 220 6,565 0 05 Nov 2019
On the Linguistic Representational Power of Neural Machine Translation Models Yonatan Belinkov Nadir Durrani Fahim Dalvi Hassan Sajjad James R. Glass MILM 83 72 0 01 Nov 2019
Designing and Interpreting Probes with Control Tasks John Hewitt Percy Liang 76 536 0 08 Sep 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding Zhilin Yang Zihang Dai Yiming Yang J. Carbonell Ruslan Salakhutdinov Quoc V. Le AI4CE 232 8,433 0 19 Jun 2019
Are Sixteen Heads Really Better than One? Paul Michel Omer Levy Graham Neubig MoE 100 1,062 0 25 May 2019
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned Elena Voita David Talbot F. Moiseev Rico Sennrich Ivan Titov 114 1,141 0 23 May 2019
BERT Rediscovers the Classical NLP Pipeline Ian Tenney Dipanjan Das Ellie Pavlick MILM SSeg 138 1,471 0 15 May 2019
Linguistic Knowledge and Transferability of Contextual Representations Nelson F. Liu Matt Gardner Yonatan Belinkov Matthew E. Peters Noah A. Smith 130 733 0 21 Mar 2019
The emergence of number and syntax units in LSTM language models Yair Lakretz Germán Kruszewski T. Desbordes Dieuwke Hupkes S. Dehaene Marco Baroni 53 170 0 18 Mar 2019
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models Fahim Dalvi Nadir Durrani Hassan Sajjad Yonatan Belinkov A. Bau James R. Glass MILM 61 191 0 21 Dec 2018
Targeted Syntactic Evaluation of Language Models Rebecca Marvin Tal Linzen 81 416 0 27 Aug 2018
What you can cram into a single vector: Probing sentence embeddings for linguistic properties Alexis Conneau Germán Kruszewski Guillaume Lample Loïc Barrault Marco Baroni 342 894 0 03 May 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 1.1K 7,159 0 20 Apr 2018
Colorless green recurrent networks dream hierarchically Kristina Gulordava Piotr Bojanowski Edouard Grave Tal Linzen Marco Baroni 91 505 0 29 Mar 2018
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction Leland McInnes John Healy James Melville 160 9,432 0 09 Feb 2018
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks Yonatan Belinkov Lluís Màrquez i Villodre Hassan Sajjad Nadir Durrani Fahim Dalvi James R. Glass 63 165 0 23 Jan 2018
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference Adina Williams Nikita Nangia Samuel R. Bowman 524 4,479 0 18 Apr 2017
What do Neural Machine Translation Models Learn about Morphology? Yonatan Belinkov Nadir Durrani Fahim Dalvi Hassan Sajjad James R. Glass 103 414 0 11 Apr 2017
Axiomatic Attribution for Deep Networks Mukund Sundararajan Ankur Taly Qiqi Yan OOD FAtt 188 5,989 0 04 Mar 2017
Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies Tal Linzen Emmanuel Dupoux Yoav Goldberg 101 905 0 04 Nov 2016
SQuAD: 100,000+ Questions for Machine Comprehension of Text Pranav Rajpurkar Jian Zhang Konstantin Lopyrev Percy Liang RALM 283 8,134 0 16 Jun 2016
Visualizing and Understanding Recurrent Networks A. Karpathy Justin Johnson Li Fei-Fei HAI 118 1,101 0 05 Jun 2015
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps Karen Simonyan Andrea Vedaldi Andrew Zisserman FAtt 312 7,295 0 20 Dec 2013
A Universal Part-of-Speech Tagset Slav Petrov Dipanjan Das Ryan T. McDonald 89 1,036 0 11 Apr 2011