The emergence of number and syntax units in LSTM language models
Yair Lakretz, Germán Kruszewski, T. Desbordes, Dieuwke Hupkes, S. Dehaene, Marco Baroni
arXiv:1903.07435, 18 March 2019

Papers citing "The emergence of number and syntax units in LSTM language models"

41 papers shown
Crafting Large Language Models for Enhanced Interpretability
Chung-En Sun, Tuomas P. Oikarinen, Tsui-Wei Weng
05 Jul 2024

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model
Abhijith Chintam, Rahel Beloch, Willem H. Zuidema, Michael Hanna, Oskar van der Wal
19 Oct 2023

Disentangling the Linguistic Competence of Privacy-Preserving BERT
Stefan Arnold, Nils Kemmerzell, Annika Schreiner
17 Oct 2023

Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?
Ari Holtzman, Peter West, Luke Zettlemoyer
AI4CE
31 Jul 2023

Generalizing Backpropagation for Gradient-Based Interpretability
Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alex Warstadt, Ryan Cotterell
MILM, FAtt
06 Jul 2023

Language acquisition: do children and language models follow similar learning stages?
Linnea Evanson, Yair Lakretz, J. King
06 Jun 2023

Backpack Language Models
John Hewitt, John Thickstun, Christopher D. Manning, Percy Liang
KELM
26 May 2023

Exposing the Functionalities of Neurons for Gated Recurrent Unit Based Sequence-to-Sequence Model
Yi-Ting Lee, Da-Yi Wu, Chih-Chun Yang, Shou-De Lin
MILM
27 Mar 2023

NxPlain: Web-based Tool for Discovery of Latent Concepts
Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Tamim Jaban, Musab Husaini, Ummar Abbas
06 Mar 2023

Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context
Alexandre Pasquiou, Yair Lakretz, B. Thirion, Christophe Pallier
28 Feb 2023

Dissociating language and thought in large language models
Kyle Mahowald, Anna A. Ivanova, I. Blank, Nancy Kanwisher, J. Tenenbaum, Evelina Fedorenko
ELM, ReLM
16 Jan 2023

Language Cognition and Language Computation -- Human and Machine Language Understanding
Shaonan Wang, Nai Ding, Nan Lin, Jiajun Zhang, Chengqing Zong
12 Jan 2023

Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models
Peter Hase, Joey Tianyi Zhou, Been Kim, Asma Ghandeharioun
MILM
10 Jan 2023

Do LSTMs See Gender? Probing the Ability of LSTMs to Learn Abstract Syntactic Rules
Priyanka Sukumaran, Conor J. Houghton, N. Kazanina
31 Oct 2022

Composition, Attention, or Both?
Ryosuke Yoshida, Yohei Oseki
CoGe
24 Oct 2022

Probing with Noise: Unpicking the Warp and Weft of Embeddings
Filip Klubicka, John D. Kelleher
21 Oct 2022

State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, ..., Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin
06 Oct 2022

A computational psycholinguistic evaluation of the syntactic abilities of Galician BERT models at the interface of dependency resolution and training time
Iria de-Dios-Flores, Marcos Garcia
06 Jun 2022

Discovering Latent Concepts Learned in BERT
Fahim Dalvi, A. Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad
SSL
15 May 2022

Does BERT really agree? Fine-grained Analysis of Lexical Dependence on a Syntactic Task
Karim Lasri, Alessandro Lenci, Thierry Poibeau
14 Apr 2022

A Latent-Variable Model for Intrinsic Probing
Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein
20 Jan 2022

Sparse Interventions in Language Models with Differentiable Masking
Nicola De Cao, Leon Schmid, Dieuwke Hupkes, Ivan Titov
13 Dec 2021

Minimum Description Length Recurrent Neural Networks
Nur Lan, Michal Geyer, Emmanuel Chemla, Roni Katzir
31 Oct 2021

On the Pitfalls of Analyzing Individual Neurons in Language Models
Omer Antverg, Yonatan Belinkov
MILM
14 Oct 2021

Causal Transformers Perform Below Chance on Recursive Nested Constructions, Unlike Humans
Yair Lakretz, T. Desbordes, Dieuwke Hupkes, S. Dehaene
14 Oct 2021

Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations
Arabella J. Sinclair, Jaap Jumelet, Willem H. Zuidema, Raquel Fernández
30 Sep 2021

Neuron-level Interpretation of Deep NLP Models: A Survey
Hassan Sajjad, Nadir Durrani, Fahim Dalvi
MILM, AI4CE
30 Aug 2021

On the proper role of linguistically-oriented deep net analysis in linguistic theorizing
Marco Baroni
16 Jun 2021

Counterfactual Interventions Reveal the Causal Effect of Relative Clause Representations on Agreement Prediction
Shauli Ravfogel, Grusha Prasad, Tal Linzen, Yoav Goldberg
14 May 2021

Editing Factual Knowledge in Language Models
Nicola De Cao, Wilker Aziz, Ivan Titov
KELM
16 Apr 2021

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Koustuv Sinha, Robin Jia, Dieuwke Hupkes, J. Pineau, Adina Williams, Douwe Kiela
14 Apr 2021

Disentangling Syntax and Semantics in the Brain with Deep Networks
Charlotte Caucheteux, Alexandre Gramfort, J. King
02 Mar 2021

Probing Classifiers: Promises, Shortcomings, and Advances
Yonatan Belinkov
24 Feb 2021

Can RNNs learn Recursive Nested Subject-Verb Agreements?
Yair Lakretz, T. Desbordes, J. King, Benoît Crabbé, Maxime Oquab, S. Dehaene
06 Jan 2021

A Discrete Variational Recurrent Topic Model without the Reparametrization Trick
Mehdi Rezaee, Francis Ferraro
BDL, DRL
22 Oct 2020

Mechanisms for Handling Nested Dependencies in Neural-Network Language Models and Humans
Yair Lakretz, Dieuwke Hupkes, A. Vergallito, Marco Marelli, Marco Baroni, S. Dehaene
19 Jun 2020

On the Linguistic Representational Power of Neural Machine Translation Models
Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass
MILM
01 Nov 2019

Discovering the Compositional Structure of Vector Representations with Role Learning Networks
Paul Soulos, R. Thomas McCoy, Tal Linzen, P. Smolensky
CoGe
21 Oct 2019

Using Priming to Uncover the Organization of Syntactic Representations in Neural Language Models
Grusha Prasad, Marten van Schijndel, Tal Linzen
23 Sep 2019

Analysing Neural Language Models: Contextual Decomposition Reveals Default Reasoning in Number and Gender Assignment
Jaap Jumelet, Willem H. Zuidema, Dieuwke Hupkes
LRM
19 Sep 2019

Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Songlin Yang, Shawn Tan, Alessandro Sordoni, Aaron Courville
22 Oct 2018