Dissociating language and thought in large language models

16 January 2023

Nancy Kanwisher

Papers citing "Dissociating language and thought in large language models"

Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review Luoma Ke Song Tong Peng Cheng Kaiping Peng OffRL LM&MA 117 21 0 03 Jan 2024
Divergences between Language Models and Human Brains Yuchen Zhou Emmy Liu Graham Neubig Michael J. Tarr Leila Wehbe 64 3 0 15 Nov 2023
Large Language Models Michael R Douglas LLMAG LM&MA 127 616 0 11 Jul 2023
Personality Traits in Large Language Models Gregory Serapio-García Mustafa Safdari Clément Crepy Luning Sun Stephen Fitz P. Romero Marwa Abdulhai Aleksandra Faust Maja J. Matarić LM&MA LLMAG 87 123 0 01 Jul 2023
Rethinking Model Evaluation as Narrowing the Socio-Technical Gap Q. V. Liao Ziang Xiao ALM ELM 92 32 0 01 Jun 2023
Uncontrolled Lexical Exposure Leads to Overestimation of Compositional Generalization in Pretrained Models Najoung Kim Tal Linzen P. Smolensky 77 32 0 21 Dec 2022
A fine-grained comparison of pragmatic language understanding in humans and language models Jennifer Hu Sammy Floyd Olessia Jouravlev Evelina Fedorenko E. Gibson 56 59 0 13 Dec 2022
Talking About Large Language Models Murray Shanahan AI4CE 86 266 0 07 Dec 2022
Language Models as Agent Models Jacob Andreas LLMAG 54 138 0 03 Dec 2022
Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models Aaron Mueller Yudi Xia Tal Linzen MILM 72 10 0 25 Oct 2022
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs Maarten Sap Ronan Le Bras Daniel Fried Yejin Choi 67 223 0 24 Oct 2022
Syntactic Surprisal From Neural Models Predicts, But Underestimates, Human Processing Difficulty From Syntactic Ambiguities Suhas Arehalli Brian Dillon Tal Linzen 72 40 0 21 Oct 2022
Measuring and Narrowing the Compositionality Gap in Language Models Ofir Press Muru Zhang Sewon Min Ludwig Schmidt Noah A. Smith M. Lewis ReLM KELM LRM 143 617 0 07 Oct 2022
Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks Masahiro Kaneko Danushka Bollegala Naoaki Okazaki 43 44 0 06 Oct 2022
Entailment Semantics Can Be Extracted from an Ideal Language Model William Merrill Alex Warstadt Tal Linzen 124 14 0 26 Sep 2022
Subject Verb Agreement Error Patterns in Meaningless Sentences: Humans vs. BERT Karim Lasri Olga Seminck Alessandro Lenci Thierry Poibeau 78 4 0 21 Sep 2022
What Artificial Neural Networks Can Tell Us About Human Language Acquisition Alex Warstadt Samuel R. Bowman 59 118 0 17 Aug 2022
Meaning without reference in large language models S. Piantadosi Felix Hill 52 75 0 05 Aug 2022
Unit Testing for Concepts in Neural Networks Charles Lovering Ellie Pavlick 37 28 0 28 Jul 2022
Solving Quantitative Reasoning Problems with Language Models Aitor Lewkowycz Anders Andreassen David Dohan Ethan Dyer Henryk Michalewski ... Theo Gutman-Solo Yuhuai Wu Behnam Neyshabur Guy Gur-Ari Vedant Misra ReLM ELM LRM 138 827 0 29 Jun 2022
Emergent Abilities of Large Language Models Jason W. Wei Yi Tay Rishi Bommasani Colin Raffel Barret Zoph ... Tatsunori Hashimoto Oriol Vinyals Percy Liang J. Dean W. Fedus ELM ReLM LRM 261 2,462 0 15 Jun 2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Aarohi Srivastava Abhinav Rastogi Abhishek Rao Abu Awal Md Shoeb Abubakar Abid ... Zhuoye Zhao Zijian Wang Zijie J. Wang Zirui Wang Ziyi Wu ELM 138 1,746 0 09 Jun 2022
On the Paradox of Learning to Reason from Data Honghua Zhang Liunian Harold Li Tao Meng Kai-Wei Chang Guy Van den Broeck NAI ReLM OOD LRM 175 106 0 23 May 2022
Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks Katherine M. Collins Catherine Wong Jiahai Feng Megan Wei J. Tenenbaum LRM 65 61 0 11 May 2022
When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it Sebastian Schuster Tal Linzen 40 25 0 06 May 2022
When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes Mycal Tucker Tiwalayo Eisape Peng Qian R. Levy J. Shah MILM 38 12 0 20 Apr 2022
Probing for the Usage of Grammatical Number Karim Lasri Tiago Pimentel Alessandro Lenci Thierry Poibeau Ryan Cotterell 50 58 0 19 Apr 2022
Can language models learn from explanations in context? Andrew Kyle Lampinen Ishita Dasgupta Stephanie C. Y. Chan Kory Matthewson Michael Henry Tessler Antonia Creswell James L. McClelland Jane X. Wang Felix Hill LRM ReLM 140 297 0 05 Apr 2022
PaLM: Scaling Language Modeling with Pathways Aakanksha Chowdhery Sharan Narang Jacob Devlin Maarten Bosma Gaurav Mishra ... Kathy Meier-Hellstern Douglas Eck J. Dean Slav Petrov Noah Fiedel PILM LRM 416 6,202 0 05 Apr 2022
Things not Written in Text: Exploring Spatial Commonsense from Visual Signals Xiao Liu Da Yin Yansong Feng Dongyan Zhao LRM 42 45 0 15 Mar 2022
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 754 12,835 0 04 Mar 2022
Mixture-of-Experts with Expert Choice Routing Yan-Quan Zhou Tao Lei Han-Chu Liu Nan Du Yanping Huang Vincent Zhao Andrew M. Dai Zhifeng Chen Quoc V. Le James Laudon MoE 257 350 0 18 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Jason W. Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Brian Ichter F. Xia Ed H. Chi Quoc Le Denny Zhou LM&Ro LRM AI4CE ReLM 738 9,267 0 28 Jan 2022
Relational Memory Augmented Language Models Qi Liu Dani Yogatama Phil Blunsom KELM RALM 94 32 0 24 Jan 2022
LongT5: Efficient Text-To-Text Transformer for Long Sequences Mandy Guo Joshua Ainslie David C. Uthus Santiago Ontanon Jianmo Ni Yun-hsuan Sung Yinfei Yang VLM 55 312 0 15 Dec 2021
Scaling Language Models: Methods, Analysis & Insights from Training Gopher Jack W. Rae Sebastian Borgeaud Trevor Cai Katie Millican Jordan Hoffmann ... Jeff Stanway L. Bennett Demis Hassabis Koray Kavukcuoglu G. Irving 99 1,309 0 08 Dec 2021
Improving language models by retrieving from trillions of tokens Sebastian Borgeaud A. Mensch Jordan Hoffmann Trevor Cai Eliza Rutherford ... Simon Osindero Karen Simonyan Jack W. Rae Erich Elsen Laurent Sifre KELM RALM 206 1,082 0 08 Dec 2021
Causal Distillation for Language Models Zhengxuan Wu Atticus Geiger J. Rozner Elisa Kreiss Hanson Lu Thomas Icard Christopher Potts Noah D. Goodman 96 25 0 05 Dec 2021
Show Your Work: Scratchpads for Intermediate Computation with Language Models Maxwell Nye Anders Andreassen Guy Gur-Ari Henryk Michalewski Jacob Austin ... Aitor Lewkowycz Maarten Bosma D. Luan Charles Sutton Augustus Odena ReLM LRM 159 737 0 30 Nov 2021
How much do language models copy from their training data? Evaluating linguistic novelty in text generation using RAVEN R. Thomas McCoy P. Smolensky Tal Linzen Jianfeng Gao Asli Celikyilmaz SyDa 52 121 0 18 Nov 2021
The Dangers of Underclaiming: Reasons for Caution When Reporting How NLP Systems Fail Sam Bowman OffRL 56 45 0 15 Oct 2021
Causal Transformers Perform Below Chance on Recursive Nested Constructions, Unlike Humans Yair Lakretz T. Desbordes Dieuwke Hupkes S. Dehaene 331 11 0 14 Oct 2021
Systematic Inequalities in Language Technology Performance across the World's Languages Damián E. Blasi Antonios Anastasopoulos Graham Neubig 155 137 0 13 Oct 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference Sneha Kudugunta Yanping Huang Ankur Bapna M. Krikun Dmitry Lepikhin Minh-Thang Luong Orhan Firat MoE 215 108 0 24 Sep 2021
Frequency Effects on Syntactic Rule Learning in Transformers Jason W. Wei Dan Garrette Tal Linzen Ellie Pavlick 109 67 0 14 Sep 2021
Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color Mostafa Abdou Artur Kulmizev Daniel Hershcovich Stella Frank Ellie Pavlick Anders Søgaard 67 122 0 13 Sep 2021
A Bayesian Framework for Information-Theoretic Probing Tiago Pimentel Ryan Cotterell 52 24 0 08 Sep 2021
Teaching Autoregressive Language Models Complex Tasks By Demonstration Gabriel Recchia 55 22 0 05 Sep 2021
A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers Shen-Yun Miao Chao-Chun Liang Keh-Yih Su 62 338 0 30 Jun 2021
On the proper role of linguistically-oriented deep net analysis in linguistic theorizing Marco Baroni 96 52 0 16 Jun 2021