Spontaneous Speech Variables for Evaluating LLMs Cognitive Plausibility

22 May 2025

Papers citing "Spontaneous Speech Variables for Evaluating LLMs Cognitive Plausibility"

26 / 26 papers shown

Title
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora Alex Warstadt Aaron Mueller Leshem Choshen E. Wilcox Chengxu Zhuang ... Rafael Mosquera Bhargavi Paranjape Adina Williams Tal Linzen Ryan Cotterell 190 120 0 10 Apr 2025
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora Michael Y. Hu Aaron Mueller Candace Ross Adina Williams Tal Linzen Chengxu Zhuang Ryan Cotterell Leshem Choshen Alex Warstadt Ethan Gotlieb Wilcox 164 14 0 06 Dec 2024
Goldfish: Monolingual Language Models for 350 Languages Tyler A. Chang Catherine Arnett Zhuowen Tu Benjamin Bergen LRM 114 10 0 19 Aug 2024
Is Child-Directed Speech Effective Training Data for Language Models? Steven Y. Feng Noah D. Goodman Michael C. Frank 101 11 0 07 Aug 2024
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs Ekaterina Taktasheva Maxim Bazhukov Kirill Koncha Alena Fenogenova Ekaterina Artemova Vladislav Mikhailov 85 13 0 27 Jun 2024
[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus Leshem Choshen Ryan Cotterell Michael Y. Hu Tal Linzen Aaron Mueller Candace Ross Alex Warstadt Ethan Gotlieb Wilcox Adina Williams Chengxu Zhuang 92 24 0 09 Apr 2024
Quantifying the redundancy between prosody and text Lukas Wolf Tiago Pimentel Evelina Fedorenko Ryan Cotterell Alex Warstadt Ethan Gotlieb Wilcox Tamar I. Regev 64 11 0 28 Nov 2023
CLIMB: Curriculum Learning for Infant-inspired Model Building Richard Diehl Martinez Zébulon Goriely Hope McGovern Christopher Davis Andrew Caines P. Buttery Lisa Beinborn 65 13 0 15 Nov 2023
Analyzing Cognitive Plausibility of Subword Tokenization Lisa Beinborn Yuval Pinter 59 20 0 20 Oct 2023
A Better Way to Do Masked Language Model Scoring Carina Kauf Anna A. Ivanova 76 27 0 17 May 2023
What does BERT learn about prosody? Sofoklis Kakouros Johannah O'Mahony MILM 44 6 0 25 Apr 2023
Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus Alex Warstadt Leshem Choshen Aaron Mueller Adina Williams Ethan Gotlieb Wilcox Chengxu Zhuang 90 57 0 27 Jan 2023
Neural Language Models are not Born Equal to Fit Brain Data, but Training Helps Alexandre Pasquiou Yair Lakretz John T. Hale Bertrand Thirion Christophe Pallier 87 37 0 07 Jul 2022
Using cognitive psychology to understand GPT-3 Marcel Binz Eric Schulz ELM LLMAG 338 477 0 21 Jun 2022
Lower Perplexity is Not Always Human-Like Tatsuki Kuribayashi Yohei Oseki Takumi Ito Ryo Yoshida Masayuki Asahara Kentaro Inui 50 76 0 02 Jun 2021
CLiMP: A Benchmark for Chinese Language Model Evaluation Beilei Xiang Changbing Yang Yu Li Alex Warstadt Katharina Kann ALM 41 42 0 26 Jan 2021
When Do You Need Billions of Words of Pretraining Data? Yian Zhang Alex Warstadt Haau-Sing Li Samuel R. Bowman 60 141 0 10 Nov 2020
On the importance of pre-training data volume for compact language models Vincent Micheli Martin d'Hoffschmidt Franccois Fleuret 49 42 0 08 Oct 2020
BLiMP: The Benchmark of Linguistic Minimal Pairs for English Alex Warstadt Alicia Parrish Haokun Liu Anhad Mohananey Wei Peng Sheng-Fu Wang Samuel R. Bowman 87 493 0 02 Dec 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations Aarne Talman Antti Suni H. Çelikkanat Sofoklis Kakouros Jörg Tiedemann M. Vainio 67 31 0 06 Aug 2019
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models Allyson Ettinger 95 607 0 31 Jul 2019
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference R. Thomas McCoy Ellie Pavlick Tal Linzen 143 1,244 0 04 Feb 2019
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing Taku Kudo John Richardson 206 3,528 0 19 Aug 2018
Neural Network Acceptability Judgments Alex Warstadt Amanpreet Singh Samuel R. Bowman 242 1,413 0 31 May 2018
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates Taku Kudo 226 1,173 0 29 Apr 2018
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks Yossi Adi Einat Kermany Yonatan Belinkov Ofer Lavi Yoav Goldberg 79 546 0 15 Aug 2016