Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.16277
Cited By
Spontaneous Speech Variables for Evaluating LLMs Cognitive Plausibility
22 May 2025
Sheng-Fu Wang
Laurent Prevot
Jou-an Chi
Ri-Sheng Huang
Shu-Kai Hsieh
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spontaneous Speech Variables for Evaluating LLMs Cognitive Plausibility"
26 / 26 papers shown
Title
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Ryan Cotterell
190
120
0
10 Apr 2025
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Michael Y. Hu
Aaron Mueller
Candace Ross
Adina Williams
Tal Linzen
Chengxu Zhuang
Ryan Cotterell
Leshem Choshen
Alex Warstadt
Ethan Gotlieb Wilcox
164
14
0
06 Dec 2024
Goldfish: Monolingual Language Models for 350 Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
114
10
0
19 Aug 2024
Is Child-Directed Speech Effective Training Data for Language Models?
Steven Y. Feng
Noah D. Goodman
Michael C. Frank
101
11
0
07 Aug 2024
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs
Ekaterina Taktasheva
Maxim Bazhukov
Kirill Koncha
Alena Fenogenova
Ekaterina Artemova
Vladislav Mikhailov
85
13
0
27 Jun 2024
[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Leshem Choshen
Ryan Cotterell
Michael Y. Hu
Tal Linzen
Aaron Mueller
Candace Ross
Alex Warstadt
Ethan Gotlieb Wilcox
Adina Williams
Chengxu Zhuang
92
24
0
09 Apr 2024
Quantifying the redundancy between prosody and text
Lukas Wolf
Tiago Pimentel
Evelina Fedorenko
Ryan Cotterell
Alex Warstadt
Ethan Gotlieb Wilcox
Tamar I. Regev
64
11
0
28 Nov 2023
CLIMB: Curriculum Learning for Infant-inspired Model Building
Richard Diehl Martinez
Zébulon Goriely
Hope McGovern
Christopher Davis
Andrew Caines
P. Buttery
Lisa Beinborn
65
13
0
15 Nov 2023
Analyzing Cognitive Plausibility of Subword Tokenization
Lisa Beinborn
Yuval Pinter
59
20
0
20 Oct 2023
A Better Way to Do Masked Language Model Scoring
Carina Kauf
Anna A. Ivanova
76
27
0
17 May 2023
What does BERT learn about prosody?
Sofoklis Kakouros
Johannah O'Mahony
MILM
44
6
0
25 Apr 2023
Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Alex Warstadt
Leshem Choshen
Aaron Mueller
Adina Williams
Ethan Gotlieb Wilcox
Chengxu Zhuang
90
57
0
27 Jan 2023
Neural Language Models are not Born Equal to Fit Brain Data, but Training Helps
Alexandre Pasquiou
Yair Lakretz
John T. Hale
Bertrand Thirion
Christophe Pallier
87
37
0
07 Jul 2022
Using cognitive psychology to understand GPT-3
Marcel Binz
Eric Schulz
ELM
LLMAG
338
477
0
21 Jun 2022
Lower Perplexity is Not Always Human-Like
Tatsuki Kuribayashi
Yohei Oseki
Takumi Ito
Ryo Yoshida
Masayuki Asahara
Kentaro Inui
50
76
0
02 Jun 2021
CLiMP: A Benchmark for Chinese Language Model Evaluation
Beilei Xiang
Changbing Yang
Yu Li
Alex Warstadt
Katharina Kann
ALM
41
42
0
26 Jan 2021
When Do You Need Billions of Words of Pretraining Data?
Yian Zhang
Alex Warstadt
Haau-Sing Li
Samuel R. Bowman
60
141
0
10 Nov 2020
On the importance of pre-training data volume for compact language models
Vincent Micheli
Martin d'Hoffschmidt
Franccois Fleuret
49
42
0
08 Oct 2020
BLiMP: The Benchmark of Linguistic Minimal Pairs for English
Alex Warstadt
Alicia Parrish
Haokun Liu
Anhad Mohananey
Wei Peng
Sheng-Fu Wang
Samuel R. Bowman
87
493
0
02 Dec 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations
Aarne Talman
Antti Suni
H. Çelikkanat
Sofoklis Kakouros
Jörg Tiedemann
M. Vainio
67
31
0
06 Aug 2019
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models
Allyson Ettinger
95
607
0
31 Jul 2019
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
R. Thomas McCoy
Ellie Pavlick
Tal Linzen
143
1,244
0
04 Feb 2019
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
206
3,528
0
19 Aug 2018
Neural Network Acceptability Judgments
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
242
1,413
0
31 May 2018
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
226
1,173
0
29 Apr 2018
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks
Yossi Adi
Einat Kermany
Yonatan Belinkov
Ofer Lavi
Yoav Goldberg
79
546
0
15 Aug 2016
1