Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.09479
Cited By
v1
v2 (latest)
Inverse Scaling: When Bigger Isn't Better
15 June 2023
I. R. McKenzie
Alexander Lyzhov
Michael Pieler
Alicia Parrish
Aaron Mueller
Ameya Prabhu
Euan McLean
Aaron Kirtland
Alexis Ross
Alisa Liu
Andrew Gritsevskiy
Daniel Wurgaft
Derik Kauffman
Gabriel Recchia
Jiacheng Liu
Joe Cavanagh
Max Weiss
Sicong Huang
The Floating Droid
Tom Tseng
Tomasz Korbak
Xudong Shen
Yuhui Zhang
Zhengping Zhou
Najoung Kim
Sam Bowman
Ethan Perez
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Inverse Scaling: When Bigger Isn't Better"
39 / 39 papers shown
Title
How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?
Sohee Yang
Sang-Woo Lee
Nora Kassner
Daniela Gottesman
Sebastian Riedel
Mor Geva
LRM
119
0
0
12 Jun 2025
Causal Estimation of Tokenisation Bias
Pietro Lesci
Clara Meister
Thomas Hofmann
Andreas Vlachos
Tiago Pimentel
74
1
0
03 Jun 2025
SkillVerse : Assessing and Enhancing LLMs with Tree Evaluation
Yufei Tian
Jiao Sun
Nanyun Peng
Zizhao Zhang
35
0
0
31 May 2025
Probability-Consistent Preference Optimization for Enhanced LLM Reasoning
Yunqiao Yang
Houxing Ren
Zimu Lu
Ke Wang
Weikang Shi
A-Long Zhou
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
57
0
0
29 May 2025
Mis-prompt: Benchmarking Large Language Models for Proactive Error Handling
Jiayi Zeng
Yizhe Feng
Mengliang He
Wenhui Lei
Wei Zhang
Zeming Liu
Xiaoming Shi
Aimin Zhou
LRM
31
0
0
29 May 2025
NegVQA: Can Vision Language Models Understand Negation?
Yuhui Zhang
Yuchang Su
Yiming Liu
Serena Yeung-Levy
MLLM
CoGe
52
0
0
28 May 2025
Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence
Mehdi Bennis
Salem Lahlou
71
1
0
27 May 2025
The Strawberry Problem: Emergence of Character-level Understanding in Tokenized Language Models
Adrian Cosma
Stefan Ruseti
Emilian Radoi
Mihai Dascalu
LRM
79
0
0
20 May 2025
Toward the Axiomatization of Intelligence: Structure, Time, and Existence
Kei Itoh
26
0
0
20 Apr 2025
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Ryan Cotterell
202
121
0
10 Apr 2025
Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation
Bowen Baker
Joost Huizinga
Leo Gao
Zehao Dou
M. Guan
Aleksander Mądry
Wojciech Zaremba
J. Pachocki
David Farhi
LRM
188
38
0
14 Mar 2025
Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions
Emmy Liu
Amanda Bertsch
Lintang Sutawika
Lindia Tjuatja
Patrick Fernandes
...
Siyang Song
Carolin (Haas) Lawrence
Aditi Raghunathan
Kiril Gashteovski
Graham Neubig
277
3
0
05 Mar 2025
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
Richard Ren
Arunim Agarwal
Mantas Mazeika
Cristina Menghini
Robert Vacareanu
...
Matias Geralnik
Adam Khoja
Dean Lee
Summer Yue
Dan Hendrycks
HILM
ALM
173
3
0
05 Mar 2025
BIG-Bench Extra Hard
Mehran Kazemi
Bahare Fatemi
Hritik Bansal
John Palowitch
Chrysovalantis Anastasiou
...
Kate Olszewska
Yi Tay
Vinh Q. Tran
Quoc V. Le
Orhan Firat
ELM
LRM
302
13
0
26 Feb 2025
Pitfalls of Scale: Investigating the Inverse Task of Redefinition in Large Language Models
Elena Stringli
Maria Lymperaiou
Giorgos Filandrianos
Athanasios Voulodimos
Giorgos Stamou
LRM
57
0
0
18 Feb 2025
Foundations of GenIR
Qingyao Ai
Jingtao Zhan
Yang Liu
128
0
0
06 Jan 2025
Do Large Language Models Align with Core Mental Health Counseling Competencies?
Viet Cuong Nguyen
Mohammad Taher
Dongwan Hong
Vinicius Konkolics Possobom
Vibha Thirunellayi Gopalakrishnan
...
Zihang Li
H. J. Soled
Michael L. Birnbaum
Srijan Kumar
M. D. Choudhury
ELM
LM&MA
AI4MH
102
4
0
29 Oct 2024
GRADE: Quantifying Sample Diversity in Text-to-Image Models
Royi Rassin
Aviv Slobodkin
Shauli Ravfogel
Yanai Elazar
Yoav Goldberg
408
3
0
29 Oct 2024
Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning
Zhengyu Hu
Yichuan Li
Zhengyu Chen
Jiadong Wang
Han Liu
Kyumin Lee
Kaize Ding
GNN
530
1
0
09 Oct 2024
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu
Pei-Yu Lo
ReLM
LRM
131
2
0
02 Oct 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
134
16
0
27 Jul 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang
Xiaoyuan Yi
Zhihua Wei
Ziang Xiao
Shu Wang
Xing Xie
ELM
ALM
162
8
0
20 Jun 2024
[WIP] Jailbreak Paradox: The Achilles' Heel of LLMs
Abhinav Rao
Monojit Choudhury
Somak Aditya
77
0
0
18 Jun 2024
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Min Cai
Yuchen Zhang
Shichang Zhang
Fan Yin
Difan Zou
Yisong Yue
Ziniu Hu
87
1
0
04 Jun 2024
Quantifying the Capabilities of LLMs across Scale and Precision
Sher Badshah
Hassan Sajjad
76
14
0
06 May 2024
Goal-guided Generative Prompt Injection Attack on Large Language Models
Chong Zhang
Mingyu Jin
Qinkai Yu
Chengzhi Liu
Haochen Xue
Xiaobo Jin
AAML
SILM
98
16
0
06 Apr 2024
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
James Chua
Edward Rees
Hunar Batra
Samuel R. Bowman
Julian Michael
Ethan Perez
Miles Turpin
LRM
127
13
0
08 Mar 2024
Into the Unknown: Self-Learning Large Language Models
Teddy Ferdinan
Jan Kocoñ
P. Kazienko
81
3
0
14 Feb 2024
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities
Yujun Mao
Yoon Kim
Yilun Zhou
LRM
ReLM
77
23
0
13 Jan 2024
In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax
Aaron Mueller
Albert Webson
Jackson Petty
Tal Linzen
ReLM
LRM
103
16
0
13 Nov 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
84
40
0
30 Oct 2023
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Jaap Jumelet
Michael Hanna
Marianne de Heer Kloots
Anna Langedijk
Charlotte Pouw
Oskar van der Wal
82
3
0
17 Oct 2023
When can transformers reason with abstract symbols?
Enric Boix-Adserà
Omid Saremi
Emmanuel Abbe
Samy Bengio
Etai Littwin
Josh Susskind
LRM
NAI
66
17
0
15 Oct 2023
The Consensus Game: Language Model Generation via Equilibrium Search
Athul Paul Jacob
Songlin Yang
Gabriele Farina
Jacob Andreas
93
23
0
13 Oct 2023
CLEVA: Chinese Language Models EVAluation Platform
Yanyang Li
Jianqiao Zhao
Duo Zheng
Zi-Yuan Hu
Zhi Chen
...
Yongfeng Huang
Shijia Huang
Dahua Lin
Michael R. Lyu
Liwei Wang
ALM
ELM
103
11
0
09 Aug 2023
Measuring Faithfulness in Chain-of-Thought Reasoning
Tamera Lanham
Anna Chen
Ansh Radhakrishnan
Benoit Steiner
Carson E. Denison
...
Zac Hatfield-Dodds
Jared Kaplan
J. Brauner
Sam Bowman
Ethan Perez
ReLM
LRM
80
193
0
17 Jul 2023
Frontier AI Regulation: Managing Emerging Risks to Public Safety
Markus Anderljung
Joslyn Barnhart
Anton Korinek
Jade Leung
Cullen O'Keefe
...
Jonas Schuett
Yonadav Shavit
Divya Siddarth
Robert F. Trager
Kevin J. Wolf
SILM
152
125
0
06 Jul 2023
Emergent inabilities? Inverse scaling over the course of pretraining
J. Michaelov
Benjamin Bergen
LRM
ReLM
61
3
0
24 May 2023
Inverse scaling can become U-shaped
Jason W. Wei
Najoung Kim
Yi Tay
Quoc V. Le
LRM
110
64
0
03 Nov 2022
1