The Factorization Curse: Which Tokens You Predict Underlie the Reversal
Curse and More

The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More

7 June 2024

Diane Bouchacourt

Mike Rabbat

Papers citing "The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More"

10 / 10 papers shown

Title
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Vaishnavh Nagarajan Chen Henry Wu Charles Ding Aditi Raghunathan 36 0 0 21 Apr 2025
Looking beyond the next token Abitha Thankaraj Yiding Jiang J. Zico Kolter Yonatan Bisk LRM 57 1 0 15 Apr 2025
Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure Boshi Wang Huan Sun 36 2 0 02 Apr 2025
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More Arvid Frydenlund LRM 52 0 0 13 Mar 2025
Large Language Diffusion Models Shen Nie Fengqi Zhu Zebin You Xiaolu Zhang Jingyang Ou Jun Hu Jun Zhou Yankai Lin Zhicheng Dou Chongxuan Li 112 17 0 14 Feb 2025
Scaling up Masked Diffusion Models on Text Shen Nie Fengqi Zhu Chao Du Tianyu Pang Qian Liu Guangtao Zeng Min-Bin Lin Chongxuan Li AI4CE 50 14 0 24 Oct 2024
Unmasking Trees for Tabular Data Calvin McCarter 37 3 0 08 Jul 2024
Changing Answer Order Can Decrease MMLU Accuracy Vipul Gupta David Pantoja Candace Ross Adina Williams Megan Ung 64 22 0 27 Jun 2024
Punctuation Restoration Improves Structure Understanding Without Supervision Junghyun Min Minho Lee Woochul Lee Yeonsoo Lee 59 1 0 13 Feb 2024
ANLIzing the Adversarial Natural Language Inference Dataset Adina Williams Tristan Thrush Douwe Kiela AAML 174 46 0 24 Oct 2020