Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.04347
Cited By
v1
v2 (latest)
World Models for Math Story Problems
7 June 2023
Andreas Opedal
Niklas Stoehr
Abulhair Saparov
Mrinmaya Sachan
ReLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"World Models for Math Story Problems"
45 / 45 papers shown
Title
Multilingual Performance Biases of Large Language Models in Education
Vansh Gupta
Sankalan Pal Chowdhury
Vilém Zouhar
Donya Rooein
Mrinmaya Sachan
AI4Ed
LRM
147
2
0
24 Apr 2025
Multi-Agent LLM Actor-Critic Framework for Social Robot Navigation
Weizheng Wang
Ike Obi
Byung-Cheol Min
LLMAG
171
2
0
12 Mar 2025
Beyond Pattern Recognition: Probing Mental Representations of LMs
Moritz Miller
Kumar Shridhar
ReLM
LRM
120
0
0
23 Feb 2025
MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs
Andreas Opedal
Haruki Shirakami
Bernhard Schölkopf
Abulhair Saparov
Mrinmaya Sachan
LRM
154
3
0
17 Feb 2025
Multi-tool Integration Application for Math Reasoning Using Large Language Model
Zhihua Duan
Jialin Wang
LLMAG
LRM
104
0
0
22 Aug 2024
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents
Giorgio Piatti
Zhijing Jin
Max Kleiman-Weiner
Bernhard Schölkopf
Mrinmaya Sachan
Rada Mihalcea
LLMAG
111
25
0
25 Apr 2024
Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?
Andreas Opedal
Alessandro Stolfo
Haruki Shirakami
Ying Jiao
Ryan Cotterell
Bernhard Schölkopf
Abulhair Saparov
Mrinmaya Sachan
LRM
130
16
0
31 Jan 2024
Distilling Reasoning Capabilities into Smaller Language Models
Kumar Shridhar
Alessandro Stolfo
Mrinmaya Sachan
LRM
ReLM
129
176
0
01 Dec 2022
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
Kumar Shridhar
Jakub Macina
Mennatallah El-Assady
Tanmay Sinha
Manu Kapur
Mrinmaya Sachan
AIMat
104
49
0
23 Nov 2022
Cross-domain Generalization for AMR Parsing
Xuefeng Bai
Sen Yang
Leyang Cui
Linfeng Song
Yue Zhang
107
2
0
22 Oct 2022
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
Alessandro Stolfo
Zhijing Jin
Kumar Shridhar
Bernhard Schölkopf
Mrinmaya Sachan
ELM
OOD
LRM
147
66
0
21 Oct 2022
Language Models of Code are Few-Shot Commonsense Learners
Aman Madaan
Shuyan Zhou
Uri Alon
Yiming Yang
Graham Neubig
ReLM
LRM
148
223
0
13 Oct 2022
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought
Abulhair Saparov
He He
ELM
LRM
ReLM
307
315
0
03 Oct 2022
Compositional Semantic Parsing with Large Language Models
Andrew Drozdov
Nathanael Scharli
Ekin Akyuurek
Nathan Scales
Xinying Song
Xinyun Chen
Olivier Bousquet
Denny Zhou
ReLM
LRM
271
94
0
29 Sep 2022
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLM
ELM
LRM
266
866
0
29 Jun 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
Denny Zhou
Nathanael Scharli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALM
LRM
AI4CE
109
1,139
0
21 May 2022
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning
Zhicheng YANG
Jinghui Qin
Jiaqi Chen
Liang Lin
Xiaodan Liang
ReLM
LRM
86
33
0
17 May 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
650
6,325
0
05 Apr 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
1.3K
13,290
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
1.1K
9,827
0
28 Jan 2022
A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level
Iddo Drori
Sarah J. Zhang
Reece Shuttleworth
Leonard Tang
Albert Lu
...
J. Lynch
A. Shporer
Nakul Verma
Eugene Wu
G. Strang
AIMat
ReLM
139
153
0
31 Dec 2021
Improving Compositional Generalization with Latent Structure and Data Augmentation
Linlu Qiu
Peter Shaw
Panupong Pasupat
Pawel Krzysztof Nowak
Tal Linzen
Fei Sha
Kristina Toutanova
CoGe
104
57
0
14 Dec 2021
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
442
4,609
0
27 Oct 2021
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
302
5,702
0
07 Jul 2021
A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
Shen-Yun Miao
Chao-Chun Liang
Keh-Yih Su
79
344
0
30 Jun 2021
Question Generation for Adaptive Education
Megha Srivastava
Noah D. Goodman
AI4Ed
75
41
0
08 Jun 2021
Towards General Natural Language Understanding with Probabilistic Worldbuilding
Abulhair Saparov
Tom Michael Mitchell
101
6
0
06 May 2021
Constrained Language Models Yield Few-Shot Semantic Parsers
Richard Shin
C. H. Lin
Sam Thomson
Charles C. Chen
Subhro Roy
Emmanouil Antonios Platanios
Adam Pauls
Dan Klein
J. Eisner
Benjamin Van Durme
402
206
0
18 Apr 2021
NT5?! Training T5 to Perform Numerical Reasoning
Peng Yang
Ying Chen
Yuechan Chen
Daniel Cer
AIMat
LRM
75
15
0
15 Apr 2021
Are NLP Models really able to Solve Simple Math Word Problems?
Arkil Patel
S. Bhattamishra
Navin Goyal
ReLM
LRM
152
852
0
12 Mar 2021
ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language
Oyvind Tafjord
Bhavana Dalvi
Peter Clark
123
279
0
24 Dec 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.2K
42,753
0
28 May 2020
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
270
10,934
0
29 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
990
20,462
0
23 Oct 2019
Core Semantic First: A Top-down Approach for AMR Parsing
Deng Cai
W. Lam
GNN
76
54
0
10 Sep 2019
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini
Saadia Gabriel
Shanchuan Lin
Rik Koncel-Kedziorski
Yejin Choi
Hannaneh Hajishirzi
AIMat
ReLM
AI4CE
160
583
0
30 May 2019
AMR Parsing as Sequence-to-Graph Transduction
Sheng Zhang
Xutai Ma
Kevin Duh
Benjamin Van Durme
87
148
0
21 May 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
678
5,897
0
21 Apr 2019
Multilingual Constituency Parsing with Self-Attention and Pre-Training
Nikita Kitaev
Steven Cao
Dan Klein
LRM
93
255
0
31 Dec 2018
Constituency Parsing with a Self-Attentive Encoder
Nikita Kitaev
Dan Klein
111
544
0
02 May 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
253
3,001
0
23 Apr 2018
Mapping to Declarative Knowledge for Word Problem Solving
Subhro Roy
Dan Roth
ReLM
AIMat
59
104
0
26 Dec 2017
Unit Dependency Graph and its Application to Arithmetic Word Problem Solving
Subhro Roy
Dan Roth
AIMat
80
98
0
03 Dec 2016
A Theme-Rewriting Approach for Generating Algebra Word Problems
Rik Koncel-Kedziorski
Ioannis Konstas
Luke Zettlemoyer
Hannaneh Hajishirzi
AIMat
184
32
0
19 Oct 2016
Solving General Arithmetic Word Problems
Subhro Roy
Dan Roth
AIMat
110
484
0
04 Aug 2016
1