2110.14168
Cited By
Training Verifiers to Solve Math Word Problems
27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
Papers citing
"Training Verifiers to Solve Math Word Problems"
50 / 3,031 papers shown
Title
AutoHint: Automatic Prompt Optimization with Hint Generation
Hong Sun
Xue Li
Yi Xu
Youkow Homma
Qinhao Cao
Min-man Wu
Jian Jiao
Denis Xavier Charles
34
23
0
13 Jul 2023
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Mian
OffRL
70
529
0
12 Jul 2023
Teaching Arithmetic to Small Transformers
Nayoung Lee
Kartik K. Sreenivasan
Jason D. Lee
Kangwook Lee
Dimitris Papailiopoulos
LRM
32
81
0
07 Jul 2023
Chain of Thought Prompting Elicits Knowledge Augmentation
Di Wu
Jing Zhang
Xinmei Huang
LRM
28
31
0
04 Jul 2023
MWPRanker: An Expression Similarity Based Math Word Problem Retriever
Mayank Goel
Venktesh V
Vikram Goyal
21
1
0
03 Jul 2023
Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models
Yiming Wang
ZhuoSheng Zhang
Pei Zhang
Baosong Yang
Rui Wang
ReLM
LRM
26
6
0
30 Jun 2023
Stay on topic with Classifier-Free Guidance
Guillaume Sanchez
Honglu Fan
Alexander Spangher
Elad Levi
Pawan Sasanka Ammanamanchi
Stella Biderman
3DV
30
47
0
30 Jun 2023
Look, Remember and Reason: Grounded reasoning in videos with language models
Apratim Bhattacharyya
Sunny Panchal
Mingu Lee
Reza Pourreza
Pulkit Madan
Roland Memisevic
LRM
35
7
0
30 Jun 2023
CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?
Tianwen Wei
Jian Luan
Wei Liu
Shuang Dong
Bin Wang
ELM
25
30
0
29 Jun 2023
Length Generalization in Arithmetic Transformers
Samy Jelassi
Stéphane d'Ascoli
Carles Domingo-Enrich
Yuhuai Wu
Yuan-Fang Li
François Charton
30
38
0
27 Jun 2023
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements
Syed Rifat Raiyan
Md. Nafis Faiyaz
S. Kabir
Mohsinul Kabir
H. Mahmud
Md. Kamrul Hasan
38
11
0
24 Jun 2023
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
Rishabh Agarwal
Nino Vieillard
Yongchao Zhou
Piotr Stańczyk
Sabela Ramos
Matthieu Geist
Olivier Bachem
43
4
0
23 Jun 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
Yuchen Zhuang
Yue Yu
Kuan-Chieh Jackson Wang
Haotian Sun
Chao Zhang
ELM
LLMAG
30
216
0
23 Jun 2023
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani
Ali Beyram
H. Shrivastava
23
1
0
22 Jun 2023
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
Miao Xiong
Zhiyuan Hu
Xinyang Lu
Yifei Li
Jie Fu
Junxian He
Bryan Hooi
33
375
0
22 Jun 2023
JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving
Wayne Xin Zhao
Kun Zhou
Beichen Zhang
Zheng Gong
Zhipeng Chen
...
Ji-Rong Wen
Jing Sha
Shijin Wang
Cong Liu
Guoping Hu
MoE
LRM
52
5
0
19 Jun 2023
Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students via Personalization
Swarnadeep Saha
Peter Hase
Mohit Bansal
LRM
30
10
0
15 Jun 2023
CMMLU: Measuring massive multitask language understanding in Chinese
Haonan Li
Yixuan Zhang
Fajri Koto
Yifei Yang
Hai Zhao
Yeyun Gong
Nan Duan
Tim Baldwin
ALM
ELM
47
239
0
15 Jun 2023
Learning by Analogy: Diverse Questions Generation in Math Word Problem
Zihao Zhou
Maizhen Ning
Qiufeng Wang
Jie Yao
Wei Wang
Xiaowei Huang
Kaizhu Huang
AIMat
25
20
0
15 Jun 2023
Improving Knowledge Extraction from LLMs for Task Learning through Agent Analysis
James R. Kirk
R. Wray
Peter Lindes
John E. Laird
LLMAG
36
3
0
11 Jun 2023
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability
Jiacheng Ye
Xijia Tao
Lingpeng Kong
LRM
37
24
0
11 Jun 2023
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
Jingbo Wang
Qiushi Sun
Xiang Li
Ming Gao
ReLM
LRM
26
65
0
10 Jun 2023
Human-in-the-Loop through Chain-of-Thought
Zefan Cai
Baobao Chang
Wenjuan Han
LRM
22
23
0
10 Jun 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
51
3,854
0
09 Jun 2023
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance
Qianqian Xie
Weiguang Han
Xiao Zhang
Yanzhao Lai
Min Peng
Alejandro Lopez-Lira
Jimin Huang
ALM
20
136
0
08 Jun 2023
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
Yidong Wang
Zhuohao Yu
Zhengran Zeng
Linyi Yang
Cunxiang Wang
...
Jindong Wang
Xingxu Xie
Wei Ye
Shi-Bo Zhang
Yue Zhang
ALM
ELM
51
227
0
08 Jun 2023
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
Yizhong Wang
Hamish Ivison
Pradeep Dasigi
Jack Hessel
Tushar Khot
...
David Wadden
Kelsey MacMillan
Noah A. Smith
Iz Beltagy
Hannaneh Hajishirzi
ALM
ELM
13
369
0
07 Jun 2023
World Models for Math Story Problems
Andreas Opedal
Niklas Stoehr
Abulhair Saparov
Mrinmaya Sachan
ReLM
36
12
0
07 Jun 2023
Certified Deductive Reasoning with Language Models
Gabriel Poesia
Kanishk Gandhi
E. Zelikman
Noah D. Goodman
ELM
ReLM
LRM
37
0
0
06 Jun 2023
ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory
Chenxu Hu
Jie Fu
Chenzhuang Du
Simian Luo
J. Zhao
Hang Zhao
KELM
LLMAG
35
105
0
06 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLM
LRM
32
125
0
06 Jun 2023
Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models
Fobo Shi
Peijun Qing
Ke Wang
Nan Wang
Youbo Lei
H. Lu
Xiaodong Lin
Duantengchuan Li
VLM
ReLM
LLMAG
LRM
31
11
0
06 Jun 2023
Natural Language Commanding via Program Synthesis
Apurva Gandhi
Thong Q. Nguyen
Huitian Jiao
R. Steen
Ameya Bhatawdekar
29
7
0
06 Jun 2023
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Junling Liu
Peilin Zhou
Yining Hua
Dading Chong
Zhongyu Tian
...
Helin Wang
Chenyu You
Zhenhua Guo
Lei Zhu
Michael Lingzhi Li
LM&MA
ELM
17
66
0
05 Jun 2023
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning
Beichen Zhang
Kun Zhou
Xilin Wei
Wayne Xin Zhao
Jing Sha
Shijin Wang
Ji-Rong Wen
LRM
38
36
0
04 Jun 2023
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Georgios Peikos
S. Symeonidis
Pranav Kasela
G. Pasi
LM&MA
29
12
0
03 Jun 2023
Learning Multi-Step Reasoning by Solving Arithmetic Tasks
Tianduo Wang
Wei Lu
ReLM
LRM
26
14
0
02 Jun 2023
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents
Yiran Wu
Feiran Jia
Shaokun Zhang
Han-Tai Li
Erkang Zhu
Yue Wang
Y. Lee
Richard Peng
Qingyun Wu
Chi Wang
LLMAG
29
49
0
02 Jun 2023
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Ji Lin
Jiaming Tang
Haotian Tang
Shang Yang
Wei-Ming Chen
Wei-Chen Wang
Guangxuan Xiao
Xingyu Dang
Chuang Gan
Song Han
EDL
MQ
36
474
0
01 Jun 2023
Interpretable Math Word Problem Solution Generation Via Step-by-step Planning
Mengxue Zhang
Zichao Wang
Zhichao Yang
Weiqi Feng
Andrew S. Lan
LRM
22
16
0
01 Jun 2023
Chain-Of-Thought Prompting Under Streaming Batch: A Case Study
Yuxin Tang
LRM
25
2
0
01 Jun 2023
Let's Verify Step by Step
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM
OffRL
LRM
30
887
0
31 May 2023
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate
Tian Liang
Zhiwei He
Wenxiang Jiao
Xing Wang
Rui Wang
Yujiu Yang
Zhaopeng Tu
Shuming Shi
LLMAG
LRM
37
406
0
30 May 2023
Graph Reasoning for Question Answering with Triplet Retrieval
Shiyang Li
Yifan Gao
Hao Jiang
Qingyu Yin
Zheng Li
Xifeng Yan
Chao Zhang
Bing Yin
RALM
ReLM
18
29
0
30 May 2023
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning
Zhanming Jie
Wei Lu
LRM
ReLM
35
15
0
29 May 2023
Code Prompting: a Neural Symbolic Method for Complex Reasoning in Large Language Models
Yitao Hu
Haotong Yang
Zhouchen Lin
Muhan Zhang
ReLM
LRM
28
15
0
29 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Chenyu You
J. Huang
LM&MA
ELM
ALM
49
179
0
29 May 2023
Tab-CoT: Zero-shot Tabular Chain of Thought
Ziqi Jin
Wei Lu
ReLM
LMTD
LRM
17
33
0
28 May 2023
Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective
Khanh Nguyen
LRM
29
8
0
28 May 2023
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
Minki Kang
Seanie Lee
Jinheon Baek
Kenji Kawaguchi
Sung Ju Hwang
ALM
LRM
57
56
0
28 May 2023