Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.14168
Cited By
Training Verifiers to Solve Math Word Problems
27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training Verifiers to Solve Math Word Problems"
50 / 3,031 papers shown
Title
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Badr AlKhamissi
Siddharth Verma
Ping Yu
Zhijing Jin
Asli Celikyilmaz
Mona T. Diab
LRM
ReLM
35
10
0
19 May 2023
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Pranjal Aggarwal
Aman Madaan
Yiming Yang
Mausam
LRM
33
38
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
36
358
0
19 May 2023
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
Shibo Hao
Tianyang Liu
Zhen Wang
Zhiting Hu
RALM
LLMAG
63
173
0
19 May 2023
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought
Tianci Xue
Ziqi Wang
Zhenhailong Wang
Chi Han
Pengfei Yu
Heng Ji
KELM
LRM
35
32
0
19 May 2023
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs
IokTong Lei
Zhidong Deng
ReLM
RALM
LRM
27
4
0
19 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
45
83
0
19 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
119
1,152
0
17 May 2023
LeTI: Learning to Generate from Textual Interactions
Xingyao Wang
Hao Peng
Reyhaneh Jabbarvand
Heng Ji
35
30
0
17 May 2023
Can Language Models Solve Graph Problems in Natural Language?
Heng Wang
Shangbin Feng
Tianxing He
Zhaoxuan Tan
Xiaochuang Han
Yulia Tsvetkov
ReLM
LRM
26
181
0
17 May 2023
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling
Weijia Xu
Andrzej Banburski-Fahey
Nebojsa Jojic
ReLM
LRM
21
32
0
17 May 2023
SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
Xi Ye
Qiaochu Chen
Işıl Dillig
Greg Durrett
ReLM
ReCod
LRM
40
62
0
16 May 2023
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility
Wen-song Ye
Mingfeng Ou
Tianyi Li
Yipeng Chen
Xuetao Ma
...
Sai Wu
Jie Fu
Gang Chen
Haobo Wang
J. Zhao
46
36
0
15 May 2023
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
Yuzhen Huang
Yuzhuo Bai
Zhihao Zhu
Junlei Zhang
Jinghan Zhang
...
Yikai Zhang
Jiayi Lei
Yao Fu
Maosong Sun
Junxian He
ELM
LRM
27
499
0
15 May 2023
Large Language Model Guided Tree-of-Thought
Jieyi Long
LM&Ro
LRM
16
185
0
15 May 2023
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
Yue Wang
Hung Le
Akhilesh Deepak Gotmare
Nghi D. Q. Bui
Junnan Li
Steven C. H. Hoi
ALM
27
461
0
13 May 2023
Comprehensive Solution Program Centric Pretraining for Table-and-Text Hybrid Numerical Reasoning
Qianying Liu
Dongsheng Yang
Wenjie Zhong
Fei Cheng
Sadao Kurohashi
AIMat
36
0
0
12 May 2023
Large Language Model Programs
Imanol Schlag
Sainbayar Sukhbaatar
Asli Celikyilmaz
Wen-tau Yih
Jason Weston
Jürgen Schmidhuber
Xian Li
LRM
38
14
0
09 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
62
719
0
09 May 2023
Code Execution with Pre-trained Language Models
Chenxiao Liu
Shuai Lu
Weizhu Chen
Daxin Jiang
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
Nan Duan
ELM
22
21
0
08 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-Wei Lee
Ee-Peng Lim
ReLM
LRM
36
316
0
06 May 2023
Self-Edit: Fault-Aware Code Editor for Code Generation
Kechi Zhang
Zhuo Li
Jia Li
Ge Li
Zhi Jin
51
94
0
06 May 2023
T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Mixed Large Language Model Signals for Science Question Answering
Lei Wang
Yilang Hu
Jiabang He
Xingdong Xu
Ning Liu
Hui-juan Liu
Hengtao Shen
LRM
MLLM
31
41
0
05 May 2023
Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Deepak Narayanan
Keshav Santhanam
Peter Henderson
Rishi Bommasani
Tony Lee
Percy Liang
145
3
0
03 May 2023
SCOTT: Self-Consistent Chain-of-Thought Distillation
Jamie Yap
Zhengyang Wang
Zheng Li
K. Lynch
Bing Yin
Xiang Ren
LRM
70
93
0
03 May 2023
Learning to Reason and Memorize with Self-Notes
Jack Lanchantin
Shubham Toshniwal
Jason Weston
Arthur Szlam
Sainbayar Sukhbaatar
ReLM
LRM
LLMAG
98
29
0
01 May 2023
Self-Evaluation Guided Beam Search for Reasoning
Yuxi Xie
Kenji Kawaguchi
Yiran Zhao
Xu Zhao
MingSung Kan
Junxian He
Qizhe Xie
LRM
166
132
0
01 May 2023
Explainable Verbal Reasoner Plus (EVR+): A Natural Language Reasoning Framework that Supports Diverse Compositional Reasoning
Zhengzhong Liang
Zeyu Zhang
Steven Bethard
Mihai Surdeanu
ReLM
LRM
28
1
0
28 Apr 2023
ChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time
Shangqing Tu
Chunyang Li
Jifan Yu
Xiaozhi Wang
Lei Hou
Juanzi Li
LLMAG
AI4MH
75
10
0
27 Apr 2023
Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Xiangyang Liu
Tianqi Pang
Chenyou Fan
FedML
LRM
44
25
0
27 Apr 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Xia Hu
LM&MA
139
626
0
26 Apr 2023
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Ori Yoran
Tomer Wolfson
Ben Bogin
Uri Katz
Daniel Deutch
Jonathan Berant
ReLM
LRM
KELM
26
94
0
25 Apr 2023
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
Jiashuo Sun
Yi Luo
Yeyun Gong
Chen Lin
Yelong Shen
Jian Guo
Nan Duan
LRM
41
19
0
23 Apr 2023
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction
Mohammadreza Pourreza
Davood Rafiei
ReLM
LRM
27
313
0
21 Apr 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad
Swarnadeep Saha
Xiang Zhou
Joey Tianyi Zhou
LRM
32
45
0
21 Apr 2023
Learning to Plan with Natural Language
Yiduo Guo
Yaobo Liang
Chenfei Wu
Wenshan Wu
Dongyan Zhao
Nan Duan
LLMAG
LRM
34
6
0
20 Apr 2023
Progressive-Hint Prompting Improves Reasoning in Large Language Models
Chuanyang Zheng
Zhengying Liu
Enze Xie
Zhenguo Li
Yu Li
LLMAG
ReLM
LRM
41
103
0
19 Apr 2023
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Zihao Wu
Lu Zhang
Chao-Yang Cao
Xiao-Xing Yu
Haixing Dai
...
Quanzheng Li
Dinggang Shen
Xiang Li
Dajiang Zhu
Tianming Liu
LM&MA
36
39
0
18 Apr 2023
Tool Learning with Foundation Models
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
...
Cheng Yang
Tongshuang Wu
Heng Ji
Zhiyuan Liu
Maosong Sun
42
200
0
17 Apr 2023
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning
Qian Liu
Fan Zhou
Zhengbao Jiang
Longxu Dou
Min-Bin Lin
18
17
0
17 Apr 2023
Solving Math Word Problems by Combining Language Models With Symbolic Solvers
Joy He-Yueya
Gabriel Poesia
Rose E. Wang
Noah D. Goodman
24
112
0
16 Apr 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Rui Pan
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
18
408
0
13 Apr 2023
What does CLIP know about a red circle? Visual prompt engineering for VLMs
Aleksandar Shtedritski
Christian Rupprecht
Andrea Vedaldi
VLM
MLLM
32
140
0
13 Apr 2023
Boosted Prompt Ensembles for Large Language Models
Silviu Pitis
Michael Ruogu Zhang
Andrew Wang
Jimmy Ba
LRM
LLMAG
24
40
0
12 Apr 2023
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Zheng Yuan
Hongyi Yuan
Chuanqi Tan
Wei Wang
Songfang Huang
Feiran Huang
ALM
40
348
0
11 Apr 2023
Learnings from Data Integration for Augmented Language Models
A. Halevy
Jane Dwivedi-Yu
30
14
0
10 Apr 2023
Why think step by step? Reasoning emerges from the locality of experience
Ben Prystawski
Michael Y. Li
Noah D. Goodman
LRM
ReLM
27
91
0
07 Apr 2023
When do you need Chain-of-Thought Prompting for ChatGPT?
Jiuhai Chen
Lichang Chen
H. Huang
Dinesh Manocha
LRM
KELM
ReLM
ELM
13
43
0
06 Apr 2023
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Zhiqiang Hu
Lei Wang
Yihuai Lan
Wanyu Xu
Ee-Peng Lim
Lidong Bing
Xing Xu
Soujanya Poria
Roy Ka-Wei Lee
ALM
51
233
0
04 Apr 2023
REFINER: Reasoning Feedback on Intermediate Representations
Debjit Paul
Mete Ismayilzada
Maxime Peyrard
Beatriz Borges
Antoine Bosselut
Robert West
Boi Faltings
ReLM
LRM
29
171
0
04 Apr 2023
Previous
1
2
3
...
56
57
58
59
60
61
Next