2110.14168
Training Verifiers to Solve Math Word Problems
27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
Papers citing
"Training Verifiers to Solve Math Word Problems"
50 / 3,090 papers shown
Title
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Han Guo
P. Greengard
Eric P. Xing
Yoon Kim
MQ
38
44
0
20 Nov 2023
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
David Rein
Betty Li Hou
Asa Cooper Stickland
Jackson Petty
Richard Yuanzhe Pang
Julien Dirani
Julian Michael
Samuel R. Bowman
AI4MH
ELM
48
513
0
20 Nov 2023
System 2 Attention (is something you might need too)
Jason Weston
Sainbayar Sukhbaatar
RALM
OffRL
LRM
35
58
0
20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
ZhuoSheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
44
53
0
20 Nov 2023
InfiMM-Eval: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models
Xiaotian Han
Quanzeng You
Yongfei Liu
Wentao Chen
Huangjie Zheng
...
Yiqi Wang
Bohan Zhai
Jianbo Yuan
Heng Wang
Hongxia Yang
ReLM
LRM
ELM
97
9
0
20 Nov 2023
MultiLoRA: Democratizing LoRA for Better Multi-Task Learning
Yiming Wang
Yu Lin
Xiaodong Zeng
Guannan Zhang
MoMe
47
20
0
20 Nov 2023
Meta Prompting for AI Systems
Yifan Zhang
Yang Yuan
Andrew Chi-Chih Yao
LLMAG
LRM
29
5
0
20 Nov 2023
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Yilun Kong
Jingqing Ruan
Yihong Chen
Bin Zhang
Tianpeng Bao
...
Xiaoru Hu
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
47
38
0
19 Nov 2023
Token-Level Adaptation of LoRA Adapters for Downstream Task Generalization
Joshua Belofsky
MoMe
21
13
0
17 Nov 2023
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data
Yilun Zhao
Yitao Long
Hongjun Liu
Linyong Nan
Lyuhao Chen
Ryo Kamoi
Yixin Liu
Xiangru Tang
Rui Zhang
Arman Cohan
36
14
0
16 Nov 2023
Investigating Data Contamination in Modern Benchmarks for Large Language Models
Chunyuan Deng
Yilun Zhao
Xiangru Tang
Mark B. Gerstein
Arman Cohan
AAML
ELM
27
53
0
16 Nov 2023
To be or not to be? an exploration of continuously controllable prompt engineering
Yuhan Sun
Mukai Li
Yixin Cao
Kun Wang
Wenxiao Wang
Xingyu Zeng
Rui Zhao
LLMAG
37
2
0
16 Nov 2023
OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning
Fei Yu
Anningzhe Gao
Benyou Wang
OffRL
LRM
25
44
0
16 Nov 2023
Automatic Engineering of Long Prompts
Cho-Jui Hsieh
Si Si
Felix X. Yu
Inderjit S. Dhillon
VLM
32
8
0
16 Nov 2023
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying
Adithya Renduchintala
Tugrul Konuk
Oleksii Kuchaiev
MoMe
41
42
0
16 Nov 2023
Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation
Yao Lu
Jiayi Wang
Raphael Tang
Sebastian Riedel
Pontus Stenetorp
40
2
0
16 Nov 2023
Program-Aided Reasoners (better) Know What They Know
Anubha Kabra
Sanketh Rangreji
Yash Mathur
Aman Madaan
Emmy Liu
Graham Neubig
LRM
34
0
0
16 Nov 2023
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
Sen Yang
Xin Li
Leyang Cui
Li Bing
Wai Lam
LRM
NAI
39
16
0
16 Nov 2023
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Nicholas Farn
Richard Shin
LLMAG
ELM
40
14
0
15 Nov 2023
When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour
Leonardo Ranaldi
Giulia Pucci
27
33
0
15 Nov 2023
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models
Fangzhi Xu
Zhiyong Wu
Qiushi Sun
Siyu Ren
Fei Yuan
Shuai Yuan
Qika Lin
Yu Qiao
Jun Liu
LLMAG
35
33
0
15 Nov 2023
Contrastive Chain-of-Thought Prompting
Yew Ken Chia
Guizhen Chen
Anh Tuan Luu
Soujanya Poria
Lidong Bing
LRM
AI4CE
68
32
0
15 Nov 2023
Towards Verifiable Text Generation with Symbolic References
Lucas Torroba Hennigen
Zejiang Shen
Aniruddha Nrusimha
Bernhard Gapp
David Sontag
Yoon Kim
28
11
0
15 Nov 2023
CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models
Wenhong Zhu
Hong-ping Hao
Zhiwei He
Yun-Ze Song
Yumeng Zhang
Hanxu Hu
Yiran Wei
Rui Wang
Hongyuan Lu
AAML
ELM
23
12
0
15 Nov 2023
Towards A Unified View of Answer Calibration for Multi-Step Reasoning
Shumin Deng
Ningyu Zhang
Nay Oo
Bryan Hooi
LRM
50
2
0
15 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
30
3
0
15 Nov 2023
Speculative Contrastive Decoding
Hongyi Yuan
Keming Lu
Fei Huang
Zheng Yuan
Chang Zhou
47
5
0
15 Nov 2023
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
Chang Gao
Haiyun Jiang
Deng Cai
Shuming Shi
Wai Lam
LRM
42
3
0
15 Nov 2023
Auto-ICL: In-Context Learning without Human Supervision
Jinghan Yang
Shuming Ma
Furu Wei
37
9
0
15 Nov 2023
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Bairu Hou
Yujian Liu
Kaizhi Qian
Jacob Andreas
Shiyu Chang
Yang Zhang
UD
UQCV
PER
32
49
0
15 Nov 2023
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Keming Lu
Hongyi Yuan
Runji Lin
Junyang Lin
Zheng Yuan
Chang Zhou
Jingren Zhou
MoE
LRM
48
52
0
15 Nov 2023
Safer-Instruct: Aligning Language Models with Automated Preference Data
Taiwei Shi
Kai Chen
Jieyu Zhao
ALM
SyDa
35
21
0
15 Nov 2023
Plum: Prompt Learning using Metaheuristic
Rui Pan
Shuo Xing
Shizhe Diao
Wenhe Sun
Xiang Liu
Kashun Shum
Renjie Pi
Jipeng Zhang
Tong Zhang
VLM
OffRL
LRM
44
6
0
14 Nov 2023
How Well Do Large Language Models Understand Syntax? An Evaluation by Asking Natural Language Questions
Houquan Zhou
Yang Hou
Zhenghua Li
Xuebin Wang
Zhefeng Wang
Xinyu Duan
Min Zhang
ELM
15
5
0
14 Nov 2023
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
Lei Lin
Jiayi Fu
Pengli Liu
Qingyang Li
Yan Gong
Junchen Wan
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
32
7
0
14 Nov 2023
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration
Zhenran Xu
Senbao Shi
Baotian Hu
Jindi Yu
Dongfang Li
Min Zhang
Yuxiang Wu
LRM
LLMAG
ALM
66
22
0
14 Nov 2023
SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks
Mengsay Loem
Masahiro Kaneko
Naoaki Okazaki
LRM
29
5
0
14 Nov 2023
Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts
Leonardo Ranaldi
Giulia Pucci
Federico Ranaldi
Elena Sofia Ruzzetti
Fabio Massimo Zanzotto
LRM
32
12
0
14 Nov 2023
The ART of LLM Refinement: Ask, Refine, and Trust
Kumar Shridhar
Koustuv Sinha
Andrew Cohen
Tianlu Wang
Ping Yu
Ramakanth Pasunuru
Mrinmaya Sachan
Jason Weston
Asli Celikyilmaz
LLMAG
ReLM
LRM
35
24
0
14 Nov 2023
First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning
Kushal Kumar Jain
Moritz Miller
Niket Tandon
Kumar Shridhar
ReLM
LRM
49
2
0
14 Nov 2023
How are Prompts Different in Terms of Sensitivity?
Sheng Lu
Hendrik Schuff
Iryna Gurevych
45
18
0
13 Nov 2023
Large Language Models for Robotics: A Survey
Fanlong Zeng
Wensheng Gan
Yongheng Wang
Ning Liu
Philip S. Yu
LM&Ro
124
127
0
13 Nov 2023
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency
Vernon Toh
Ratish Puduppully
Nancy F. Chen
LRM
33
6
0
13 Nov 2023
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning
Yue Yu
Jiaming Shen
Tianqi Liu
Zhen Qin
Jing Nathan Yan
Jialu Liu
Chao Zhang
Michael Bendersky
54
6
0
13 Nov 2023
Towards the Law of Capacity Gap in Distilling Language Models
Chen Zhang
Dawei Song
Zheyu Ye
Yan Gao
ELM
38
20
0
13 Nov 2023
From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models
Junbing Yan
Chengyu Wang
Taolin Zhang
Xiaofeng He
Jun Huang
Wei Zhang
ReLM
LRM
32
9
0
12 Nov 2023
Are LLMs Rigorous Logical Reasoner? Empowering Natural Language Proof Generation with Contrastive Stepwise Decoding
Ying Su
Xiaojin Fu
Mingwen Liu
Zhijiang Guo
LRM
41
3
0
12 Nov 2023
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Weiyang Liu
Zeju Qiu
Yao Feng
Yuliang Xiu
Yuxuan Xue
...
Songyou Peng
Yandong Wen
Michael J. Black
Adrian Weller
Bernhard Schölkopf
50
58
0
10 Nov 2023
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models
Shahriar Golchin
Mihai Surdeanu
29
24
0
10 Nov 2023
Let's Reinforce Step by Step
Sarah Pan
Vladislav Lialin
Sherin Muckatira
Anna Rumshisky
ReLM
LRM
22
7
0
10 Nov 2023