Training Verifiers to Solve Math Word Problems

27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
    ReLM
    OffRL
    LRM

Papers citing "Training Verifiers to Solve Math Word Problems"

50 / 3,115 papers shown
Title
Distilling LLMs' Decomposition Abilities into Compact Language Models
Denis Tarasov
Kumar Shridhar
SyDa
OffRL
LRM
50
2
0
02 Feb 2024
KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh
Winnie Xu
Niklas Muennighoff
Dan Jurafsky
Douwe Kiela
182
463
0
02 Feb 2024
Vaccine: Perturbation-aware Alignment for Large Language Model
Tiansheng Huang
Sihao Hu
Ling Liu
55
36
0
02 Feb 2024
Executable Code Actions Elicit Better LLM Agents
Xingyao Wang
Yangyi Chen
Lifan Yuan
Yizhe Zhang
Yunzhu Li
Hao Peng
Heng Ji
ELM
LLMAG
LM&Ro
45
133
0
01 Feb 2024
LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law
Toni J. B. Liu
Nicolas Boullé
Raphael Sarfati
Christopher Earls
AI4TS
33
14
0
01 Feb 2024
Dense Reward for Free in Reinforcement Learning from Human Feedback
Alex J. Chan
Hao Sun
Samuel Holt
M. Schaar
26
32
0
01 Feb 2024
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing
Fangkai Jiao
Chengwei Qin
Zhengyuan Liu
Nancy F. Chen
Shafiq Joty
LRM
29
29
0
01 Feb 2024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
Alon Jacovi
Yonatan Bitton
Bernd Bohnet
Jonathan Herzig
Or Honovich
Michael Tseng
Michael Collins
Roee Aharoni
Mor Geva
LRM
55
20
0
01 Feb 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
39
78
0
01 Feb 2024
Large Language Models for Mathematical Reasoning: Progresses and Challenges
Janice Ahn
Rishu Verma
Renze Lou
Di Liu
Rui Zhang
Wenpeng Yin
LRM
54
122
0
31 Jan 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
28
15
0
31 Jan 2024
Efficient Tool Use with Chain-of-Abstraction Reasoning
Silin Gao
Jane Dwivedi-Yu
Ping Yu
X. Tan
Ramakanth Pasunuru
O. Yu. Golovneva
Koustuv Sinha
Asli Celikyilmaz
Antoine Bosselut
Tianlu Wang
LRM
25
20
0
30 Jan 2024
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation
Chiang Yuan
Elvis Hsieh
Chia-Hong Chou
Janosh Riebesell
38
10
0
30 Jan 2024
Conditional and Modal Reasoning in Large Language Models
Wesley H. Holliday
M. Mandelkern
Cedegao E. Zhang
LRM
37
5
0
30 Jan 2024
Learning Agent-based Modeling with LLM Companions: Experiences of Novices and Experts Using ChatGPT & NetLogo Chat
John Chen
Xi Lu
Michael Rejtig
Yuzhou Du
Ruth Bagley
Mike Horn
Uri Wilensky
44
29
0
30 Jan 2024
H2O-Danube-1.8B Technical Report
Philipp Singer
Pascal Pfeiffer
Yauhen Babakhin
Maximilian Jeblick
Nischay Dhankhar
Gabor Fodor
SriSatish Ambati
VLM
29
8
0
30 Jan 2024
GAPS: Geometry-Aware Problem Solver
Jiaxin Zhang
Yinghui Jiang
Yashar Moshfeghi
AIMat
AI4CE
22
3
0
29 Jan 2024
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models
Jinchang Hou
Chang Ao
Haihong Wu
Xiangtao Kong
Zhigang Zheng
...
Chengming Li
Xiping Hu
Ruifeng Xu
Shiwen Ni
Min Yang
AI4Ed
ELM
29
6
0
29 Jan 2024
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings
Logan Hallee
Rohan Kapur
Arjun Patel
Jason P. Gleghorn
Bohdan B. Khomtchouk
MoE
22
3
0
28 Jan 2024
YODA: Teacher-Student Progressive Learning for Language Models
Jianqiao Lu
Wanjun Zhong
Yufei Wang
Zhijiang Guo
Qi Zhu
...
Baojun Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
LRM
32
7
0
28 Jan 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
Yuxin Liang
Zhuoyang Song
Hao Wang
Jiaxing Zhang
HILM
43
30
0
27 Jan 2024
A Comprehensive Survey of Compression Algorithms for Language Models
Seungcheol Park
Jaehyeon Choi
Sojin Lee
U. Kang
MQ
34
12
0
27 Jan 2024
Equipping Language Models with Tool Use Capability for Tabular Data Analysis in Finance
Adrian Theuma
Ehsan Shareghi
32
4
0
27 Jan 2024
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy
Yongkang Liu
Yiqun Zhang
Qian Li
Tong Liu
Shi Feng
Daling Wang
Yifei Zhang
Hinrich Schütze
40
6
0
26 Jan 2024
F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods
Yu Sun
Keyu Chen
Shujie Wang
Qipeng Guo
Hang Yan
Xipeng Qiu
Xuanjing Huang
Dahua Lin
ELM
19
0
0
26 Jan 2024
Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Zhaoye Fei
Yunfan Shao
Linyang Li
Zhiyuan Zeng
Conghui He
Hang Yan
Dahua Lin
Xipeng Qiu
36
8
0
26 Jan 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
52
128
0
26 Jan 2024
ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models
Yao Fu
Leyang Xue
Yeqi Huang
Andrei-Octavian Brabete
Dmitrii Ustiugov
Yuvraj Patel
Luo Mai
28
27
0
25 Jan 2024
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Daya Guo
Qihao Zhu
Dejian Yang
Zhenda Xie
Kai Dong
...
Yu-Huan Wu
Y. K. Li
Fuli Luo
Yingfei Xiong
W. Liang
ELM
62
695
0
25 Jan 2024
Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey
Haochen Li
Jonathan Leung
Zhiqi Shen
LM&MA
LLMAG
LRM
30
0
0
25 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
75
27
0
25 Jan 2024
TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance
Haorui Wang
Rongzhi Zhang
Yinghao Li
Lingkai Kong
Yuchen Zhuang
Xiusi Chen
Chao Zhang
LRM
45
5
0
24 Jan 2024
SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning
Guoxin Chen
Kexin Tang
Chao Yang
Fuying Ye
Yu Qiao
Yiming Qian
LRM
20
3
0
24 Jan 2024
TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data
Fengbin Zhu
Ziyang Liu
Fuli Feng
Chao Wang
Moxin Li
Tat-Seng Chua
LMTD
LRM
34
15
0
24 Jan 2024
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Chang Ma
Junlei Zhang
Zhihao Zhu
Cheng Yang
Yujiu Yang
Yaohui Jin
Zhenzhong Lan
Lingpeng Kong
Junxian He
ELM
LLMAG
37
60
0
24 Jan 2024
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Mirac Suzgun
Adam Tauman Kalai
KELM
LRM
LLMAG
ReLM
53
65
0
23 Jan 2024
Benchmarking LLMs via Uncertainty Quantification
Fanghua Ye
Mingming Yang
Jianhui Pang
Longyue Wang
Derek F. Wong
Emine Yilmaz
Shuming Shi
Zhaopeng Tu
ELM
28
47
0
23 Jan 2024
Can Large Language Models Write Parallel Code?
Daniel Nichols
Joshua H. Davis
Zhaojun Xie
Arjun Rajaram
A. Bhatele
LRM
ELM
ALM
24
21
0
23 Jan 2024
Distilling Mathematical Reasoning Capabilities into Small Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
LRM
40
9
0
22 Jan 2024
SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese
Liang Xu
Hang Xue
Lei Zhu
Kangkang Zhao
ReLM
LRM
ELM
21
9
0
22 Jan 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Taicheng Guo
Preslav Nakov
Yaqi Wang
Ruidi Chang
Shichao Pei
Nitesh Chawla
Olaf Wiest
Xiangliang Zhang
LLMAG
LM&Ro
AI4CE
LRM
52
252
0
21 Jan 2024
In-context Learning with Retrieved Demonstrations for Language Models: A Survey
Man Luo
Xin Xu
Yue Liu
Panupong Pasupat
Mehran Kazemi
RALM
38
55
0
21 Jan 2024
Over-Reasoning and Redundant Calculation of Large Language Models
Cheng-Han Chiang
Hunghuei Lee
LRM
36
9
0
21 Jan 2024
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance
Pengyu Wang
Dong Zhang
Linyang Li
Chenkun Tan
Xinghao Wang
Ke Ren
Botian Jiang
Xipeng Qiu
LLMSV
26
42
0
20 Jan 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
Zhen Xiang
Fengqing Jiang
Zidi Xiong
Bhaskar Ramasubramanian
Radha Poovendran
Bo Li
LRM
SILM
42
40
0
20 Jan 2024
Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning
Adib Hasan
Ileana Rugina
Alex Wang
AAML
54
23
0
19 Jan 2024
LangBridge: Multilingual Reasoning Without Multilingual Supervision
Dongkeun Yoon
Joel Jang
Sungdong Kim
Seungone Kim
Sheikh Shafayat
Minjoon Seo
LRM
24
15
0
19 Jan 2024
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis
Chao Zhang
Yuren Mao
Yijiang Fan
Yu Mi
Yunjun Gao
Lu Chen
Dongfang Lou
Jinshu Lin
42
23
0
19 Jan 2024
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning
Yiwei Li
Peiwen Yuan
Shaoxiong Feng
Boyuan Pan
Xinglin Wang
Bin Sun
Heda Wang
Kan Li
LRM
35
31
0
19 Jan 2024
LangProp: A code optimization framework using Large Language Models applied to driving
Shu Ishida
Gianluca Corrado
George Fedoseev
Hudson Yeo
Lloyd Russell
Jamie Shotton
João F. Henriques
Anthony Hu
61
11
0
18 Jan 2024