ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.14168
  4. Cited By
Training Verifiers to Solve Math Word Problems

Training Verifiers to Solve Math Word Problems

27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
    ReLM
    OffRL
    LRM
ArXivPDFHTML

Papers citing "Training Verifiers to Solve Math Word Problems"

50 / 3,042 papers shown
Title
MCC-KD: Multi-CoT Consistent Knowledge Distillation
MCC-KD: Multi-CoT Consistent Knowledge Distillation
Hongzhan Chen
Siyue Wu
Xiaojun Quan
Rui Wang
Ming Yan
Ji Zhang
LRM
19
17
0
23 Oct 2023
Unleashing the potential of prompt engineering in Large Language Models:
  a comprehensive review
Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review
Banghao Chen
Zhaofeng Zhang
Nicolas Langrené
Shengxin Zhu
LLMAG
34
4
0
23 Oct 2023
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
Tengxiao Liu
Qipeng Guo
Yuqing Yang
Xiangkun Hu
Yue Zhang
Xipeng Qiu
Zheng-Wei Zhang
LRM
LLMAG
26
30
0
23 Oct 2023
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
Wei-wei Zhu
Xiaoling Wang
Huanran Zheng
Mosha Chen
Buzhou Tang
ELM
LM&MA
28
33
0
22 Oct 2023
Small Language Models Fine-tuned to Coordinate Larger Language Models
  improve Complex Reasoning
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Gurusha Juneja
Subhabrata Dutta
Soumen Chakrabarti
Sunny Manchanda
Tanmoy Chakraborty
LRM
ReLM
16
15
0
21 Oct 2023
Three Questions Concerning the Use of Large Language Models to
  Facilitate Mathematics Learning
Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning
An-Zi Yen
Wei-Ling Hsu
LRM
AI4Ed
38
9
0
20 Oct 2023
Teaching Language Models to Self-Improve through Interactive
  Demonstrations
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRM
ReLM
38
20
0
20 Oct 2023
Democratizing Reasoning Ability: Tailored Learning from Large Language
  Model
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Zhaoyang Wang
Shaohan Huang
Yuxuan Liu
Jiahai Wang
Minghui Song
...
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
LRM
48
11
0
20 Oct 2023
ToolChain*: Efficient Action Space Navigation in Large Language Models
  with A* Search
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
Yuchen Zhuang
Xiang Chen
Tong Yu
Saayan Mitra
Victor S. Bursztyn
Ryan A. Rossi
Somdeb Sarkhel
Chao Zhang
LLMAG
36
53
0
20 Oct 2023
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao
Xinting Huang
Wei Bi
Lingpeng Kong
LRM
48
0
0
19 Oct 2023
AgentTuning: Enabling Generalized Agent Abilities for LLMs
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Aohan Zeng
Mingdao Liu
Rui Lu
Bowen Wang
Xiao Liu
Yuxiao Dong
Jie Tang
LM&MA
ALM
LLMAG
32
161
0
19 Oct 2023
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language
  Models
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models
Deepak Nathani
David Wang
Liangming Pan
Wenjie Wang
KELM
LRM
ReLM
28
10
0
19 Oct 2023
Eliminating Reasoning via Inferring with Planning: A New Framework to
  Guide LLMs' Non-linear Thinking
Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking
Yongqi Tong
Yifan Wang
Dawei Li
Sizhe Wang
Zi Lin
Simeng Han
Jingbo Shang
LRM
26
17
0
18 Oct 2023
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from
  a Parametric Perspective
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
Ming Zhong
Chenxin An
Weizhu Chen
Jiawei Han
Pengcheng He
31
9
0
17 Oct 2023
KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using
  Large Language Models
KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models
Jiho Kim
Yeonsu Kwon
Yohan Jo
Edward Choi
32
25
0
17 Oct 2023
CoTFormer: More Tokens With Attention Make Up For Less Depth
CoTFormer: More Tokens With Attention Make Up For Less Depth
Amirkeivan Mohtashami
Matteo Pagliardini
Martin Jaggi
16
1
0
16 Oct 2023
Llemma: An Open Language Model For Mathematics
Llemma: An Open Language Model For Mathematics
Zhangir Azerbayev
Hailey Schoelkopf
Keiran Paster
Marco Dos Santos
Stephen Marcus McAleer
Albert Q. Jiang
Jia Deng
Stella Biderman
Sean Welleck
CLL
40
276
0
16 Oct 2023
Semantic Parsing by Large Language Models for Intricate Updating
  Strategies of Zero-Shot Dialogue State Tracking
Semantic Parsing by Large Language Models for Intricate Updating Strategies of Zero-Shot Dialogue State Tracking
Yuxiang Wu
Guanting Dong
Weiran Xu
50
3
0
16 Oct 2023
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
Kai Lv
Hang Yan
Qipeng Guo
Haijun Lv
Xipeng Qiu
ODL
27
20
0
16 Oct 2023
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative
  Language Models
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Jing Xiong
Jianhao Shen
Ye Yuan
Haiming Wang
Yichun Yin
...
Yinya Huang
Chuanyang Zheng
Xiaodan Liang
Ming Zhang
Qun Liu
AIMat
LRM
26
15
0
16 Oct 2023
Let's reward step by step: Step-Level reward model as the Navigators for
  Reasoning
Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Qianli Ma
Haotian Zhou
Tingkai Liu
Jianbo Yuan
Pengfei Liu
Yang You
Hongxia Yang
LRM
35
43
0
16 Oct 2023
A Comprehensive Evaluation of Tool-Assisted Generation Strategies
A Comprehensive Evaluation of Tool-Assisted Generation Strategies
Alon Jacovi
Avi Caciularu
Jonathan Herzig
Roee Aharoni
Bernd Bohnet
Mor Geva
ELM
43
6
0
16 Oct 2023
Improving Large Language Model Fine-tuning for Solving Math Problems
Improving Large Language Model Fine-tuning for Solving Math Problems
Yixin Liu
Avi Singh
C. D. Freeman
John D. Co-Reyes
Peter J. Liu
LRM
ReLM
43
45
0
16 Oct 2023
An Expression Tree Decoding Strategy for Mathematical Equation
  Generation
An Expression Tree Decoding Strategy for Mathematical Equation Generation
Wenqi Zhang
Yongliang Shen
Qingpeng Nong
Zeqi Tan
Zeqi Tan Yanna Ma
Weiming Lu
AIMat
31
6
0
14 Oct 2023
Autonomous Tree-search Ability of Large Language Models
Autonomous Tree-search Ability of Large Language Models
Zheyu Zhang
Zhuorui Ye
Yikang Shen
Chuang Gan
LRM
32
0
0
14 Oct 2023
The Consensus Game: Language Model Generation via Equilibrium Search
The Consensus Game: Language Model Generation via Equilibrium Search
Athul Paul Jacob
Songlin Yang
Gabriele Farina
Jacob Andreas
45
20
0
13 Oct 2023
Exploration with Principles for Diverse AI Supervision
Exploration with Principles for Diverse AI Supervision
Hao Liu
Matei A. Zaharia
Pieter Abbeel
48
2
0
13 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLM
ELM
LRM
70
8
0
13 Oct 2023
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Yixiao Li
Yifan Yu
Chen Liang
Pengcheng He
Nikos Karampatziakis
Weizhu Chen
Tuo Zhao
MQ
41
125
0
12 Oct 2023
Formally Specifying the High-Level Behavior of LLM-Based Agents
Formally Specifying the High-Level Behavior of LLM-Based Agents
Mayank Agarwal
Ibrahim Abdelaziz
Ramón Fernández Astudillo
Kinjal Basu
Soham Dan
Yara Rizk
Achille Fokoue
Pavan Kapanipathi
Salim Roukos
Luis A. Lastras
LLMAG
23
8
0
12 Oct 2023
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
Yongchao Zhou
Kaifeng Lyu
A. S. Rawat
A. Menon
Afshin Rostamizadeh
Sanjiv Kumar
Jean-François Kagy
Rishabh Agarwal
55
84
0
12 Oct 2023
Found in the Middle: Permutation Self-Consistency Improves Listwise
  Ranking in Large Language Models
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models
Raphael Tang
Xinyu Crystina Zhang
Xueguang Ma
Jimmy Lin
Ferhan Ture
LRM
42
15
0
11 Oct 2023
OpsEval: A Comprehensive IT Operations Benchmark Suite for Large
  Language Models
OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language Models
Yuhe Liu
Changhua Pei
Longlong Xu
Bohan Chen
Mingze Sun
...
Gaogang Xie
Xidao Wen
Xiaohui Nie
Minghua Ma
Dan Pei
ELM
22
2
0
11 Oct 2023
KwaiYiiMath: Technical Report
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
ReLM
RALM
51
2
0
11 Oct 2023
Online Speculative Decoding
Online Speculative Decoding
Xiaoxuan Liu
Lanxiang Hu
Peter Bailis
Alvin Cheung
Zhijie Deng
Ion Stoica
Hao Zhang
29
53
0
11 Oct 2023
Diversity of Thought Improves Reasoning Abilities of LLMs
Diversity of Thought Improves Reasoning Abilities of LLMs
Ranjita Naik
Varun Chandrasekaran
Mert Yuksekgonul
Hamid Palangi
Besmira Nushi
LRM
34
6
0
11 Oct 2023
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained
  Decoding
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
Kexun Zhang
Hongqiao Chen
Lei Li
Wenjie Wang
53
4
0
10 Oct 2023
Sparse Fine-tuning for Inference Acceleration of Large Language Models
Sparse Fine-tuning for Inference Acceleration of Large Language Models
Eldar Kurtic
Denis Kuznedelev
Elias Frantar
Michael Goin
Dan Alistarh
35
8
0
10 Oct 2023
Generating and Evaluating Tests for K-12 Students with Language Model
  Simulations: A Case Study on Sentence Reading Efficiency
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
E. Zelikman
Wanjing Anya Ma
Jasmine E. Tran
Diyi Yang
Jason D. Yeatman
Nick Haber
AI4Ed
32
9
0
10 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
Lemur: Harmonizing Natural Language and Code for Language Agents
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
32
68
0
10 Oct 2023
Mistral 7B
Mistral 7B
Albert Q. Jiang
Alexandre Sablayrolles
A. Mensch
Chris Bamford
Devendra Singh Chaplot
...
Teven Le Scao
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE
LRM
23
2,014
0
10 Oct 2023
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Keiran Paster
Marco Dos Santos
Zhangir Azerbayev
Jimmy Ba
LRM
33
80
0
10 Oct 2023
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with
  Large Language Models
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Anni Zou
ZhuoSheng Zhang
Hai Zhao
Xiangru Tang
LRM
ReLM
42
3
0
10 Oct 2023
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of
  Multi-modal Language Models
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models
Letian Zhang
Xiaotong Zhai
Zhongkai Zhao
Yongshuo Zong
Xin Wen
Bingchen Zhao
LRM
16
0
0
10 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
115
125
0
10 Oct 2023
Let Models Speak Ciphers: Multiagent Debate through Embeddings
Let Models Speak Ciphers: Multiagent Debate through Embeddings
Chau Pham
Boyi Liu
Yingxiang Yang
Zhengyu Chen
Tianyi Liu
Jianbo Yuan
Bryan A. Plummer
Zhaoran Wang
Hongxia Yang
LLMAG
41
15
0
10 Oct 2023
LLMLingua: Compressing Prompts for Accelerated Inference of Large
  Language Models
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang
Qianhui Wu
Chin-Yew Lin
Yuqing Yang
Lili Qiu
42
103
0
09 Oct 2023
Guiding Language Model Math Reasoning with Planning Tokens
Guiding Language Model Math Reasoning with Planning Tokens
Xinyi Wang
Lucas Caccia
O. Ostapenko
Xingdi Yuan
William Yang Wang
Alessandro Sordoni
LRM
43
20
0
09 Oct 2023
MuggleMath: Assessing the Impact of Query and Response Augmentation on
  Math Reasoning
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning
Chengpeng Li
Zheng Yuan
Hongyi Yuan
Guanting Dong
Keming Lu
Jiancan Wu
Chuanqi Tan
Xiang Wang
Chang Zhou
LRM
22
22
0
09 Oct 2023
How Abilities in Large Language Models are Affected by Supervised
  Fine-tuning Data Composition
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
Guanting Dong
Hongyi Yuan
Keming Lu
Chengpeng Li
Mingfeng Xue
Dayiheng Liu
Wei Wang
Zheng Yuan
Chang Zhou
Jingren Zhou
LRM
CLL
34
121
0
09 Oct 2023
Previous
123...505152...596061
Next