ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.14168
  4. Cited By
Training Verifiers to Solve Math Word Problems

Training Verifiers to Solve Math Word Problems

27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
    ReLM
    OffRL
    LRM
ArXivPDFHTML

Papers citing "Training Verifiers to Solve Math Word Problems"

50 / 3,115 papers shown
Title
Toward Self-Improvement of LLMs via Imagination, Searching, and
  Criticizing
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Ye Tian
Baolin Peng
Linfeng Song
Lifeng Jin
Dian Yu
Haitao Mi
Dong Yu
LRM
ReLM
59
67
0
18 Apr 2024
Parallel Decoding via Hidden Transfer for Lossless Large Language Model
  Acceleration
Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration
Pengfei Wu
Jiahao Liu
Zhuocheng Gong
Qifan Wang
Jinpeng Li
Jingang Wang
Xunliang Cai
Dongyan Zhao
33
1
0
18 Apr 2024
AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence
AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence
Minbeom Kim
Hwanhee Lee
Joonsuk Park
Hwaran Lee
Kyomin Jung
45
2
0
18 Apr 2024
The Landscape of Emerging AI Agent Architectures for Reasoning,
  Planning, and Tool Calling: A Survey
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Tula Masterman
Sandi Besen
Mason Sawtell
Alex Chao
LM&Ro
LLMAG
40
45
0
17 Apr 2024
Paraphrase and Solve: Exploring and Exploiting the Impact of Surface
  Form on Mathematical Reasoning in Large Language Models
Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models
Yue Zhou
Yada Zhu
Diego Antognini
Yoon Kim
Yang Zhang
ReLM
LRM
24
3
0
17 Apr 2024
TransLinkGuard: Safeguarding Transformer Models Against Model Stealing
  in Edge Deployment
TransLinkGuard: Safeguarding Transformer Models Against Model Stealing in Edge Deployment
Qinfeng Li
Zhiqiang Shen
Zhenghan Qin
Yangfan Xie
Xuhong Zhang
Tianyu Du
Jianwei Yin
29
8
0
17 Apr 2024
On the Empirical Complexity of Reasoning and Planning in LLMs
On the Empirical Complexity of Reasoning and Planning in LLMs
Liwei Kang
Zirui Zhao
David Hsu
Wee Sun Lee
LRM
38
5
0
17 Apr 2024
Many-Shot In-Context Learning
Many-Shot In-Context Learning
Rishabh Agarwal
Avi Singh
Lei M. Zhang
Bernd Bohnet
Luis Rosias
...
John D. Co-Reyes
Eric Chu
Feryal M. P. Behbahani
Aleksandra Faust
Hugo Larochelle
ReLM
OffRL
BDL
69
100
0
17 Apr 2024
Uncertainty-Based Abstention in LLMs Improves Safety and Reduces
  Hallucinations
Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations
Christian Tomani
Kamalika Chaudhuri
Ivan Evtimov
Daniel Cremers
Mark Ibrahim
59
10
0
16 Apr 2024
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
J. P. Muñoz
Jinjie Yuan
Nilesh Jain
35
7
0
16 Apr 2024
HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
Haozheng Fan
Hao Zhou
Guangtai Huang
Parameswaran Raman
Xinwei Fu
Gaurav Gupta
Dhananjay Ram
Yida Wang
Jun Huan
48
5
0
16 Apr 2024
Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of
  Language Models with Fine-grained Rewards
Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
Hyeonbin Hwang
Doyoung Kim
Seungone Kim
Seonghyeon Ye
Minjoon Seo
LRM
ReLM
53
15
0
16 Apr 2024
Enhancing Confidence Expression in Large Language Models Through
  Learning from Past Experience
Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience
Haixia Han
Tingyun Li
Shisong Chen
Jie Shi
Chengyu Du
Yanghua Xiao
Jiaqing Liang
Xin Lin
55
7
0
16 Apr 2024
Balancing Speciality and Versatility: a Coarse to Fine Framework for
  Supervised Fine-tuning Large Language Model
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Hengyuan Zhang
Yanru Wu
Dawei Li
Zacc Yang
Rui Zhao
Yong Jiang
Fei Tan
ALM
40
0
0
16 Apr 2024
Compression Represents Intelligence Linearly
Compression Represents Intelligence Linearly
Yuzhen Huang
Jinghan Zhang
Zifei Shan
Junxian He
50
28
0
15 Apr 2024
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity,
  Bias and Propensity for Hallucinations
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
David Nadeau
Mike Kroutikov
Karen McNeil
Simon Baribeau
HILM
37
7
0
15 Apr 2024
Learn Your Reference Model for Real Good Alignment
Learn Your Reference Model for Real Good Alignment
Alexey Gorbatovski
Boris Shaposhnikov
Alexey Malakhov
Nikita Surnachev
Yaroslav Aksenov
Ian Maksimov
Nikita Balagansky
Daniil Gavrilov
OffRL
61
29
0
15 Apr 2024
Entropy Guided Extrapolative Decoding to Improve Factuality in Large
  Language Models
Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Souvik Das
Lifeng Jin
Linfeng Song
Haitao Mi
Baolin Peng
Dong Yu
HILM
45
2
0
14 Apr 2024
Distilling Reasoning Ability from Large Language Models with Adaptive
  Thinking
Distilling Reasoning Ability from Large Language Models with Adaptive Thinking
Xiao Chen
Sihang Zhou
K. Liang
Xinwang Liu
ReLM
LRM
42
4
0
14 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent
  Deliberation
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Dongyeop Kang
LLMAG
52
6
0
14 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from
  Human Feedback for LLMs
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
36
35
0
12 Apr 2024
Rho-1: Not All Tokens Are What You Need
Rho-1: Not All Tokens Are What You Need
Zheng-Wen Lin
Zhibin Gou
Yeyun Gong
Xiao Liu
Yelong Shen
...
Chen Lin
Yujiu Yang
Jian Jiao
Nan Duan
Weizhu Chen
CLL
52
58
0
11 Apr 2024
Reflectance Estimation for Proximity Sensing by Vision-Language Models:
  Utilizing Distributional Semantics for Low-Level Cognition in Robotics
Reflectance Estimation for Proximity Sensing by Vision-Language Models: Utilizing Distributional Semantics for Low-Level Cognition in Robotics
Masashi Osada
G. A. G. Ricardez
Yosuke Suzuki
Tadahiro Taniguchi
31
2
0
11 Apr 2024
UltraEval: A Lightweight Platform for Flexible and Comprehensive
  Evaluation for LLMs
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs
Chaoqun He
Renjie Luo
Shengding Hu
Yuanqian Zhao
Jie Zhou
Hanghao Wu
Jiajie Zhang
Xu Han
Zhiyuan Liu
Maosong Sun
ELM
49
15
0
11 Apr 2024
MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT
  Prompting
MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting
Avinash Anand
Janak Kapuriya
Apoorv Singh
Jay Saraf
Naman Lal
Astha Verma
Rushali Gupta
R. Shah
LRM
41
12
0
11 Apr 2024
Interactive Prompt Debugging with Sequence Salience
Interactive Prompt Debugging with Sequence Salience
Ian Tenney
Ryan Mullins
Bin Du
Shree Pandya
Minsuk Kahng
Lucas Dixon
LRM
45
1
0
11 Apr 2024
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Yikang Shen
Zhen Guo
Tianle Cai
Zengyi Qin
MoE
ALM
46
29
0
11 Apr 2024
Improving Language Model Reasoning with Self-motivated Learning
Improving Language Model Reasoning with Self-motivated Learning
Yunlong Feng
Yang Xu
Libo Qin
Yasheng Wang
Wanxiang Che
LRM
ReLM
42
7
0
10 Apr 2024
A Survey on the Integration of Generative AI for Critical Thinking in
  Mobile Networks
A Survey on the Integration of Generative AI for Critical Thinking in Mobile Networks
Athanasios Karapantelakis
Alexandros Nikou
Ajay Kattepur
Jean Martins
Leonid Mokrushin
S. Mohalik
Marin Orlic
Aneta Vulgarakis Feljan
34
1
0
10 Apr 2024
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
Murong Yue
Wijdane Mifdal
Yixuan Zhang
Jennifer Suh
Yixuan Zhang
Ziyu Yao
LLMAG
64
17
0
10 Apr 2024
Sample-Efficient Human Evaluation of Large Language Models via Maximum
  Discrepancy Competition
Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Kehua Feng
Keyan Ding
Kede Ma
Zhihua Wang
Qiang Zhang
Huajun Chen
42
10
0
10 Apr 2024
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Chonghua Wang
Haodong Duan
Songyang Zhang
Dahua Lin
Kai-xiang Chen
ELM
31
17
0
09 Apr 2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable
  Training Strategies
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Shengding Hu
Yuge Tu
Xu Han
Chaoqun He
Ganqu Cui
...
Chaochao Jia
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
MoE
56
298
0
09 Apr 2024
Latent Distance Guided Alignment Training for Large Language Models
Latent Distance Guided Alignment Training for Large Language Models
Haotian Luo
19
0
0
09 Apr 2024
RAR-b: Reasoning as Retrieval Benchmark
RAR-b: Reasoning as Retrieval Benchmark
Chenghao Xiao
G. Thomas
Al Moubayed
LRM
RALM
48
8
0
09 Apr 2024
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation
  of Large Language Models
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
Zhuohao Yu
Chang Gao
Wenjin Yao
Yidong Wang
Zhengran Zeng
Wei Ye
Jindong Wang
Yue Zhang
Shikun Zhang
50
1
0
09 Apr 2024
Evaluating Mathematical Reasoning Beyond Accuracy
Evaluating Mathematical Reasoning Beyond Accuracy
Shijie Xia
Xuefeng Li
Yixin Liu
Tongshuang Wu
Pengfei Liu
LRM
ReLM
52
22
0
08 Apr 2024
RoT: Enhancing Large Language Models with Reflection on Search Trees
RoT: Enhancing Large Language Models with Reflection on Search Trees
Wenyang Hui
Kewei Tu
LRM
42
6
0
08 Apr 2024
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step
  Reasoning with Large Language Models
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
Shibo Hao
Yi Gu
Haotian Luo
Tianyang Liu
Xiyan Shao
...
Haodi Ma
Adithya Samavedhi
Qiyue Gao
Zhen Wang
Zhiting Hu
LRM
ELM
100
26
0
08 Apr 2024
Have You Merged My Model? On The Robustness of Large Language Model IP
  Protection Methods Against Model Merging
Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging
Tianshuo Cong
Delong Ran
Zesen Liu
Xinlei He
Jinyuan Liu
Yichen Gong
Qi Li
Anyu Wang
Xiaoyun Wang
MoMe
46
7
0
08 Apr 2024
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation
  and Fine-grained Classification
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification
Kai Sun
Yushi Bai
Ji Qi
Lei Hou
Juanzi Li
LRM
35
15
0
07 Apr 2024
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
Andi Zhang
Tim Z. Xiao
Weiyang Liu
Robert Bamler
Damon J. Wischik
OODD
56
4
0
07 Apr 2024
Navigating the Landscape of Hint Generation Research: From the Past to
  the Future
Navigating the Landscape of Hint Generation Research: From the Past to the Future
Anubhav Jangra
Jamshid Mozafari
Adam Jatowt
Smaranda Muresan
45
2
0
06 Apr 2024
To Cool or not to Cool? Temperature Network Meets Large Foundation
  Models via DRO
To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO
Zi-Hao Qiu
Siqi Guo
Mao Xu
Tuo Zhao
Lijun Zhang
Tianbao Yang
AI4TS
AI4CE
59
3
0
06 Apr 2024
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical
  Reasoning in Large Language Models
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
Hyeonwoo Kim
Gyoungjin Gim
Yungi Kim
Jihoo Kim
Byungju Kim
Wonseok Lee
Chanjun Park
ReLM
LRM
45
1
0
05 Apr 2024
Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data
Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data
Jingyu Zhang
Marc Marone
Tianjian Li
Benjamin Van Durme
Daniel Khashabi
93
9
0
05 Apr 2024
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
Dongwei Jiang
Jingyu Zhang
Orion Weller
Nathaniel Weir
Benjamin Van Durme
Daniel Khashabi
65
1
0
04 Apr 2024
ReFT: Representation Finetuning for Language Models
ReFT: Representation Finetuning for Language Models
Zhengxuan Wu
Aryaman Arora
Zheng Wang
Atticus Geiger
Daniel Jurafsky
Christopher D. Manning
Christopher Potts
OffRL
43
58
0
04 Apr 2024
Investigating Regularization of Self-Play Language Models
Investigating Regularization of Self-Play Language Models
Réda Alami
Abdalgader Abubaker
Mastane Achab
M. Seddik
Salem Lahlou
38
3
0
04 Apr 2024
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models
  with a Self-Critique Pipeline
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Yifan Xu
Xiao Liu
Xinghan Liu
Zhenyu Hou
Yueyan Li
...
Aohan Zeng
Zhengxiao Du
Wenyi Zhao
Jie Tang
Yuxiao Dong
LRM
49
36
0
03 Apr 2024
Previous
123...394041...616263
Next