Training Verifiers to Solve Math Word Problems


27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
    ReLM
    OffRL
    LRM

Papers citing "Training Verifiers to Solve Math Word Problems"

50 / 3,115 papers shown
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Haritz Puerto
Martin Tutek
Somak Aditya
Xiaodan Zhu
Iryna Gurevych
ReCod
ReLM
LRM
56
11
0
18 Jan 2024
Self-Rewarding Language Models
Weizhe Yuan
Richard Yuanzhe Pang
Kyunghyun Cho
Xian Li
Sainbayar Sukhbaatar
Jing Xu
Jason Weston
ReLM
SyDa
ALM
LRM
244
304
0
18 Jan 2024
Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions
Pengfei Hong
Navonil Majumder
Deepanway Ghosal
Somak Aditya
Rada Mihalcea
Soujanya Poria
LRM
54
4
0
17 Jan 2024
LLMs for Relational Reasoning: How Far are We?
Zhiming Li
Yushi Cao
Xiufeng Xu
Junzhe Jiang
Xu Liu
Yon Shin Teo
Shang-Wei Lin
Yang Liu
LRM
37
16
0
17 Jan 2024
Augmenting Math Word Problems via Iterative Question Composing
Haoxiong Liu
Yifan Zhang
Yifan Luo
Andrew Chi-Chih Yao
SyDa
LRM
47
37
0
17 Jan 2024
ReFT: Reasoning with Reinforced Fine-Tuning
Trung Quoc Luong
Xinbo Zhang
Zhanming Jie
Peng Sun
Xiaoran Jin
Hang Li
OffRL
LRM
ReLM
48
95
0
17 Jan 2024
Tuning Language Models by Proxy
Alisa Liu
Xiaochuang Han
Yizhong Wang
Yulia Tsvetkov
Yejin Choi
Noah A. Smith
ALM
43
46
0
16 Jan 2024
Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Models
T. Klein
Moin Nabi
26
1
0
16 Jan 2024
Understanding User Experience in Large Language Model Interactions
Jiayin Wang
Weizhi Ma
Peijie Sun
Min Zhang
Jian-yun Nie
27
32
0
16 Jan 2024
Large Language Models are Null-Shot Learners
Pittawat Taveekitworachai
Febri Abdullah
R. Thawonmas
LRM
29
2
0
16 Jan 2024
LoMA: Lossless Compressed Memory Attention
Yumeng Wang
Zhenyang Xiao
21
3
0
16 Jan 2024
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline
Minpeng Liao
Wei Luo
Chengxi Li
Jing Wu
Kai Fan
LRM
40
39
0
16 Jan 2024
PRewrite: Prompt Rewriting with Reinforcement Learning
Weize Kong
Spurthi Amba Hombaiah
Mingyang Zhang
Qiaozhu Mei
Michael Bendersky
LLMAG
18
14
0
16 Jan 2024
Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination
Syeda Nahida Akter
Aman Madaan
Sangwu Lee
Yiming Yang
Eric Nyberg
ReLM
VLM
LRM
41
2
0
16 Jan 2024
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Heming Xia
Zhe Yang
Qingxiu Dong
Peiyi Wang
Yongqi Li
Tao Ge
Tianyu Liu
Wenjie Li
Zhifang Sui
LRM
38
105
0
15 Jan 2024
Question Translation Training for Better Multilingual Reasoning
Wenhao Zhu
Shujian Huang
Fei Yuan
Shuaijie She
Jiajun Chen
Alexandra Birch
LRM
31
29
0
15 Jan 2024
MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models
Divyanshu Aggarwal
Ashutosh Sathe
Ishaan Watts
Sunayana Sitaram
40
1
0
15 Jan 2024
Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends
Yunshi Lan
Xinyuan Li
Hanyue Du
Xuesong Lu
Ming Gao
Weining Qian
Aoying Zhou
45
2
0
15 Jan 2024
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent
Weizhou Shen
Chenliang Li
Hongzhan Chen
Ming Yan
Xiaojun Quan
Hehong Chen
Ji Zhang
Fei Huang
LLMAG
48
50
0
14 Jan 2024
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities
S. Sravanthi
Meet Doshi
Tankala Pavan Kalyan
Rudra Murthy
Pushpak Bhattacharyya
Raj Dabre
24
20
0
13 Jan 2024
xCoT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoning
Linzheng Chai
Jian Yang
Tao Sun
Hongcheng Guo
Jiaheng Liu
...
Xiannian Liang
Jiaqi Bai
Tongliang Li
Qiyao Peng
Zhoujun Li
LRM
44
49
0
13 Jan 2024
Knowledge Distillation for Closed-Source Language Models
Hongzhan Chen
Xiaojun Quan
Hehong Chen
Ming Yan
Ji Zhang
BDL
47
2
0
13 Jan 2024
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities
Yujun Mao
Yoon Kim
Yilun Zhou
LRM
ReLM
26
18
0
13 Jan 2024
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks
Peter Hase
Mohit Bansal
Peter Clark
Sarah Wiegreffe
20
28
0
12 Jan 2024
MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization
Shuaijie She
Wei Zou
Shujian Huang
Wenhao Zhu
Xiang Liu
Xiang Geng
Jiajun Chen
LRM
75
34
0
12 Jan 2024
AntEval: Evaluation of Social Interaction Competencies in LLM-Driven Agents
Yuanzhi Liang
Linchao Zhu
Yi Yang
LLMAG
30
0
0
12 Jan 2024
Extreme Compression of Large Language Models via Additive Quantization
Vage Egiazarian
Andrei Panferov
Denis Kuznedelev
Elias Frantar
Artem Babenko
Dan Alistarh
MQ
102
91
0
11 Jan 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Junchen Wan
Fuzheng Zhang
Di Zhang
Ji-Rong Wen
KELM
41
33
0
11 Jan 2024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Damai Dai
Chengqi Deng
Chenggang Zhao
R. X. Xu
Huazuo Gao
...
Panpan Huang
Fuli Luo
Chong Ruan
Zhifang Sui
W. Liang
MoE
46
252
0
11 Jan 2024
Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments
Antoine Dedieu
Wolfgang Lehrach
Guangyao Zhou
Dileep George
Miguel Lazaro-Gredilla
45
2
0
11 Jan 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Yiqi Wang
Wentao Chen
Xiaotian Han
Xudong Lin
Haiteng Zhao
Yongfei Liu
Bohan Zhai
Jianbo Yuan
Quanzeng You
Hongxia Yang
LRM
47
71
0
10 Jan 2024
DCR: Divide-and-Conquer Reasoning for Multi-choice Question Answering with LLMs
Zijie Meng
Yan Zhang
Zhaopeng Feng
Zuozhu Liu
LRM
27
4
0
10 Jan 2024
The Impact of Reasoning Step Length on Large Language Models
Mingyu Jin
Qinkai Yu
Dong Shu
Haiyan Zhao
Wenyue Hua
Yanda Meng
Yongfeng Zhang
Mengnan Du
ReLM
LRM
20
93
0
10 Jan 2024
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
Jia-Chen Gu
Haoyang Xu
Jun-Yu Ma
Pan Lu
Zhen-Hua Ling
Kai-Wei Chang
Nanyun Peng
KELM
38
37
0
09 Jan 2024
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation
Mahdi Nikdan
Soroush Tabesh
Elvir Crnčević
Dan Alistarh
16
27
0
09 Jan 2024
Mixtral of Experts
Albert Q. Jiang
Alexandre Sablayrolles
Antoine Roux
A. Mensch
Blanche Savary
...
Théophile Gervet
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE
LLMAG
42
1,000
0
08 Jan 2024
TeleChat Technical Report
Zhongjiang He
Zihan Wang
Xinzhan Liu
Shixuan Liu
Yitong Yao
...
Zilu Huang
Sishi Xiong
Yuxiang Zhang
Chao Wang
Shuangyong Song
AI4MH
LRM
ALM
66
3
0
08 Jan 2024
Understanding Large-Language Model (LLM)-powered Human-Robot Interaction
Callie Y. Kim
Christine P. Lee
Bilge Mutlu
LM&Ro
57
72
0
06 Jan 2024
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
DeepSeek-AI
Xiao Bi
Deli Chen
Guanting Chen
...
Yao Zhao
Shangyan Zhou
Shunfeng Zhou
Qihao Zhu
Yuheng Zou
LRM
ALM
139
316
0
05 Jan 2024
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
Haoyuan Wu
Haisheng Zheng
Zhuolun He
Bei Yu
MoE
ALM
29
14
0
05 Jan 2024
LLaMA Pro: Progressive LLaMA with Block Expansion
Chengyue Wu
Yukang Gan
Yixiao Ge
Zeyu Lu
Jiahao Wang
Ye Feng
Ying Shan
Ping Luo
CLL
37
61
0
04 Jan 2024
LLM Augmented LLMs: Expanding Capabilities through Composition
Rachit Bansal
Bidisha Samanta
Siddharth Dalmia
Nitish Gupta
Shikhar Vashishth
Sriram Ganapathy
Abhishek Bapna
Prateek Jain
Partha P. Talukdar
CLL
26
34
0
04 Jan 2024
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model
Yichen Zhu
Minjie Zhu
Ning Liu
Zhicai Ou
Xiaofeng Mou
Jian Tang
76
94
0
04 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
39
65
0
04 Jan 2024
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Qiuying Peng
Jun Wang
Yueting Zhuang
Weiming Lu
LRM
LLMAG
45
53
0
04 Jan 2024
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Aleksandar Stanić
Sergi Caelles
Michael Tschannen
LRM
VLM
27
9
0
03 Jan 2024
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Zixiang Chen
Yihe Deng
Huizhuo Yuan
Kaixuan Ji
Quanquan Gu
SyDa
48
285
0
02 Jan 2024
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Hongye Jin
Xiaotian Han
Jingfeng Yang
Zhimeng Jiang
Zirui Liu
Chia-Yuan Chang
Huiyuan Chen
Xia Hu
47
101
0
02 Jan 2024
LLM Harmony: Multi-Agent Communication for Problem Solving
Sumedh Rasal
LLMAG
24
22
0
02 Jan 2024
A Comprehensive Study of Knowledge Editing for Large Language Models
Ningyu Zhang
Yunzhi Yao
Bo Tian
Peng Wang
Shumin Deng
...
Lei Liang
Qing Cui
Xiao-Jun Zhu
Jun Zhou
Huajun Chen
KELM
55
77
0
02 Jan 2024