ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.14168
  4. Cited By
Training Verifiers to Solve Math Word Problems

Training Verifiers to Solve Math Word Problems

27 October 2021
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
Lukasz Kaiser
Matthias Plappert
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
    ReLM
    OffRL
    LRM
ArXivPDFHTML

Papers citing "Training Verifiers to Solve Math Word Problems"

50 / 3,115 papers shown
Title
Quantum Many-Body Physics Calculations with Large Language Models
Quantum Many-Body Physics Calculations with Large Language Models
Haining Pan
N. Mudur
Will Taranto
Maria Tikhanovskaya
Subhashini Venugopalan
Yasaman Bahri
Michael P. Brenner
Eun-Ah Kim
44
6
0
05 Mar 2024
Localized Zeroth-Order Prompt Optimization
Localized Zeroth-Order Prompt Optimization
Wenyang Hu
Yao Shu
Zongmin Yu
Zhaoxuan Wu
Xiangqiang Lin
Zhongxiang Dai
See-Kiong Ng
Bryan Kian Hsiang Low
35
6
0
05 Mar 2024
DPPA: Pruning Method for Large Language Model to Model Merging
DPPA: Pruning Method for Large Language Model to Model Merging
Yaochen Zhu
Rui Xia
Jiajun Zhang
MoMe
40
4
0
05 Mar 2024
Causal Prompting: Debiasing Large Language Model Prompting based on
  Front-Door Adjustment
Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment
Congzhi Zhang
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
CML
AI4CE
LRM
49
15
0
05 Mar 2024
Exploring the Limitations of Large Language Models in Compositional
  Relation Reasoning
Exploring the Limitations of Large Language Models in Compositional Relation Reasoning
Jinman Zhao
Xueyan Zhang
BDL
LRM
38
4
0
05 Mar 2024
Eliciting Better Multilingual Structured Reasoning from LLMs through
  Code
Eliciting Better Multilingual Structured Reasoning from LLMs through Code
Bryan Li
Tamer Alkhouli
Daniele Bonadiman
Nikolaos Pappas
Saab Mansour
LRM
47
7
0
05 Mar 2024
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
Yuqi Zhu
Shuofei Qiao
Yixin Ou
Shumin Deng
N. Zhang
Shiwei Lyu
Yue Shen
Lei Liang
Jinjie Gu
Huajun Chen
LLMAG
LM&Ro
83
26
0
05 Mar 2024
Are More LLM Calls All You Need? Towards Scaling Laws of Compound
  Inference Systems
Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Ion Stoica
Matei A. Zaharia
James Zou
LRM
39
0
0
04 Mar 2024
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical
  Reasoning
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Yiming Huang
Xiao Liu
Yeyun Gong
Zhibin Gou
Yelong Shen
Nan Duan
Weizhu Chen
AIMat
LRM
58
37
0
04 Mar 2024
Birbal: An efficient 7B instruct-model fine-tuned with curated datasets
Birbal: An efficient 7B instruct-model fine-tuned with curated datasets
Ashvini Jindal
P. Rajpoot
Ankur P. Parikh
58
6
0
04 Mar 2024
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve
  Mathematical Reasoning Learning of Language Models
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Changyu Chen
Xiting Wang
Ting-En Lin
Ang Lv
Yuchuan Wu
Xin Gao
Ji-Rong Wen
Rui Yan
Yongbin Li
ReLM
LRM
31
11
0
04 Mar 2024
SciAssess: Benchmarking LLM Proficiency in Scientific Literature
  Analysis
SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis
Hengxing Cai
Xiaochen Cai
Junhan Chang
Sihang Li
Lin Yao
...
Changhong Chen
Zheng Cheng
Zifeng Zhao
Linfeng Zhang
Guolin Ke
ELM
36
24
0
04 Mar 2024
Online Training of Large Language Models: Learn while chatting
Online Training of Large Language Models: Learn while chatting
Juhao Liang
Ziwei Wang
Zhuoheng Ma
Jianquan Li
Zhiyi Zhang
Xiangbo Wu
Benyou Wang
KELM
44
3
0
04 Mar 2024
Rethinking LLM Language Adaptation: A Case Study on Chinese Mixtral
Rethinking LLM Language Adaptation: A Case Study on Chinese Mixtral
Yiming Cui
Xin Yao
30
4
0
04 Mar 2024
LM4OPT: Unveiling the Potential of Large Language Models in Formulating
  Mathematical Optimization Problems
LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems
Tasnim Ahmed
Salimur Choudhury
33
11
0
02 Mar 2024
STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient
  Fine-Tuning of Large Language Models
STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
32
4
0
02 Mar 2024
LAB: Large-Scale Alignment for ChatBots
LAB: Large-Scale Alignment for ChatBots
Shivchander Sudalairaj
Abhishek Bhandwaldar
Aldo Pareja
Kai Xu
David D. Cox
Akash Srivastava
OSLM
41
29
0
02 Mar 2024
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large
  Language Models
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models
Kedi Chen
Qin Chen
Jie Zhou
Yishen He
Liang He
HILM
53
1
0
01 Mar 2024
Functional Benchmarks for Robust Evaluation of Reasoning Performance,
  and the Reasoning Gap
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Saurabh Srivastava
B. AnnaroseM
V. AntoP
Shashank Menon
Ajay Sukumar
T. AdwaithSamod
Alan Philipose
Stevin Prince
Sooraj Thomas
ELM
ReLM
LRM
42
46
0
29 Feb 2024
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of
  LLMs as Mathematical Problem Solvers
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Qintong Li
Leyang Cui
Xueliang Zhao
Lingpeng Kong
Wei Bi
LRM
48
49
0
29 Feb 2024
ToolNet: Connecting Large Language Models with Massive Tools via Tool
  Graph
ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph
Xukun Liu
Zhiyuan Peng
Xiaoyuan Yi
Xing Xie
Lirong Xiang
Yuchen Liu
Dongkuan Xu
CLL
LLMAG
61
13
0
29 Feb 2024
Learning to Compress Prompt in Natural Language Formats
Learning to Compress Prompt in Natural Language Formats
Yu-Neng Chuang
Tianwei Xing
Chia-Yuan Chang
Zirui Liu
Xun Chen
Xia Hu
42
16
0
28 Feb 2024
CLLMs: Consistency Large Language Models
CLLMs: Consistency Large Language Models
Siqi Kou
Lanxiang Hu
Zhe He
Zhijie Deng
Hao Zhang
52
28
0
28 Feb 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems
  in Commonsense Reasoning
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Daojian Zeng
Kang Liu
Jun Zhao
LRM
56
8
0
28 Feb 2024
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the
  Key?
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
Qineng Wang
Zihao Wang
Ying Su
Hanghang Tong
Yangqiu Song
LLMAG
LRM
46
64
0
28 Feb 2024
Towards Generalist Prompting for Large Language Models by Mental Models
Towards Generalist Prompting for Large Language Models by Mental Models
Haoxiang Guan
Jiyan He
Shuxin Zheng
En-Hong Chen
Weiming Zhang
Neng H. Yu
LRM
67
0
0
28 Feb 2024
CogBench: a large language model walks into a psychology lab
CogBench: a large language model walks into a psychology lab
Julian Coda-Forno
Marcel Binz
Jane X. Wang
Eric Schulz
ELM
ALM
LLMAG
LM&MA
69
33
0
28 Feb 2024
Improving Open-Ended Text Generation via Adaptive Decoding
Improving Open-Ended Text Generation via Adaptive Decoding
Wenhong Zhu
Hong-ping Hao
Zhiwei He
Yiming Ai
Rui Wang
33
7
0
28 Feb 2024
Evaluating Quantized Large Language Models
Evaluating Quantized Large Language Models
Shiyao Li
Xuefei Ning
Luning Wang
Tengxuan Liu
Xiangsheng Shi
Shengen Yan
Guohao Dai
Huazhong Yang
Yu Wang
MQ
45
47
0
28 Feb 2024
No Token Left Behind: Reliable KV Cache Compression via Importance-Aware
  Mixed Precision Quantization
No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization
J. Yang
Byeongwook Kim
Jeongin Bae
Beomseok Kwon
Gunho Park
Eunho Yang
S. Kwon
Dongsoo Lee
MQ
47
45
0
28 Feb 2024
ResLoRA: Identity Residual Mapping in Low-Rank Adaption
ResLoRA: Identity Residual Mapping in Low-Rank Adaption
Shuhua Shi
Shaohan Huang
Minghui Song
Zhoujun Li
Zihan Zhang
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
AI4CE
25
14
0
28 Feb 2024
Transformer-based Parameter Estimation in Statistics
Transformer-based Parameter Estimation in Statistics
Xiaoxin Yin
David S. Yin
24
0
0
28 Feb 2024
Hire a Linguist!: Learning Endangered Languages with In-Context
  Linguistic Descriptions
Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions
Kexun Zhang
Yee Man Choi
Zhenqiao Song
Taiqi He
Luu Anh Tuan
Lei Li
39
17
0
28 Feb 2024
Adversarial Math Word Problem Generation
Adversarial Math Word Problem Generation
Roy Xie
Chengxuan Huang
Junlin Wang
Bhuwan Dhingra
AAML
41
1
0
27 Feb 2024
Stable LM 2 1.6B Technical Report
Stable LM 2 1.6B Technical Report
Marco Bellagente
J. Tow
Dakota Mahan
Duy Phung
Maksym Zhuravinskyi
...
Paulo Rocha
Harry Saini
H. Teufel
Niccoló Zanichelli
Carlos Riquelme
OSLM
54
52
0
27 Feb 2024
Are LLMs Capable of Data-based Statistical and Causal Reasoning?
  Benchmarking Advanced Quantitative Reasoning with Data
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
Xiao Liu
Zirui Wu
Xueqing Wu
Pan Lu
Kai-Wei Chang
Yansong Feng
ELM
LRM
40
31
0
27 Feb 2024
KoDialogBench: Evaluating Conversational Understanding of Language
  Models with Korean Dialogue Benchmark
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark
Seongbo Jang
Seonghyeon Lee
Hwanjo Yu
ELM
34
0
0
27 Feb 2024
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical
  Reasoning
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning
Debrup Das
Debopriyo Banerjee
Somak Aditya
Ashish Kulkarni
ReLM
LRM
38
10
0
27 Feb 2024
Reasoning in Conversation: Solving Subjective Tasks through Dialogue
  Simulation for Large Language Models
Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models
Xiaolong Wang
Yile Wang
Yuan Zhang
Ziyue Wang
Peng Li
Maosong Sun
Yang Liu
LRM
40
0
0
27 Feb 2024
Metasql: A Generate-then-Rank Framework for Natural Language to SQL
  Translation
Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation
Yuankai Fan
Zhenying He
Tonghui Ren
Can Huang
Yinan Jing
Kai Zhang
Xiaoyang Sean Wang
32
12
0
27 Feb 2024
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Mikayel Samvelyan
Sharath Chandra Raparthy
Andrei Lupu
Eric Hambro
Aram H. Markosyan
...
Minqi Jiang
Jack Parker-Holder
Jakob Foerster
Tim Rocktaschel
Roberta Raileanu
SyDa
83
67
0
26 Feb 2024
Nemotron-4 15B Technical Report
Nemotron-4 15B Technical Report
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
M. Patwary
Sandeep Subramanian
...
Ashwath Aithal
Oleksii Kuchaiev
Mohammad Shoeybi
Jonathan Cohen
Bryan Catanzaro
39
22
0
26 Feb 2024
A Comprehensive Evaluation of Quantization Strategies for Large Language
  Models
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
Renren Jin
Jiangcun Du
Wuwei Huang
Wei Liu
Jian Luan
Bin Wang
Deyi Xiong
MQ
34
31
0
26 Feb 2024
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent
  LLMs
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent LLMs
Sumedh Rasal
E. Hauer
32
0
0
26 Feb 2024
SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection
SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection
Liangxin Liu
Xuebo Liu
Derek F. Wong
Dongfang Li
Ziyi Wang
Baotian Hu
Min Zhang
60
17
0
26 Feb 2024
LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs
  from the Programming Language
LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs from the Programming Language
Ming Wang
Yuanzhong Liu
Xiaoyu Liang
Songlian Li
Yijie Huang
...
Shi Feng
Chi Zhang
Yifei Zhang
Minghui Zheng
Jigang Li
48
13
0
26 Feb 2024
Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and
  Their Applications
Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications
Weize Liu
Yinlong Xu
Hongxia Xu
Jintai Chen
Xuming Hu
Jian Wu
34
0
0
26 Feb 2024
MathGenie: Generating Synthetic Data with Question Back-translation for
  Enhancing Mathematical Reasoning of LLMs
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs
Zimu Lu
Aojun Zhou
Houxing Ren
Ke Wang
Weikang Shi
Junting Pan
Mingjie Zhan
Hongsheng Li
SyDa
LRM
53
45
0
26 Feb 2024
From Large Language Models and Optimization to Decision Optimization
  CoPilot: A Research Manifesto
From Large Language Models and Optimization to Decision Optimization CoPilot: A Research Manifesto
Segev Wasserkrug
Léonard Boussioux
D. Hertog
F. Mirzazadeh
Ilker Birbil
Jannis Kurtz
Donato Maragno
LLMAG
56
3
0
26 Feb 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
ChatMusician: Understanding and Generating Music Intrinsically with LLM
Ti-Fen Pan
Hanfeng Lin
Yi Wang
Zeyue Tian
Shangda Wu
...
Gus Xia
Roger Dannenberg
Wei Xue
Shiyin Kang
Yike Guo
101
36
0
25 Feb 2024
Previous
123...424344...616263
Next