2205.14318
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions
28 May 2022
Ansong Ni
J. Inala
Chenglong Wang
Oleksandr Polozov
Christopher Meek
Dragomir R. Radev
Jianfeng Gao
ReLM
AIMat
LRM
Papers citing "Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions"
9 / 9 papers shown
Title
To Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-Maximization
Haozhe Wang
Long Li
Chao Qu
Fengming Zhu
Weidi Xu
Wei Chu
Fangzhen Lin
70
1
0
02 Feb 2025
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
OSLM
LRM
110
416
0
03 Jan 2025
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
Hao Ma
Tianyi Hu
Zhiqiang Pu
Boyin Liu
Xiaolin Ai
Yanyan Liang
Min Chen
55
3
0
08 Oct 2024
PORT: Preference Optimization on Reasoning Traces
Salem Lahlou
Abdalgader Abubaker
Hakim Hacid
LRM
46
2
0
23 Jun 2024
AICoderEval: Improving AI Domain Code Generation of Large Language Models
Yinghui Xia
Yuyan Chen
Tianyu Shi
Jun Wang
Jinsong Yang
34
3
0
07 Jun 2024
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
Li Zhong
Zilong Wang
Jingbo Shang
29
48
0
25 Feb 2024
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering
Pragya Srivastava
Manuj Malik
Vivek Gupta
T. Ganu
Dan Roth
25
15
0
17 Feb 2024
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
Zihan Wang
Yunxuan Li
Yuexin Wu
Liangchen Luo
Le Hou
Hongkun Yu
Jingbo Shang
LRM
45
20
0
05 Feb 2024
TinyGSM: achieving >80% on GSM8k with small language models
Bingbin Liu
Sébastien Bubeck
Ronen Eldan
Janardhan Kulkarni
Yuanzhi Li
Anh Nguyen
Rachel A. Ward
Yi Zhang
ALM
32
47
0
14 Dec 2023