arXiv:2410.01729
Evaluating Robustness of Reward Models for Mathematical Reasoning
2 October 2024
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
Papers citing "Evaluating Robustness of Reward Models for Mathematical Reasoning"
3 / 3 papers shown
On the Robustness of Reward Models for Language Model Alignment
Jiwoo Hong
Noah Lee
Eunki Kim
Guijin Son
Woojin Chung
Aman Gupta
Shao Tang
James Thorne
12 May 2025
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics
Ting-Ruen Wei
Haowei Liu
Xuyang Wu
Yi Fang
LRM
AI4CE
ReLM
KELM
21 Feb 2025
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
Hyungjoo Chae
Taeyoon Kwon
Seungjun Moon
Yongho Song
Dongjin Kang
Kai Tzu-iunn Ong
Beong-woo Kwak
SeongHyeon Bae
Seung-won Hwang
Jinyoung Yeo
29 Sep 2024