ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.01729
  4. Cited By
Evaluating Robustness of Reward Models for Mathematical Reasoning

Evaluating Robustness of Reward Models for Mathematical Reasoning

2 October 2024
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
ArXivPDFHTML

Papers citing "Evaluating Robustness of Reward Models for Mathematical Reasoning"

3 / 3 papers shown
Title
On the Robustness of Reward Models for Language Model Alignment
On the Robustness of Reward Models for Language Model Alignment
Jiwoo Hong
Noah Lee
Eunki Kim
Guijin Son
Woojin Chung
Aman Gupta
Shao Tang
James Thorne
29
0
0
12 May 2025
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics
Ting-Ruen Wei
Haowei Liu
Xuyang Wu
Yi Fang
LRM
AI4CE
ReLM
KELM
220
1
0
21 Feb 2025
Coffee-Gym: An Environment for Evaluating and Improving Natural Language
  Feedback on Erroneous Code
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
Hyungjoo Chae
Taeyoon Kwon
Seungjun Moon
Yongho Song
Dongjin Kang
Kai Tzu-iunn Ong
Beong-woo Kwak
SeongHyeon Bae
Seung-won Hwang
Jinyoung Yeo
31
3
0
29 Sep 2024
1