ResearchTrend.AI
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

9 April 2025
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Ameya Prabhu
Matthias Bethge
ReLM, ALM, LRM

Papers citing "A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility"

7 / 57 papers shown
Evaluating the Performance of Reinforcement Learning Algorithms
Scott M. Jordan
Yash Chandak
Daniel Cohen
Mengxue Zhang
Philip S. Thomas
58
47
0
30 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
70
224
0
10 Jun 2020
A Metric Learning Reality Check
Kevin Musgrave
Serge J. Belongie
Ser-Nam Lim
145
479
0
18 Mar 2020
Measuring the Reliability of Reinforcement Learning Algorithms
Stephanie C. Y. Chan
Sam Fishman
John F. Canny
Anoop Korattikara Balan
S. Guadarrama
57
84
0
10 Dec 2019
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
45
93
0
21 Jun 2018
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
60
101
0
07 Jun 2018
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
125
1,963
0
19 Sep 2017