2505.20196
Cited By
Temporal Sampling for Forgotten Reasoning in LLMs
26 May 2025
Yuetai Li
Zhangchen Xu
Fengqing Jiang
Bhaskar Ramasubramanian
Luyao Niu
Bill Yuchen Lin
Xiang Yue
Radha Poovendran
CLL
KELM
LRM
Papers citing "Temporal Sampling for Forgotten Reasoning in LLMs"
6 / 6 papers shown
Title
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue
Zhiqi Chen
Rui Lu
Andrew Zhao
Zhaokai Wang
Yang Yue
Shiji Song
Gao Huang
ReLM
LRM
253
128
0
18 Apr 2025
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Wei Xiong
Jiarui Yao
Yuhui Xu
Bo Pang
Lei Wang
...
Junnan Li
Nan Jiang
Tong Zhang
Caiming Xiong
Hanze Dong
OffRL
LRM
131
32
0
15 Apr 2025
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models
Yuxiang Lai
Shitian Zhao
Ming Li
Jike Zhong
Xiaofeng Yang
OffRL
LRM
LM&MA
VLM
205
31
0
18 Mar 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
449
2,033
0
22 Jan 2025
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Nikhil Sardana
Jacob P. Portes
Sasha Doubov
Jonathan Frankle
LRM
455
88
0
31 Dec 2023
An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
Yun Luo
Zhen Yang
Fandong Meng
Yafu Li
Jie Zhou
Yue Zhang
CLL
KELM
221
319
0
17 Aug 2023