Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.02273
Cited By
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
3 April 2025
Hung Le
Dai Do
D. Nguyen
Svetha Venkatesh
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models"
1 / 1 papers shown
Title
UFO-RL: Uncertainty-Focused Optimization for Efficient Reinforcement Learning Data Selection
Yang Zhao
Kai Xiong
Xiao Ding
Li Du
YangouOuyang
...
Feiyu Xiong
Bin Liu
Dong Hu
Bing Qin
Ting Liu
OffRL
4
0
0
18 May 2025
1