Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.05316
Cited By
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
5 June 2025
Yifan Sun
Jingyan Shen
Yibin Wang
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
Huan Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay"
Title
No papers