ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.19850
  4. Cited By
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

26 May 2025
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
    OffRL
ArXiv (abs)PDFHTML

Papers citing "DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning"

7 / 7 papers shown
Title
Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging
Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging
Ryo Bertolissi
Jonas Hübotter
Ido Hakimi
Andreas Krause
MoMeMoE
61
1
0
20 May 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Yiping Wang
Qing Yang
Zhiyuan Zeng
Liliang Ren
Liu Liu
...
Jianfeng Gao
Weizhu Chen
Shuaiqiang Wang
Simon Shaolei Du
Yelong Shen
OffRLReLMLRM
330
47
0
29 Apr 2025
TTRL: Test-Time Reinforcement Learning
TTRL: Test-Time Reinforcement Learning
Yuxin Zuo
Kaiyan Zhang
Li Sheng
Li Sheng
Xuekai Zhu
...
Youbang Sun
Zhiyuan Ma
Lifan Yuan
Ning Ding
Bowen Zhou
OffRL
414
31
0
22 Apr 2025
One-Minute Video Generation with Test-Time Training
One-Minute Video Generation with Test-Time Training
Karan Dalal
Daniel Koceja
Gashon Hussein
Jiarui Xu
Yue Zhao
...
Tatsunori Hashimoto
Sanmi Koyejo
Yejin Choi
Yu Sun
Xiaolong Wang
ViT
181
13
0
07 Apr 2025
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition
Toby Simonds
Akira Yoshiyama
LRM
115
6
0
02 Mar 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLMVLMOffRLAI4TSLRM
390
2,024
0
22 Jan 2025
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Yu Sun
Xinhao Li
Karan Dalal
Jiarui Xu
Arjun Vikram
...
Xinlei Chen
Xiaolong Wang
Sanmi Koyejo
Tatsunori Hashimoto
Carlos Guestrin
143
113
0
05 Jul 2024
1