ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2601.22448
  4. Cited By
HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

30 January 2026
Weiqi Wang
Xin Liu
Binxuan Huang
Hejie Cui
Rongzhi Zhang
Changlong Yu
Shuowei Jin
Jingfeng Yang
Qingyu Yin
Zhengyang Wang
Zheng Li
Yifan Gao
Priyanka Nigam
Bing Yin
Lihong Li
Yangqiu Song
    OffRLLRM
ArXiv (abs)PDFHTMLGithub (1★)

Papers citing "HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning"

0 / 0 papers shown

No papers found

Page 1 of 0