ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.14403
  4. Cited By
Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning

Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning

20 May 2025
Zhaohui Yang
Shilei Jiang
Chen Hu
Linjing Li
Shihong Deng
D. Jiang
    OffRL
ArXivPDFHTML

Papers citing "Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning"

Title
No papers