RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training

1 October 2025
Tao Ren
Jinyang Jiang
Hui Yang
Wan Tian
Minhao Zou
Guanghao Li
Zishi Zhang
Qinghao Wang
Shentao Qin
Yanjun Zhao
Rui Tao
Hui Shao
Yijie Peng
ArXiv (abs) | PDF | HTML | HuggingFace (1 upvote) | GitHub (3★)

Papers citing "RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training"

0 / 0 papers shown
No papers found