arXiv:2510.00911
RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training
1 October 2025
Tao Ren
Jinyang Jiang
Hui Yang
Wan Tian
Minhao Zou
Guanghao Li
Zishi Zhang
Qinghao Wang
Shentao Qin
Yanjun Zhao
Rui Tao
Hui Shao
Yijie Peng
HuggingFace (1 upvote)
GitHub (3★)
Papers citing "RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training"
No papers found