Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.00103
Cited By
v1
v2 (latest)
Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards
30 May 2025
Xun Lu
Yunyi Yang
Yongbo Gai
Kai Luo
Shihao Huang
Jianhe Lin
Xiaoxi Jiang
Guanjun Jiang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards"
1 / 1 papers shown
Title
Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks
Yifei Xu
Tusher Chakraborty
Srinagesh Sharma
Leonardo Nunes
Emre Kıcıman
Songwu Lu
Ranveer Chandra
OffRL
LRM
28
1
0
16 Jun 2025
1