Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation

Zongxia Li
Yapei Chang
Yuhang Zhou
Xiyang Wu
Zichao Liang
Yoo Yeon Sung
Jordan Lee Boyd-Graber
