2410.04203
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
5 October 2024
Hanyang Zhao
Genta Indra Winata
Anirban Das
Shi-Xiong Zhang
D. Yao
Wenpin Tang
Sambit Sahu
Papers citing
"RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization"
2 / 2 papers shown
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Hanyang Zhao
Haoxian Chen
Yucheng Guo
Genta Indra Winata
Tingting Ou
Ziyu Huang
D. Yao
Wenpin Tang
59
0
0
13 Mar 2025
Understanding the Logic of Direct Preference Alignment through Logic
Kyle Richardson
Vivek Srikumar
Ashish Sabharwal
85
2
0
23 Dec 2024