
2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
Papers citing "2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision"
Title |
---|
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization. Yuxin Jiang, Bo Huang, Yufei Wang, Xingshan Zeng, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Wei Wang |