Title |
---|
![]() Towards a Unified View of Preference Learning for Large Language Models:
A Survey Bofei Gao Feifan Song Yibo Miao Zefan Cai Zhengyuan Yang ...Houfeng Wang Zhifang Sui Peiyi Wang Baobao Chang Baobao Chang |
![]() I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative
Self-Enhancement Paradigm Yiming Liang Ge Zhang Xingwei Qu Tianyu Zheng Jiawei Guo ...Jiaheng Liu Chenghua Lin Lei Ma Wenhao Huang Jiajun Zhang |