Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.02795
Cited By
Towards a Unified View of Preference Learning for Large Language Models: A Survey
4 September 2024
Bofei Gao
Feifan Song
Yibo Miao
Zefan Cai
Z. Yang
Liang Chen
Helan Hu
Runxin Xu
Qingxiu Dong
Ce Zheng
Shanghaoran Quan
Wen Xiao
Daoguang Zan
K. Lu
Keming Lu
Bowen Yu
Zeyu Cui
Zeyu Cui
Lei Sha
Lei Sha
Houfeng Wang
Zhifang Sui
Peiyi Wang
Baobao Chang
Baobao Chang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards a Unified View of Preference Learning for Large Language Models: A Survey"
8 / 8 papers shown
Title
Steerable Chatbots: Personalizing LLMs with Preference-Based Activation Steering
Jessica Y. Bo
Tianyu Xu
Ishan Chatterjee
Katrina Passarella-Ward
Achin Kulshrestha
D Shin
LLMSV
79
0
0
07 May 2025
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
Yafu Li
Xuyang Hu
Xiaoye Qu
Linjie Li
Yu-Xi Cheng
53
3
0
22 Jan 2025
Aligning CodeLLMs with Direct Preference Optimization
Yibo Miao
Bofei Gao
Shanghaoran Quan
Junyang Lin
Daoguang Zan
J. Liu
Jian Yang
Tianyu Liu
Zhijie Deng
58
5
0
24 Oct 2024
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
Chris Liu
Liang Zeng
J. Liu
Rui Yan
Jujie He
Chaojie Wang
Shuicheng Yan
Yang Liu
Yahui Zhou
AI4TS
46
63
0
24 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
57
6
0
10 Oct 2024
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Wei Chen
Lin Li
Yongqi Yang
Bin Wen
Fan Yang
Tingting Gao
Yu Wu
Long Chen
VLM
VGen
47
6
0
15 Jun 2024
Decoding-time Realignment of Language Models
Tianlin Liu
Shangmin Guo
Leonardo Bianco
Daniele Calandriello
Quentin Berthet
Felipe Llinares-López
Jessica Hoffmann
Lucas Dixon
Michal Valko
Mathieu Blondel
AI4CE
54
35
0
05 Feb 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,953
0
04 Mar 2022
1