2410.07672
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
10 October 2024
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
Papers citing
"MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization"
6 / 6 papers shown
Title
Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society
Feifei Zhao
Y. Wang
Enmeng Lu
Dongcheng Zhao
Bing Han
...
Chao Liu
Yaodong Yang
Yi Zeng
Boyuan Chen
Jinyu Fan
83
0
0
24 Apr 2025
Cognitive Debiasing Large Language Models for Decision-Making
Yougang Lyu
Shijie Ren
Yue Feng
Zihan Wang
Z. Chen
Z. Z. Ren
Maarten de Rijke
36
0
0
05 Apr 2025
Information Retrieval for Climate Impact
Maarten de Rijke
Bart van den Hurk
Flora Salim
Alaa Al Khourdajie
Nan Bai
...
Edmund Totin
Andrew Trotman
Ramamurthy Valavandan
Dereje Workneh
Yangxinyu Xie
SyDa
59
1
0
01 Apr 2025
Research on Superalignment Should Advance Now with Parallel Optimization of Competence and Conformity
HyunJin Kim
Xiaoyuan Yi
Jing Yao
Muhua Huang
Jinyeong Bak
James Evans
Xing Xie
44
0
0
08 Mar 2025
How to Mitigate Overfitting in Weak-to-strong Generalization?
Junhao Shi
Qinyuan Cheng
Zhaoye Fei
Y. Zheng
Qipeng Guo
Xipeng Qiu
70
0
0
06 Mar 2025
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Bofei Gao
Feifan Song
Yibo Miao
Zefan Cai
Z. Yang
...
Houfeng Wang
Zhifang Sui
Peiyi Wang
Baobao Chang
50
11
0
04 Sep 2024