2403.05006
Cited By
Provable Multi-Party Reinforcement Learning with Diverse Human Feedback
8 March 2024
Huiying Zhong
Zhun Deng
Weijie J. Su
Zhiwei Steven Wu
Linjun Zhang
Papers citing "Provable Multi-Party Reinforcement Learning with Diverse Human Feedback"
5 / 5 papers shown
Title
Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
26
0
0
05 Jun 2025
Learning Guarantee of Reward Modeling Using Deep Neural Networks
Yuanhang Luo
Yeheng Ge
Ruijian Han
Guohao Shen
71
0
0
10 May 2025
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
420
0
0
27 Apr 2025
Clone-Robust AI Alignment
Ariel D. Procaccia
Benjamin G. Schiffer
Shirley Zhang
48
3
0
17 Jan 2025
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
141
41
0
30 Apr 2024