Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.08358
Cited By
v1
v2 (latest)
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
13 December 2023
Anand Siththaranjan
Cassidy Laidlaw
Dylan Hadfield-Menell
Re-assign community
ArXiv (abs)
PDF
HTML
Github (29★)
Papers citing
"Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF"
2 / 52 papers shown
Title
Crowd-PrefRL: Preference-Based Reward Learning from Crowds
David Chhan
Ellen R. Novoseller
Vernon J. Lawhern
161
5
0
17 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
132
112
0
08 Jan 2024
Previous
1
2