Crowd-PrefRL: Preference-Based Reward Learning from Crowds
v1v2 (latest)

Crowd-PrefRL: Preference-Based Reward Learning from Crowds

Papers citing "Crowd-PrefRL: Preference-Based Reward Learning from Crowds"

27 / 27 papers shown
Title
Learning Reward Functions from Scale Feedback
Learning Reward Functions from Scale Feedback
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
89
34
0
01 Oct 2021

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.