Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification

Papers citing "Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification"

12 / 12 papers shown
