ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.16025
  4. Cited By
Exploring and Addressing Reward Confusion in Offline Preference Learning

Exploring and Addressing Reward Confusion in Offline Preference Learning

22 July 2024
Xin Chen
Sam Toyer
Florian Shkurti
    OffRL
ArXivPDFHTML

Papers citing "Exploring and Addressing Reward Confusion in Offline Preference Learning"

1 / 1 papers shown
Title
Defining and Characterizing Reward Hacking
Defining and Characterizing Reward Hacking
Joar Skalse
Nikolaus H. R. Howe
Dmitrii Krasheninnikov
David M. Krueger
59
56
0
27 Sep 2022
1