ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.23749
  4. Cited By
Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?

Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?

29 May 2025
Paul Gölz
Nika Haghtalab
Kunhe Yang
ArXiv (abs)PDFHTML

Papers citing "Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?"

2 / 2 papers shown
Title
Jackpot! Alignment as a Maximal Lottery
Jackpot! Alignment as a Maximal Lottery
Roberto-Rafael Maura-Rivero
Marc Lanctot
Francesco Visin
Kate Larson
113
7
0
31 Jan 2025
Clone-Robust AI Alignment
Ariel D. Procaccia
Benjamin G. Schiffer
Shirley Zhang
48
3
0
17 Jan 2025
1