ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.11937
  4. Cited By
Value Imprint: A Technique for Auditing the Human Values Embedded in
  RLHF Datasets

Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets

18 November 2024
Ike Obi
Rohan Pant
Srishti Shekhar Agrawal
Maham Ghazanfar
Aaron Basiletti
ArXiv (abs)PDFHTML

Papers citing "Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets"

1 / 1 papers shown
Title
AI Alignment at Your Discretion
AI Alignment at Your Discretion
Maarten Buyl
Hadi Khalaf
C. M. Verdun
Lucas Monteiro Paes
Caio Vieira Machado
Flavio du Pin Calmon
114
1
0
10 Feb 2025
1