From Demonstrations to Rewards: Alignment Without Explicit Human Preferences

15 March 2025
Siliang Zeng, Yao Liu, Huzefa Rangwala, George Karypis, Mingyi Hong, Rasool Fakoor
ArXiv · PDF · HTML

Papers citing "From Demonstrations to Rewards: Alignment Without Explicit Human Preferences"

1 of 1 papers shown

DMRL: Data- and Model-aware Reward Learning for Data Extraction
Zhiqiang Wang, Ruoxi Cheng
07 May 2025