ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.15922
  4. Cited By
Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition

Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition

21 May 2025
Dong Won Lee
Hae Won Park
C. Breazeal
Louis-Philippe Morency
ArXiv (abs)PDFHTML

Papers citing "Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition"

1 / 1 papers shown
Title
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu
Yuhang Jiang
Boyuan Wang
Yixiu Mao
Cheems Wang
Chang-Shu Liu
Xiangyang Ji
190
8
0
10 Jan 2025
1