ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.01072
  4. Cited By
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
v1v2 (latest)

A Survey of Temporal Credit Assignment in Deep Reinforcement Learning

2 December 2023
Eduardo Pignatelli
Johan Ferret
Matthieu Geist
Thomas Mesnard
Hado van Hasselt
Olivier Pietquin
Laura Toni
ArXiv (abs)PDFHTML

Papers citing "A Survey of Temporal Credit Assignment in Deep Reinforcement Learning"

2 / 2 papers shown
Title
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng
Ruixi Qiao
Lijun Li
Chao Guo
Jianmin Wang
Gang Xiong
Yisheng Lv
Fei-Yue Wang
LRM
406
5
0
21 Apr 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai
Haoran Sun
Huang Fang
Shuohuan Wang
Yu Sun
Hua Wu
444
3
0
03 Oct 2024
1