2312.01072
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
2 December 2023
Eduardo Pignatelli, Johan Ferret, Matthieu Geist, Thomas Mesnard, Hado van Hasselt, Olivier Pietquin, Laura Toni
Papers citing "A Survey of Temporal Credit Assignment in Deep Reinforcement Learning"
2 / 2 papers shown
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng, Ruixi Qiao, Lijun Li, Chao Guo, Jianmin Wang, Gang Xiong, Yisheng Lv, Fei-Yue Wang
LRM
411
5
0
21 Apr 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai, Haoran Sun, Huang Fang, Shuohuan Wang, Yu Sun, Hua Wu
444
3
0
03 Oct 2024