Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.01069
Cited By
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
2 October 2020
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms"
5 / 5 papers shown
Title
On the Convergence of Discounted Policy Gradient Methods
Chris Nota
13
0
0
28 Dec 2022
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
29
18
0
07 Jun 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Romain Laroche
Rémi Tachet des Combes
46
2
0
15 Feb 2022
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
46
8
0
29 Sep 2021
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
146
0
04 May 2020
1