A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

2 October 2020

Papers citing "A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms"

5 / 5 papers shown

Title
On the Convergence of Discounted Policy Gradient Methods Chris Nota 13 0 0 28 Dec 2022
On the Role of Discount Factor in Offline Reinforcement Learning Haotian Hu Yiqin Yang Qianchuan Zhao Chongjie Zhang OffRL 29 18 0 07 Jun 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms Romain Laroche Rémi Tachet des Combes 46 2 0 15 Feb 2022
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates Romain Laroche Rémi Tachet des Combes 46 8 0 29 Sep 2021
A Finite Time Analysis of Two Time-Scale Actor Critic Methods Yue Wu Weitong Zhang Pan Xu Quanquan Gu 90 146 0 04 May 2020