Variance Reduction based Experience Replay for Policy Optimization

17 October 2021

Papers citing "Variance Reduction based Experience Replay for Policy Optimization"

Title
Digital Twin Calibration with Model-Based Reinforcement Learning | Hua Zheng, Wei Xie, I. Ryzhov, Keilung Choy | 39 0 0 | 04 Jan 2025
A Finite Time Analysis of Two Time-Scale Actor Critic Methods | Yue Wu, Weitong Zhang, Pan Xu, Quanquan Gu | 90 146 0 | 04 May 2020
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation | Harshat Kumar, Alec Koppel, Alejandro Ribeiro | 102 79 0 | 18 Oct 2019