Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.05109
Cited By
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
11 December 2019
Riashat Islam
Raihan Seraj
Samin Yeasar Arnob
Doina Precup
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning"
3 / 3 papers shown
Title
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
35
0
0
07 Apr 2025
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
35
10
0
04 Nov 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
49
24
0
23 Feb 2021
1