

Deep Conservative Policy Iteration


24 June 2019
Nino Vieillard
Olivier Pietquin
Matthieu Geist

Papers citing "Deep Conservative Policy Iteration"

8 / 8 papers shown
Title
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
Matthieu Geist
Shie Mannor
50
1
0
12 Mar 2023
Variance-Reduced Conservative Policy Iteration
Naman Agarwal
Brian Bullins
Karan Singh
32
3
0
12 Dec 2022
Learning to Constrain Policy Optimization with Virtual Trust Region
Hung Le
Thommen Karimpanal George
Majid Abdolshah
D. Nguyen
Kien Do
Sunil R. Gupta
Svetha Venkatesh
41
3
0
20 Apr 2022
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
30
29
0
17 Jul 2021
Ensuring Monotonic Policy Improvement in Entropy-regularized Value-based Reinforcement Learning
Lingwei Zhu
Takamitsu Matsubara
22
4
0
25 Aug 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
41
17
0
09 Mar 2020
On Connections between Constrained Optimization and Reinforcement Learning
Nino Vieillard
Olivier Pietquin
Matthieu Geist
14
13
0
18 Oct 2019
Modified Actor-Critics
Erinc Merdivan
S. Hanke
Matthieu Geist
24
2
0
02 Jul 2019