ResearchTrend.AI
Actor-critic is implicitly biased towards high entropy optimal policies


21 October 2021
Yuzheng Hu
Ziwei Ji
Matus Telgarsky

Papers citing "Actor-critic is implicitly biased towards high entropy optimal policies"

2 / 2 papers shown
Title
Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes
Johannes Muller
Semih Cayci
06 Jun 2024
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
19 Oct 2022