ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.02951
  4. Cited By
A Fisher-Rao gradient flow for entropy-regularised Markov decision
  processes in Polish spaces

A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

4 October 2023
B. Kerimkulov
J. Leahy
David Siska
Lukasz Szpruch
Yufei Zhang
ArXivPDFHTML

Papers citing "A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces"

7 / 7 papers shown
Title
Efficient Learning for Entropy-Regularized Markov Decision Processes via Multilevel Monte Carlo
Efficient Learning for Entropy-Regularized Markov Decision Processes via Multilevel Monte Carlo
Matthieu Meunier
C. Reisinger
Yufei Zhang
53
0
0
27 Mar 2025
Fisher-Rao Gradient Flow: Geodesic Convexity and Functional Inequalities
Fisher-Rao Gradient Flow: Geodesic Convexity and Functional Inequalities
José A. Carrillo
Yifan Chen
Daniel Zhengyu Huang
Jiaoyang Huang
Dongyi Wei
AI4CE
40
3
0
22 Jul 2024
Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes
Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes
Johannes Muller
Semih Cayci
50
0
0
06 Jun 2024
Mirror Descent-Ascent for mean-field min-max problems
Mirror Descent-Ascent for mean-field min-max problems
Razvan-Andrei Lascu
Mateusz B. Majka
Lukasz Szpruch
56
1
0
12 Feb 2024
Towards a Theoretical Foundation of Policy Optimization for Learning
  Control Policies
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kai Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Bacsar
96
27
0
10 Oct 2022
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M Sadler
Pratap Tokekar
Alec Koppel
65
17
0
28 Jan 2022
Policy Mirror Descent for Reinforcement Learning: Linear Convergence,
  New Sampling Complexity, and Generalized Problem Classes
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
106
137
0
30 Jan 2021
1