Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.03854
Cited By
Mirror Descent Actor Critic via Bounded Advantage Learning
6 February 2025
Ryo Iwaki
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mirror Descent Actor Critic via Bounded Advantage Learning"
5 / 5 papers shown
Title
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
Matthieu Geist
OffRL
39
9
0
16 Aug 2021
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
61
403
0
22 Jun 2020
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
84
317
0
31 Jan 2019
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
206
18,685
0
20 Jul 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
57
1,329
0
27 Feb 2017
1