Soft Actor-Critic with Cross-Entropy Policy Optimization

21 December 2021

Papers citing "Soft Actor-Critic with Cross-Entropy Policy Optimization"

13 / 13 papers shown

Title
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past Che Wang George Andriopoulos 48 45 0 10 Jun 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies Riley Simmons-Edler Ben Eisner E. Mitchell Sebastian Seung Daniel D. Lee 93 29 0 25 Mar 2019
CEM-RL: Combining evolutionary and gradient-based methods for policy search Aloïs Pourchot Olivier Sigaud 87 161 0 02 Oct 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation Dmitry Kalashnikov A. Irpan P. Pastor Julian Ibarz Alexander Herzog ... Deirdre Quillen E. Holly Mrinal Kalakrishnan Vincent Vanhoucke Sergey Levine 146 1,472 0 27 Jun 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 204 5,229 0 26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 319 8,436 0 04 Jan 2018
Deep Reinforcement Learning that Matters Peter Henderson Riashat Islam Philip Bachman Joelle Pineau Doina Precup David Meger OffRL 147 1,963 0 19 Sep 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 595 19,317 0 20 Jul 2017
Parameter Space Noise for Exploration Matthias Plappert Rein Houthooft Prafulla Dhariwal Szymon Sidor Richard Y. Chen Xi Chen Tamim Asfour Pieter Abbeel Marcin Andrychowicz 89 597 0 06 Jun 2017
OpenAI Gym Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang Wojciech Zaremba OffRL ODL 225 5,089 0 05 Jun 2016
Benchmarking Deep Reinforcement Learning for Continuous Control Yan Duan Xi Chen Rein Houthooft John Schulman Pieter Abbeel OffRL 103 1,695 0 22 Apr 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 353 13,297 0 09 Sep 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 285 6,808 0 19 Feb 2015