Distributed Soft Actor-Critic with Multivariate Reward Representation
and Knowledge Distillation

v1v2 (latest)

Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation

29 November 2019

ArXiv (abs)PDF HTML

Papers citing "Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation"

11 / 11 papers shown

Title
Soft Actor-Critic Algorithms and Applications Tuomas Haarnoja Aurick Zhou Kristian Hartikainen George Tucker Sehoon Ha ... Vikash Kumar Henry Zhu Abhishek Gupta Pieter Abbeel Sergey Levine 136 2,445 0 13 Dec 2018
Distributed Distributional Deterministic Policy Gradients Gabriel Barth-Maron Matthew W. Hoffman David Budden Will Dabney Dan Horgan TB Dhruva Alistair Muldal N. Heess Timothy Lillicrap OffRL 86 480 0 23 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning Tuomas Haarnoja Kristian Hartikainen Pieter Abbeel Sergey Levine BDL 74 193 0 09 Apr 2018
Distributed Prioritized Experience Replay Dan Horgan John Quan David Budden Gabriel Barth-Maron Matteo Hessel H. V. Hasselt David Silver 147 741 0 02 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 180 5,187 0 26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 311 8,352 0 04 Jan 2018
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 517 19,065 0 20 Jul 2017
Hindsight Experience Replay Marcin Andrychowicz Dwight Crow Alex Ray Jonas Schneider Rachel Fong Peter Welinder Bob McGrew Joshua Tobin Pieter Abbeel Wojciech Zaremba OffRL 255 2,328 0 05 Jul 2017
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 223 3,789 0 18 Nov 2015
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 320 13,248 0 09 Sep 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.9K 150,115 0 22 Dec 2014