
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Papers citing "Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games"
Title |
---|
Soft Actor-Critic Algorithms and Applications. Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, ... Vikash Kumar, Henry Zhu, Abhishek Gupta, Pieter Abbeel, Sergey Levine |