v1v2 (latest)
Regularization of Soft Actor-Critic Algorithms with Automatic Temperature Adjustment

Abstract
This work presents a comprehensive analysis to regularize the Soft Actor-Critic (SAC) algorithm with automatic temperature adjustment. The the policy evaluation, the policy improvement and the temperature adjustment are reformulated, addressing certain modification and enhancing the clarity of the original theory in a more explicit manner.
View on arXivComments on this paper