2002.02829
Cited By
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
7 February 2020
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
Papers citing "Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)"
2 / 2 papers shown
Title
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
35
0
0
05 Jul 2023
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
51
12
0
21 Sep 2022