ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.10081
  4. Cited By
Revisiting Discrete Soft Actor-Critic

Revisiting Discrete Soft Actor-Critic

21 September 2022
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
ArXivPDFHTML

Papers citing "Revisiting Discrete Soft Actor-Critic"

29 / 29 papers shown
Title
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation
James R. Han
Hugues Thomas
Jian Zhang
Nicholas Rhinehart
Timothy D. Barfoot
82
1
0
17 Feb 2025
Honor of Kings Arena: an Environment for Generalization in Competitive
  Reinforcement Learning
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning
Hua Wei
Jingxiao Chen
Xiyang Ji
Hongyang Qin
Minwen Deng
...
Lin Liu
Lanxiao Huang
Deheng Ye
Qiang Fu
Wei Yang
52
28
0
18 Sep 2022
Target Entropy Annealing for Discrete Soft Actor-Critic
Target Entropy Annealing for Discrete Soft Actor-Critic
Yaosheng Xu
Dailin Hu
Litian Liang
Stephen Marcus McAleer
Pieter Abbeel
Roy Fox
64
10
0
06 Dec 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with
  On-Policy Experience
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
31
30
0
24 Sep 2021
Training Larger Networks for Deep Reinforcement Learning
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
52
39
0
16 Feb 2021
Which Heroes to Pick? Learning to Draft in MOBA Games with Neural
  Networks and Tree Search
Which Heroes to Pick? Learning to Draft in MOBA Games with Neural Networks and Tree Search
Sheng Chen
Menghui Zhu
Deheng Ye
Weinan Zhang
Qiang Fu
Wei Yang
43
29
0
18 Dec 2020
Towards Playing Full MOBA Games with Deep Reinforcement Learning
Towards Playing Full MOBA Games with Deep Reinforcement Learning
Deheng Ye
Guibin Chen
Wen Zhang
Sheng Chen
Bo Yuan
...
Tengfei Shi
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
24
182
0
25 Nov 2020
Supervised Learning Achieves Human-Level Performance in MOBA Games: A
  Case Study of Honor of Kings
Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Deheng Ye
Guibin Chen
P. Zhao
Fuhao Qiu
Bo Yuan
...
Liang Wang
Tengfei Shi
Qiang Fu
Wei Yang
Lanxiao Huang
48
49
0
25 Nov 2020
Softmax Deep Double Deterministic Policy Gradients
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
80
87
0
19 Oct 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via
  Metagradient
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
24
20
0
03 Jul 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic
  with Advantage Weighted Mixture Policy(SAC-AWMP)
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
85
15
0
07 Feb 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
25
175
0
09 Jan 2020
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Deheng Ye
Zhao Liu
Mingfei Sun
Bei Shi
P. Zhao
...
Tengfei Shi
Liang Wang
Qiang Fu
Wei Yang
Lanxiao Huang
36
314
0
20 Dec 2019
Better Exploration with Optimistic Actor-Critic
Better Exploration with Optimistic Actor-Critic
K. Ciosek
Q. Vuong
R. Loftin
Katja Hofmann
38
150
0
28 Oct 2019
Soft Actor-Critic for Discrete Action Settings
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
123
294
0
16 Oct 2019
Improving Exploration in Soft-Actor-Critic with Normalizing Flows
  Policies
Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies
Patrick Nadeem Ward
Ariella Smofsky
A. Bose
21
58
0
06 Jun 2019
Soft Actor-Critic Algorithms and Applications
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
94
2,391
0
13 Dec 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
59
471
0
14 Jun 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
136
5,121
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
186
8,236
0
04 Jan 2018
Visualizing the Loss Landscape of Neural Nets
Visualizing the Loss Landscape of Neural Nets
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
224
1,873
0
28 Dec 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
89
2,255
0
06 Oct 2017
A Distributional Perspective on Reinforcement Learning
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
65
1,497
0
21 Jul 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
201
18,685
0
20 Jul 2017
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
157
8,805
0
04 Feb 2016
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
115
7,590
0
22 Sep 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
168
13,174
0
09 Sep 2015
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
224
6,722
0
19 Feb 2015
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
86
12,163
0
19 Dec 2013
1