ResearchTrend.AI

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,658 papers shown
Title
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
72
86
0
19 Oct 2020
Robot Navigation in Constrained Pedestrian Environments using Reinforcement Learning
Claudia Pérez-D'Arpino
Can Liu
P. Goebel
Roberto Martín-Martín
Silvio Savarese
39
65
0
16 Oct 2020
Learning Dexterous Manipulation from Suboptimal Experts
Rae Jeong
Jost Tobias Springenberg
Jackie Kay
Daniel Zheng
Yuxiang Zhou
Alexandre Galashov
N. Heess
F. Nori
OffRL
18
36
0
16 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
34
55
0
15 Oct 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
40
93
0
12 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
24
22
0
09 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Ossama Ahmed
Frederik Träuble
Anirudh Goyal
Alexander Neitz
Yoshua Bengio
Bernhard Schölkopf
M. Wuthrich
Stefan Bauer
CML
42
120
0
08 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
29
7
0
08 Oct 2020
Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving
Jiahao Yao
Lin Lin
Marin Bukov
BDL
AI4CE
40
61
0
07 Oct 2020
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
135
59
0
06 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
25
7
0
04 Oct 2020
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
Lanqing Li
Rui Yang
Dijun Luo
OffRL
33
10
0
02 Oct 2020
Learning to swim in potential flow
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
13
40
0
30 Sep 2020
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
Haotian Fu
Hongyao Tang
Jianye Hao
Chong Chen
Xidong Feng
Dong Li
Wulong Liu
OffRL
37
51
0
29 Sep 2020
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn
Anjukan Kathirgamanathan
Kacper Twardowski
E. Mangina
D. Finn
8
21
0
22 Sep 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
OffRL
32
5
0
21 Sep 2020
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control
Qingrui Zhang
Hao Dong
Wei Pan
29
6
0
20 Sep 2020
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control
Ian Fox
Joyce M. Lee
R. Pop-Busui
Jenna Wiens
BDL
OffRL
30
50
0
18 Sep 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic
Lin Shao
Yifan You
Mengyuan Yan
Qingyun Sun
Jeannette Bohg
29
23
0
18 Sep 2020
Elastica: A compliant mechanics environment for soft robotic control
Noel M. Naughton
Jiarui Sun
Arman Tekinalp
Girish Chowdhary
M. Gazzola
14
86
0
17 Sep 2020
Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation
Wenhao Ding
Baiming Chen
Yue Liu
Kim Ji Eun
Ding Zhao
AAML
16
100
0
16 Sep 2020
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Mingyang Wan
X. Hu
27
52
0
16 Sep 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL
DRL
288
341
0
14 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
58
610
0
10 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
28
95
0
04 Sep 2020
Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning
Nathan Lambert
Craig B. Schindler
Daniel S. Drew
K. Pister
22
6
0
02 Sep 2020
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun
Da Huo
Furong Huang
AAML
OffRL
OnRL
37
49
0
02 Sep 2020
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning
Taisuke Kobayashi
OffRL
18
7
0
23 Aug 2020
Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning
Florian Fuchs
Yunlong Song
Elia Kaufmann
Davide Scaramuzza
Peter Dürr
26
124
0
18 Aug 2020
Linear Disentangled Representations and Unsupervised Action Estimation
Matthew Painter
Jonathon S. Hare
Adam Prugel-Bennett
CoGe
DRL
39
20
0
18 Aug 2020
Offline Meta-Reinforcement Learning with Advantage Weighting
E. Mitchell
Rafael Rafailov
Xue Bin Peng
Sergey Levine
Chelsea Finn
OffRL
38
104
0
13 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
29
72
0
08 Aug 2020
SafePILCO: a software tool for safe and data-efficient policy synthesis
Kyriakos Polymenakos
Nikitas Rontsis
Alessandro Abate
Stephen J. Roberts
32
6
0
07 Aug 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
26
42
0
02 Aug 2020
Towards Deep Robot Learning with Optimizer applicable to Non-stationary Problems
Taisuke Kobayashi
ODL
20
9
0
31 Jul 2020
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
32
50
0
31 Jul 2020
Understanding the Stability of Deep Control Policies for Biped Locomotion
Hwangpil Park
R. Yu
Yoonsang Lee
Kyungho Lee
Jehee Lee
22
9
0
30 Jul 2020
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
43
1
0
29 Jul 2020
Learning Object-conditioned Exploration using Distributed Soft Actor Critic
Ayzaan Wahid
Austin Stone
Kevin Chen
Brian Ichter
Alexander Toshev
27
22
0
29 Jul 2020
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies
Shengpu Tang
Aditya Modi
Michael Sjoding
Jenna Wiens
OffRL
22
25
0
24 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
25
4
0
24 Jul 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
35
175
0
24 Jul 2020
Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search
Yuan Tian
Qin Wang
Zhiwu Huang
Wen Li
Dengxin Dai
Minghao Yang
Jun Wang
Olga Fink
OffRL
24
60
0
17 Jul 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kai Zhang
Sham Kakade
Tamer Başar
Lin F. Yang
52
120
0
15 Jul 2020
Efficient Empowerment Estimation for Unsupervised Stabilization
Ruihan Zhao
Kevin Lu
Pieter Abbeel
Stas Tiomkin
32
8
0
14 Jul 2020
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda Khadka
Estelle Aflalo
Mattias Marder
Avrech Ben-David
Santiago Miret
Shie Mannor
Tamir Hazan
Hanlin Tang
Somdeb Majumdar
GNN
32
11
0
14 Jul 2020
Control as Hybrid Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
21
9
0
11 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
11
19
0
09 Jul 2020
Natural Emergence of Heterogeneous Strategies in Artificially Intelligent Competitive Teams
A. Deka
Katia Sycara
AAML
31
32
0
06 Jul 2020