Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,669 papers shown
The Distracting Control Suite -- A Challenging Benchmark for Reinforcement Learning from Pixels
Austin Stone
Oscar Ramirez
K. Konolige
Rico Jonschkowski
141
101
0
07 Jan 2021
Zero-shot sim-to-real transfer of tactile control policies for aggressive swing-up manipulation
Thomas Bi
Carmelo Sferrazza
Raffaello D'Andrea
109
35
0
07 Jan 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback
Tal Lancewicki
Aviv A. Rosenberg
Yishay Mansour
43
32
0
29 Dec 2020
A Tutorial on Sparse Gaussian Processes and Variational Inference
Felix Leibfried
Vincent Dutordoir
S. T. John
N. Durrande
GP
42
49
0
27 Dec 2020
Stability-Certified Reinforcement Learning via Spectral Normalization
R. Takase
N. Yoshikawa
T. Mariyama
T. Tsuchiya
26
4
0
26 Dec 2020
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
27
4
0
24 Dec 2020
Rethink AI-based Power Grid Control: Diving Into Algorithm Design
Xiren Zhou
Siqi Wang
R. Diao
Desong Bian
Jiahui Duan
Di Shi
22
4
0
23 Dec 2020
Mobile Robot Planner with Low-cost Cameras Using Deep Reinforcement Learning
M. Tran
N. Ly
18
1
0
21 Dec 2020
CityLearn: Standardizing Research in Multi-Agent Reinforcement Learning for Demand Response and Urban Energy Management
José R. Vázquez-Canteli
Sourav Dey
G. Henze
Zoltán Nagy
AI4CE
6
62
0
18 Dec 2020
Online Service Migration in Mobile Edge with Incomplete System Information: A Deep Recurrent Actor-Critic Learning Approach
Jin Wang
Jia Hu
Geyong Min
Qiang Ni
Tarek A. El-Ghazawi
33
28
0
16 Dec 2020
Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
29
0
0
15 Dec 2020
Grounding Artificial Intelligence in the Origins of Human Behavior
Eleni Nisioti
Clément Moulin-Frier
AI4CE
47
5
0
15 Dec 2020
Policy Gradient RL Algorithms as Directed Acyclic Graphs
J. Luis
31
0
0
14 Dec 2020
A Reinforcement Learning Formulation of the Lyapunov Optimization: Application to Edge Computing Systems with Queue Stability
Sohee Bae
Seungyul Han
Y. Sung
35
9
0
14 Dec 2020
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget
Erik Wijmans
Irfan Essa
Dhruv Batra
3DPC
35
10
0
11 Dec 2020
Multi-expert learning of adaptive legged locomotion
Chuanyu Yang
Kai Yuan
Qiuguo Zhu
Wanming Yu
Zhibin Li
111
184
0
10 Dec 2020
Battery Model Calibration with Deep Reinforcement Learning
Ajaykumar Unagar
Yuan Tian
M. A. Chao
Olga Fink
24
1
0
07 Dec 2020
RLOC: Terrain-Aware Legged Locomotion using Reinforcement Learning and Optimal Control
Siddhant Gangapurwala
Mathieu Geisert
Romeo Orsolino
Maurice F. Fallon
Ioannis Havoutis
41
114
0
05 Dec 2020
Detect, Reject, Correct: Crossmodal Compensation of Corrupted Sensors
Michelle A. Lee
Matthew Tan
Yuke Zhu
Jeannette Bohg
49
25
0
01 Dec 2020
Self-supervised Visual Reinforcement Learning with Object-centric Representations
Andrii Zadaianchuk
Maximilian Seitzer
Georg Martius
SSL
OCL
27
41
0
29 Nov 2020
Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer
Zohreh Raziei
Mohsen Moghaddam
28
25
0
27 Nov 2020
Learning from Simulation, Racing in Reality
Eugenio Chisari
Alexander Liniger
Alisa Rupenyan
Luc Van Gool
John Lygeros
33
25
0
26 Nov 2020
Large-Scale Multi-Agent Deep FBSDEs
T. Chen
Ziyi Wang
Ioannis Exarchos
Evangelos A. Theodorou
37
4
0
21 Nov 2020
Model-based Reinforcement Learning for Continuous Control with Posterior Sampling
Ying Fan
Yifei Ming
33
17
0
20 Nov 2020
MRAC-RL: A Framework for On-Line Policy Adaptation Under Parametric Model Uncertainty
A. Guha
Anuradha M. Annaswamy
31
12
0
20 Nov 2020
Fault-Aware Robust Control via Adversarial Reinforcement Learning
Fan Yang
Chao Yang
Di Guo
Huaping Liu
F. Sun
42
4
0
17 Nov 2020
Learning Dense Rewards for Contact-Rich Manipulation Tasks
Zheng Wu
Wenzhao Lian
Vaibhav Unhelkar
Masayoshi Tomizuka
S. Schaal
8
37
0
17 Nov 2020
Distilling a Hierarchical Policy for Planning and Control via Representation and Reinforcement Learning
Jung-Su Ha
Young-Jin Park
Hyeok-Joo Chae
Soon-Seo Park
Han-Lim Choi
35
3
0
16 Nov 2020
Towards Learning Controllable Representations of Physical Systems
Kevin Haninger
R. Vicente-Garcia
J. Krüger
31
1
0
16 Nov 2020
Meta Automatic Curriculum Learning
Rémy Portelas
Clément Romac
Katja Hofmann
Pierre-Yves Oudeyer
35
8
0
16 Nov 2020
A Geometric Perspective on Self-Supervised Policy Adaptation
Cristian Bodnar
Karol Hausman
Gabriel Dulac-Arnold
Rico Jonschkowski
SSL
44
5
0
14 Nov 2020
Robust Quadruped Jumping via Deep Reinforcement Learning
Guillaume Bellegarda
Chuong H. Nguyen
Quan Nguyen
32
63
0
13 Nov 2020
ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning
Yufei Wang
G. Narasimhan
Xingyu Lin
Brian Okorn
David Held
OffRL
LRM
30
13
0
13 Nov 2020
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
OffRL
21
5
0
13 Nov 2020
Learning Latent Representations to Influence Multi-Agent Interaction
Annie Xie
Dylan P. Losey
R. Tolsma
Chelsea Finn
Dorsa Sadigh
DRL
23
131
0
12 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
18
105
0
12 Nov 2020
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
52
122
0
11 Nov 2020
Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement Learning
Jun Jin
D. Graves
Cameron Haigh
Jun Luo
Martin Jägersand
SSL
OffRL
27
6
0
11 Nov 2020
Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks
Pierre Aumjaud
David McAuliffe
Francisco J. Rodríguez-Lera
P. Cardiff
19
15
0
11 Nov 2020
Proximal Policy Optimization via Enhanced Exploration Efficiency
Junwei Zhang
Zhenghao Zhang
Shuai Han
Shuai Lu
34
41
0
11 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
33
33
0
10 Nov 2020
f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Tianwei Ni
Harshit S. Sikchi
Yufei Wang
Tejus Gupta
Lisa Lee
Benjamin Eysenbach
13
72
0
09 Nov 2020
MAGIC: Learning Macro-Actions for Online POMDP Planning
Yiyuan Lee
Panpan Cai
David Hsu
30
21
0
07 Nov 2020
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Amal Feriani
Ekram Hossain
40
237
0
06 Nov 2020
Learning a Decentralized Multi-arm Motion Planner
Huy Ha
Jingxi Xu
Shuran Song
23
51
0
05 Nov 2020
Cooperative Heterogeneous Deep Reinforcement Learning
Han Zheng
Pengfei Wei
Jing Jiang
Guodong Long
Qinghua Lu
Chengqi Zhang
51
12
0
02 Nov 2020
Measuring and Harnessing Transference in Multi-Task Learning
Christopher Fifty
Ehsan Amid
Zhe Zhao
Tianhe Yu
Rohan Anil
Chelsea Finn
30
15
0
29 Oct 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
23
34
0
27 Oct 2020
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
42
39
0
27 Oct 2020