ResearchTrend.AI
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine

Papers citing "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"

50 / 1,680 papers shown
Title
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
34
115
0
19 Aug 2021
Settling the Variance of Multi-Agent Policy Gradients
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
26
59
0
19 Aug 2021
Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization
Lu Wen
Songan Zhang
H. E. Tseng
Baljeet Singh
Dimitar Filev
H. Peng
OffRL
OnRL
27
1
0
19 Aug 2021
Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs
Timothy L. Molloy
G. Nair
30
13
0
19 Aug 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
38
7
0
16 Aug 2021
Offline-Online Reinforcement Learning for Energy Pricing in Office Demand Response: Lowering Energy and Data Costs
Doseok Jang
Lucas Spangher
Manan Khattar
Utkarsha Agwan
Selvaprabu Nadarajah
C. Spanos
OffRL
30
11
0
14 Aug 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
34
42
0
11 Aug 2021
Imitation Learning by Reinforcement Learning
K. Ciosek
33
18
0
10 Aug 2021
BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments
S. Srivastava
Chengshu Li
Michael Lingelbach
Roberto Martín-Martín
Fei Xia
...
Chenxi Liu
Silvio Savarese
H. Gweon
Jiajun Wu
Li Fei-Fei
LM&Ro
156
159
0
06 Aug 2021
iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks
Chengshu Li
Fei Xia
Roberto Martín-Martín
Michael Lingelbach
S. Srivastava
...
Karen Liu
H. Gweon
Jiajun Wu
Li Fei-Fei
Silvio Savarese
LM&Ro
168
225
0
06 Aug 2021
A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
28
16
0
06 Aug 2021
Policy Gradients Incorporating the Future
David Venuto
Elaine Lau
Doina Precup
Ofir Nachum
OffRL
19
9
0
04 Aug 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
35
9
0
04 Aug 2021
Risk Conditioned Neural Motion Planning
Xin Huang
Meng Feng
A. Jasour
Guy Rosman
B. Williams
29
7
0
04 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
33
37
0
04 Aug 2021
Variational Actor-Critic Algorithms
Yuhua Zhu
Lexing Ying
OffRL
15
0
0
03 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
36
38
0
31 Jul 2021
Tianshou: a Highly Modularized Deep Reinforcement Learning Library
Jiayi Weng
Huayu Chen
Dong Yan
Kaichao You
Alexis Duburcq
Minghao Zhang
Yi Su
Hang Su
Jun Zhu
NoLa
OffRL
41
196
0
29 Jul 2021
Autonomous Reinforcement Learning via Subgoal Curricula
Archit Sharma
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
27
27
0
27 Jul 2021
Trajectory Design for UAV-Based Internet-of-Things Data Collection: A Deep Reinforcement Learning Approach
Yang Wang
Zhen Gao
Jun Zhang
Xianbin Cao
Dezhi Zheng
Yue Gao
Derrick Wing Kwan Ng
M. Di Renzo
44
94
0
23 Jul 2021
Accelerating Quadratic Optimization with Reinforcement Learning
Jeffrey Ichnowski
Paras Jain
Bartolomeo Stellato
G. Banjac
Michael Luo
Francesco Borrelli
Joseph E. Gonzalez
Ion Stoica
Ken Goldberg
OffRL
21
36
0
22 Jul 2021
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
Wentao Bao
Qi Yu
Yu Kong
FAtt
32
39
0
21 Jul 2021
Improving exploration in policy gradient search: Application to symbolic optimization
Mikel Landajuela
Brenden K. Petersen
S. K. Kim
Claudio Santiago
Ruben Glatt
T. Nathan Mundhenk
Jacob F. Pettit
Daniel Faissol
27
16
0
19 Jul 2021
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
25
29
0
17 Jul 2021
Visual Adversarial Imitation Learning using Variational Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
SSL
33
49
0
16 Jul 2021
Conservative Objective Models for Effective Offline Model-Based Optimization
Brandon Trabucco
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
47
86
0
14 Jul 2021
Distributionally Robust Policy Learning via Adversarial Environment Generation
Allen Z. Ren
Anirudha Majumdar
OOD
103
15
0
13 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
79
0
12 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adria Puidomenech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
37
36
0
12 Jul 2021
Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics
Ya-Chien Chang
Sicun Gao
23
58
0
11 Jul 2021
Backprop-Free Reinforcement Learning with Active Neural Generative Coding
Alexander Ororbia
A. Mali
43
15
0
10 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
34
11
0
10 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
36
112
0
07 Jul 2021
Supervised Off-Policy Ranking
Yue Jin
Yue Zhang
Tao Qin
Xudong Zhang
Jian Yuan
Houqiang Li
Tie-Yan Liu
OffRL
37
5
0
03 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
44
135
0
01 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
30
141
0
01 Jul 2021
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
29
17
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
33
7
0
30 Jun 2021
Koopman Spectrum Nonlinear Regulators and Efficient Online Learning
Motoya Ohnishi
Isao Ishikawa
Kendall Lowrey
Masahiro Ikeda
Sham Kakade
Yoshinobu Kawahara
26
5
0
30 Jun 2021
The Values Encoded in Machine Learning Research
Abeba Birhane
Pratyusha Kalluri
Dallas Card
William Agnew
Ravit Dotan
Michelle Bao
41
275
0
29 Jun 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
Jaekyeom Kim
Seohong Park
Gunhee Kim
32
32
0
27 Jun 2021
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
48
6
0
26 Jun 2021
Active Learning in Robotics: A Review of Control Principles
Annalisa T. Taylor
Thomas A. Berrueta
Todd D. Murphey
38
71
0
25 Jun 2021
panda-gym: Open-source goal-conditioned environments for robotic learning
Quentin Gallouedec
Nicolas Cazin
Emmanuel Dellandrea
Liming Chen
OffRL
27
77
0
25 Jun 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
48
359
0
24 Jun 2021
The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto
Diana Borsa
Shaobo Hou
Gheorghe Comanici
Eser Aygun
...
Daniel Toyama
Jonathan J. Hunt
Shibl Mourad
David Silver
Doina Precup
38
98
0
24 Jun 2021
IQ-Learn: Inverse soft-Q Learning for Imitation
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
51
178
0
23 Jun 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
37
91
0
21 Jun 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
33
64
0
17 Jun 2021