ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05905
  4. Cited By
Soft Actor-Critic Algorithms and Applications

Soft Actor-Critic Algorithms and Applications

13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic Algorithms and Applications"

50 / 487 papers shown
Title
Mastering Visual Continuous Control: Improved Data-Augmented
  Reinforcement Learning
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
36
338
0
20 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of
  Sparse Reward Iterative Tasks
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
29
11
0
10 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
39
66
0
08 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
36
112
0
07 Jul 2021
IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control
IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control
Xiaoyan Cao
Yaowen Yao
Lanqing Li
Wanpeng Zhang
Zhicheng An
...
Li Xiao
Shihui Guo
Xiaoyu Cao
Meihong Wu
Dijun Luo
11
19
0
06 Jul 2021
Multi-Modal Mutual Information (MuMMI) Training for Robust
  Self-Supervised Deep Reinforcement Learning
Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning
Kaiqi Chen
Yong Lee
Harold Soh
SSL
33
20
0
06 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
30
141
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and
  Neuroscience
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
33
7
0
30 Jun 2021
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic
  Data via Stereo
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo
Thomas Kollar
Michael Laskey
Kevin Stone
Brijen Thananjeyan
Mark Tjersland
53
25
0
30 Jun 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
Unsupervised Skill Discovery with Bottleneck Option Learning
Jaekyeom Kim
Seohong Park
Gunhee Kim
32
32
0
27 Jun 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body
  Simulation
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
48
355
0
24 Jun 2021
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic
  Manipulation via Discretisation
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
Stephen James
Kentaro Wada
Tristan Laidlow
Andrew J. Davison
38
124
0
23 Jun 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual
  Policies
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
33
64
0
17 Jun 2021
Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation
Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation
Alex Church
John Lloyd
R. Hadsell
Nathan Lepora
35
50
0
16 Jun 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
Saurabh Kumar
Ramki Gummadi
Dale Schuurmans
34
15
0
13 Jun 2021
Bayesian Bellman Operators
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
42
15
0
09 Jun 2021
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for
  Reinforcement Learning
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
25
46
0
08 Jun 2021
XIRL: Cross-embodiment Inverse Reinforcement Learning
XIRL: Cross-embodiment Inverse Reinforcement Learning
Kevin Zakka
Andy Zeng
Peter R. Florence
Jonathan Tompson
Jeannette Bohg
Debidatta Dwibedi
SSL
43
119
0
07 Jun 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit
  Differentiation
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
54
35
0
06 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without
  Interference
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
21
46
0
05 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement
  Learning
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
26
5
0
01 Jun 2021
What Matters for Adversarial Imitation Learning?
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
M. Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
55
77
0
01 Jun 2021
Continual World: A Robotic Benchmark For Continual Reinforcement
  Learning
Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Maciej Wołczyk
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
OffRL
19
89
0
23 May 2021
Reinforcement learning of rare diffusive dynamics
Reinforcement learning of rare diffusive dynamics
Avishek Das
Dominic C. Rose
J. P. Garrahan
David T. Limmer
24
27
0
10 May 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise
  Rollouts
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
27
35
0
07 May 2021
Benchmarking Structured Policies and Policy Optimization for Real-World
  Dexterous Object Manipulation
Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Niklas Funk
Charles B. Schaff
Rishabh Madan
Takuma Yoneda
Julen Urain De Jesus
...
Stefan Bauer
S. Srinivasa
Tapomayukh Bhattacharjee
Matthew R. Walter
Jan Peters
37
35
0
05 May 2021
Hierarchical Reinforcement Learning for Air-to-Air Combat
Hierarchical Reinforcement Learning for Air-to-Air Combat
Adrian P. Pope
J. Ide
Daria Mićović
Henry Diaz
D. Rosenbluth
Lee Ritholtz
Jason C. Twedt
Thayne T. Walker
K. Alcedo
D. Javorsek
25
72
0
03 May 2021
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural
  Networks
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks
Hao Peng
Ruitong Zhang
Yingtong Dou
Renyu Yang
Jingyi Zhang
Philip S. Yu
41
115
0
16 Apr 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
21
2
0
26 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with
  Deep Reinforcement Learning
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
48
43
0
25 Mar 2021
Self-Imitation Learning by Planning
Self-Imitation Learning by Planning
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
35
85
0
25 Mar 2021
CLAMGen: Closed-Loop Arm Motion Generation via Multi-view Vision-Based
  RL
CLAMGen: Closed-Loop Arm Motion Generation via Multi-view Vision-Based RL
Iretiayo Akinola
Zizhao Wang
Peter K. Allen
42
2
0
24 Mar 2021
Efficient Deep Reinforcement Learning with Imitative Expert Priors for
  Autonomous Driving
Efficient Deep Reinforcement Learning with Imitative Expert Priors for Autonomous Driving
Zhiyu Huang
Jingda Wu
Chen Lv
24
133
0
19 Mar 2021
Near Optimal Policy Optimization via REPS
Near Optimal Policy Optimization via REPS
Aldo Pacchiano
Jonathan Lee
Peter L. Bartlett
Ofir Nachum
23
3
0
17 Mar 2021
Offline Reinforcement Learning with Fisher Divergence Critic
  Regularization
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
Ilya Kostrikov
Jonathan Tompson
Rob Fergus
Ofir Nachum
OffRL
29
300
0
14 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
50
176
0
10 Mar 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
20
520
0
04 Feb 2021
Learning Skills to Navigate without a Master: A Sequential Multi-Policy
  Reinforcement Learning Algorithm
Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm
Ambedkar Dukkipati
Rajarshi Banerjee
Ranga Shaarad Ayyagari
Dhaval Parmar Udaybhai
27
6
0
30 Jan 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
A Tutorial on Sparse Gaussian Processes and Variational Inference
A Tutorial on Sparse Gaussian Processes and Variational Inference
Felix Leibfried
Vincent Dutordoir
S. T. John
N. Durrande
GP
42
49
0
27 Dec 2020
Imitation Learning for High Precision Peg-in-Hole Tasks
Imitation Learning for High Precision Peg-in-Hole Tasks
S. Gubbi
Shishir Kolathaya
B. Amrutur
37
21
0
26 Dec 2020
Battery Model Calibration with Deep Reinforcement Learning
Battery Model Calibration with Deep Reinforcement Learning
Ajaykumar Unagar
Yuan Tian
M. A. Chao
Olga Fink
24
1
0
07 Dec 2020
RLOC: Terrain-Aware Legged Locomotion using Reinforcement Learning and
  Optimal Control
RLOC: Terrain-Aware Legged Locomotion using Reinforcement Learning and Optimal Control
Siddhant Gangapurwala
Mathieu Geisert
Romeo Orsolino
Maurice F. Fallon
Ioannis Havoutis
41
114
0
05 Dec 2020
Generalization in Reinforcement Learning by Soft Data Augmentation
Generalization in Reinforcement Learning by Soft Data Augmentation
Nicklas Hansen
Xiaolong Wang
OffRL
17
168
0
26 Nov 2020
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem
Eric Liang
Zhanghao Wu
Michael Luo
Sven Mika
Joseph E. Gonzalez
Ion Stoica
AI4CE
23
9
0
25 Nov 2020
Learning Navigation Skills for Legged Robots with Learned Robot
  Embeddings
Learning Navigation Skills for Legged Robots with Learned Robot Embeddings
Joanne Truong
Denis Yarats
Tianyu Li
Franziska Meier
Sonia Chernova
Dhruv Batra
Akshara Rai
23
20
0
24 Nov 2020
Multi-agent Reinforcement Learning Accelerated MCMC on Multiscale
  Inversion Problem
Multi-agent Reinforcement Learning Accelerated MCMC on Multiscale Inversion Problem
Eric T. Chung
Y. Efendiev
W. Leung
Sai-Mang Pun
Zecheng Zhang
19
12
0
17 Nov 2020
State-Dependent Temperature Control for Langevin Diffusions
State-Dependent Temperature Control for Langevin Diffusions
Xuefeng Gao
Z. Xu
X. Zhou
30
27
0
15 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with
  Interaction
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
18
105
0
12 Nov 2020
Reinforcement Learning Experiments and Benchmark for Solving Robotic
  Reaching Tasks
Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks
Pierre Aumjaud
David McAuliffe
Francisco J. Rodríguez-Lera
P. Cardiff
19
15
0
11 Nov 2020
Previous
123...10789
Next