Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.05905
Cited By
Soft Actor-Critic Algorithms and Applications
13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic Algorithms and Applications"
50 / 487 papers shown
Title
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
32
8
0
20 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
35
10
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
26
4
0
17 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
51
110
0
17 Jun 2022
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
21
0
0
13 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
40
31
0
10 Jun 2022
Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy
Hongzhi Hua
Kaigui Wu
Guixuan Wen
24
0
0
10 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
34
52
0
09 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
46
17
0
08 Jun 2022
Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer
Minrui Wang
Ming Feng
Wen-gang Zhou
Houqiang Li
38
9
0
08 Jun 2022
FishGym: A High-Performance Physics-based Simulation Framework for Underwater Robot Learning
Wenji Liu
Kai-Yi Bai
Xuming He
Shuran Song
Changxi Zheng
Xiaopei Liu
AI4CE
37
12
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert Platt
Chris Amato
OffRL
35
35
0
02 Jun 2022
Control of Two-way Coupled Fluid Systems with Differentiable Solvers
B. Ramos
Felix Trost
Nils Thuerey
AI4CE
27
5
0
01 Jun 2022
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
D. Mguni
Aivar Sootla
Juliusz Ziomek
Oliver Slumbers
Zipeng Dai
Kun Shao
Jun Wang
47
6
0
31 May 2022
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
55
7
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert Platt
32
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
32
39
0
27 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. DÓro
Pierre-Luc Bacon
Rameswar Panda
OnRL
96
182
0
16 May 2022
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
28
0
0
16 May 2022
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Alex X. Lee
Coline Devin
Jost Tobias Springenberg
Yuxiang Zhou
Thomas Lampe
A. Abdolmaleki
Konstantinos Bousmalis
OffRL
OnRL
31
15
0
06 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
34
12
0
02 May 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
16
19
0
11 Apr 2022
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled Hand
Leon Sievers
Johannes Pitz
Berthold Bäuml
29
38
0
07 Apr 2022
Visual-Tactile Multimodality for Following Deformable Linear Objects Using Reinforcement Learning
Leszek Pecyna
Siyuan Dong
Shan Luo
24
21
0
31 Mar 2022
Monte Carlo Tree Search based Hybrid Optimization of Variational Quantum Circuits
Jiahao Yao
Haoya Li
Marin Bukov
Lin Lin
Lexing Ying
21
15
0
30 Mar 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
36
19
0
23 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
24
40
0
19 Mar 2022
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning
Jinxin Liu
Hongyin Zhang
Donglin Wang
OffRL
38
33
0
13 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
41
226
0
09 Mar 2022
Investigation of Factorized Optical Flows as Mid-Level Representations
Hsuan-Kung Yang
Tsu-Ching Hsiao
Tingbo Liao
Hsu-Shen Liu
Li-Yuan Tsao
Tzu-Wen Wang
Shan Yang
Yu-Wen Chen
Huang-ru Liao
Chun-Yi Lee
35
3
0
09 Mar 2022
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Xiaobai Ma
David Isele
Jayesh K. Gupta
K. Fujimura
Mykel J. Kochenderfer
19
5
0
06 Mar 2022
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
Yunze Liu
Yun-Hai Liu
Chen Jiang
Kangbo Lyu
Weikang Wan
Hao Shen
Bo-Hua Liang
Zhoujie Fu
He Wang
Li Yi
50
173
0
03 Mar 2022
RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization
Nikolaos Kourtzanidis
Sajad Saeedi
42
2
0
26 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization
Brandon Trabucco
Xinyang Geng
Aviral Kumar
Sergey Levine
OffRL
37
95
0
17 Feb 2022
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control
Killian Dally
E. Kampen
27
16
0
16 Feb 2022
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration
Sravan Jayanthi
Letian Chen
Matthew C. Gombolay
30
0
0
14 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
40
65
0
13 Feb 2022
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Nate Gruver
Marc Finzi
Samuel Stanton
A. Wilson
AI4CE
28
40
0
10 Feb 2022
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
Stephen James
Pieter Abbeel
35
9
0
08 Feb 2022
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
J. Ide
Daria Mićović
Michael J. Guarino
K. Alcedo
D. Rosenbluth
Adrian P. Pope
18
3
0
07 Feb 2022
Malleable Agents for Re-Configurable Robotic Manipulators
Athindran Ramesh Kumar
Gurudutt Hosangadi
32
0
0
04 Feb 2022
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
71
31
0
28 Jan 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
29
44
0
28 Jan 2022
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven Learning in Artificial Intelligence Tasks
Chenyu Sun
Hangwei Qian
Chunyan Miao
18
10
0
20 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
38
100
0
11 Jan 2022
Mixture of basis for interpretable continual learning with distribution shifts
Mengda Xu
Sumitra Ganesh
Pranay Pasula
OOD
37
1
0
05 Jan 2022
Previous
1
2
3
...
10
5
6
7
8
9
Next