Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,680 papers shown
Title
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
37
15
0
31 Jul 2023
Rating-based Reinforcement Learning
Devin White
Mingkang Wu
Ellen R. Novoseller
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
ALM
24
6
0
30 Jul 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
38
5
0
26 Jul 2023
Communication-Efficient Orchestrations for URLLC Service via Hierarchical Reinforcement Learning
Wei Shi
Milad Ganjalizadeh
H. S. Ghadikolaei
M. Petrova
AI4CE
11
2
0
25 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
64
2
0
21 Jul 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
45
23
0
21 Jul 2023
Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games
Jonas Gillberg
Joakim Bergdahl
Alessandro Sestini
Andy Eakins
Linus Gisslén
OffRL
33
7
0
19 Jul 2023
Image-based Regularization for Action Smoothness in Autonomous Miniature Racing Car with Deep Reinforcement Learning
Hoang-Giang Cao
I. Lee
Bo-Jiun Hsu
Zheng-Yi Lee
Yu-Wei Shih
Hsueh-Cheng Wang
I-Chen Wu
39
2
0
17 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
49
5
0
16 Jul 2023
Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review
Chengmin Zhou
Chao Wang
Haseeb Hassan
H. Shah
Bingding Huang
Pasi Fränti
3DV
43
3
0
16 Jul 2023
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation
Wenhao Ding
Laixi Shi
Yuejie Chi
Ding Zhao
OOD
43
18
0
15 Jul 2023
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
38
5
0
13 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
56
15
0
10 Jul 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
49
0
0
05 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
28
25
0
05 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Human-like Decision-making at Unsignalized Intersection using Social Value Orientation
Yan Tong
Licheng Wen
Pinlong Cai
Daocheng Fu
Song Mao
Yikang Li
46
2
0
30 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
35
1
0
21 Jun 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
Hang Wang
Sen Lin
Junshan Zhang
29
19
0
20 Jun 2023
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication
Adam Callaghan
Karl Mason
Patrick Mannion
39
2
0
20 Jun 2023
Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards
Semih Cayci
A. Eryilmaz
36
2
0
20 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
41
3
0
17 Jun 2023
Active Policy Improvement from Multiple Black-box Oracles
Xuefeng Liu
Takuma Yoneda
Chaoqi Wang
Matthew R. Walter
Yuxin Chen
50
9
0
17 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
35
13
0
15 Jun 2023
DiAReL: Reinforcement Learning with Disturbance Awareness for Robust Sim2Real Policy Transfer in Robot Control
M. Malmir
Josip Josifovski
Noah Klarmann
Alois C. Knoll
48
2
0
15 Jun 2023
VIBR: Learning View-Invariant Value Functions for Robust Visual Control
Tom Dupuis
Jaonary Rabarisoa
Q. C. Pham
David Filliat
49
0
0
14 Jun 2023
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
43
1
0
14 Jun 2023
Self-Paced Absolute Learning Progress as a Regularized Approach to Curriculum Learning
Tobias Niehues
Ulla Scheler
Pascal Klink
37
0
0
09 Jun 2023
Learning to Navigate in Turbulent Flows with Aerial Robot Swarms: A Cooperative Deep Reinforcement Learning Approach
Diego Patiño
Siddharth Mayya
J. Calderon
Kostas Daniilidis
David Saldaña
32
3
0
07 Jun 2023
FAMO: Fast Adaptive Multitask Optimization
B. Liu
Yihao Feng
Peter Stone
Qian Liu
48
32
0
06 Jun 2023
Identifiability and Generalizability in Constrained Inverse Reinforcement Learning
Andreas Schlaginhaufen
Maryam Kamgarpour
34
10
0
01 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
34
23
0
01 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
41
9
0
29 May 2023
Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms
Peiyao Xiao
Hao Ban
Kaiyi Ji
48
19
0
28 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
41
1
0
28 May 2023
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets
Dinghuai Zhang
H. Dai
Nikolay Malkin
Aaron Courville
Yoshua Bengio
L. Pan
43
36
0
26 May 2023
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time Delays
Lucy McCutcheon
Saber Fallah
38
0
0
26 May 2023
Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Tom Bewley
J. Lawry
Arthur G. Richards
32
1
0
26 May 2023
Reward-Machine-Guided, Self-Paced Reinforcement Learning
Cevahir Köprülü
Ufuk Topcu
31
3
0
25 May 2023
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
39
11
0
25 May 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
Guozheng Ma
Linrui Zhang
Haoyu Wang
Lu Li
Zilin Wang
Zhen Wang
Li Shen
Xueqian Wang
Dacheng Tao
56
10
0
25 May 2023
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning
Mhairi Dunion
Trevor A. McInroe
K. Luck
Josiah P. Hanna
Stefano V. Albrecht
OOD
DRL
27
18
0
23 May 2023
RLBoost: Boosting Supervised Models using Deep Reinforcement Learning
Eloy Anguiano Batanero
Ángela Fernández Pascual
Á. Jiménez
OffRL
18
0
0
23 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
35
3
0
22 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
46
40
0
22 May 2023
Road Planning for Slums via Deep Reinforcement Learning
Y. Zheng
Hongyuan Su
Jingtao Ding
Depeng Jin
Yong Li
19
13
0
22 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Matteo Biagiola
Paolo Tonella
46
19
0
22 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
37
6
0
22 May 2023
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
Yameng Zhang
Long Bai
Li Liu
Hongliang Ren
Max Q.-H. Meng
32
9
0
18 May 2023
Previous
1
2
3
...
9
10
11
...
32
33
34
Next