1801.01290
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
BEVNav: Robot Autonomous Navigation Via Spatial-Temporal Contrastive Learning in Bird's-Eye View
Jiahao Jiang
Yuxiang Yang
Yingqi Deng
Chenlong Ma
Jing Zhang
SSL
91
4
0
03 Sep 2024
Revisiting Safe Exploration in Safe Reinforcement learning
David Eckel
Baohe Zhang
Joschka Bödecker
81
0
0
02 Sep 2024
AI Olympics challenge with Evolutionary Soft Actor Critic
Marco Calì
Alberto Sinigaglia
Niccolò Turcato
R. Carli
Gian Antonio Susto
52
3
0
02 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
144
0
0
02 Sep 2024
Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving
Zilin Huang
Zihao Sheng
Sikai Chen
171
4
0
01 Sep 2024
Diffusion Policy Policy Optimization
Allen Z. Ren
Justin Lidard
Lars L. Ankile
Anthony Simeonov
Pulkit Agrawal
Anirudha Majumdar
Benjamin Burchfiel
Hongkai Dai
Max Simchowitz
171
57
0
01 Sep 2024
Robust off-policy Reinforcement Learning via Soft Constrained Adversary
Kosuke Nakanishi
Akihiro Kubo
Yuji Yasui
Shin Ishii
89
0
0
31 Aug 2024
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control
Zihao Sheng
Zilin Huang
Sikai Chen
96
10
0
30 Aug 2024
Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits
Woojin Jeong
Seungki Min
89
0
0
28 Aug 2024
RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models
Pritthijit Nath
Henry Moss
Emily Shuckburgh
Mark Webb
AI4Cl
AI4CE
171
0
0
28 Aug 2024
Artificially intelligent Maxwell's demon for optimal control of open quantum systems
P. A. Erdman
R. Czupryniak
Bibek Bhandari
Andrew N. Jordan
Frank Noé
J. Eisert
Giacomo Guarnieri
69
3
0
27 Aug 2024
Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
Jihwan Lee
Woochang Sim
Sejin Kim
Sundong Kim
OffRL
108
2
0
27 Aug 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
105
5
0
27 Aug 2024
Equivariant Reinforcement Learning under Partial Observability
Hai Nguyen
Andrea Baisero
David Klee
Dian Wang
Robert Platt
Christopher Amato
93
16
0
26 Aug 2024
Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search
Shuo Yang
Caojun Wang
Yuanjian Zhang
Yuming Yin
Yanjun Huang
Shengbo Eben Li
Hong Chen
50
0
0
26 Aug 2024
Safe Policy Exploration Improvement via Subgoals
Brian Angulo
G. Gorbov
Aleksandr I. Panov
Konstantin Yakovlev
OffRL
81
0
0
25 Aug 2024
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning
Zhongjian Qiao
Jiafei Lyu
Kechen Jiao
Qi Liu
Xiu Li
OffRL
73
4
0
23 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
139
1
0
23 Aug 2024
Reduce, Reuse, Recycle: Categories for Compositional Reinforcement Learning
Georgios Bakirtzis
M. Savvas
Ruihan Zhao
Sandeep Chinchali
Ufuk Topcu
129
2
0
23 Aug 2024
A Safety-Oriented Self-Learning Algorithm for Autonomous Driving: Evolution Starting from a Basic Model
Shuo Yang
Caojun Wang
Zhenyu Ma
Yanjun Huang
Hong Chen
46
0
0
22 Aug 2024
A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems
Shuo Yang
Liwen Wang
Yanjun Huang
Hong Chen
80
0
0
22 Aug 2024
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
106
10
0
21 Aug 2024
Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
Donghoon Kim
Minjong Yoo
Honguk Woo
OffRL
75
0
0
21 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
204
4
0
20 Aug 2024
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
189
0
0
19 Aug 2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
S. Poddar
Yanming Wan
Hamish Ivison
Abhishek Gupta
Natasha Jaques
113
50
0
19 Aug 2024
The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
Renye Yan
Yaozhong Gan
You Wu
Ling Liang
Junliang Xing
Yimao Cai
Ru Huang
140
1
0
19 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
141
15
0
19 Aug 2024
SynTraC: A Synthetic Dataset for Traffic Signal Control from Traffic Monitoring Cameras
Tiejin Chen
Prithvi Shirke
Bharatesh Chakravarthi
Arpitsinh Vaghela
Longchao Da
Duo Lu
Yezhou Yang
Hua Wei
48
1
0
18 Aug 2024
Directed Exploration in Reinforcement Learning from Linear Temporal Logic
Marco Bagatella
Andreas Krause
Georg Martius
OffRL
86
1
0
18 Aug 2024
Vanilla Gradient Descent for Oblique Decision Trees
Subrat Prasad Panda
B. Genest
Arvind Easwaran
Ponnuthurai Nagaratnam Suganthan
OffRL
146
1
0
17 Aug 2024
Online Behavior Modification for Expressive User Control of RL-Trained Robots
Isaac S. Sheidlower
Mavis Murdock
Emma Bethel
Reuben M. Aronson
E. Short
OffRL
88
3
0
15 Aug 2024
Experimental evaluation of offline reinforcement learning for HVAC control in buildings
Jun Wang
Linyan Li
Qi Liu
Yu Yang
OffRL
AI4CE
48
1
0
15 Aug 2024
Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning
Homayoun Honari
Amir M. Soufi Enayati
Mehran Ghafarian Tamizi
Homayoun Najjaran
85
2
0
15 Aug 2024
q-exponential family for policy optimization
Lingwei Zhu
Haseeb Shah
Han Wang
Yukie Nagai
Martha White
OffRL
142
0
0
14 Aug 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
113
1
0
13 Aug 2024
GFlowNet Training by Policy Gradients
Puhua Niu
Shili Wu
Mingzhou Fan
Xiaoning Qian
145
3
0
12 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
135
2
0
11 Aug 2024
CURLing the Dream: Contrastive Representations for World Modeling in Reinforcement Learning
V. A. Kich
J. A. Bottega
Raul Steinmetz
Ricardo B. Grando
Ayano Yorozu
Akihisa Ohya
OffRL
99
0
0
11 Aug 2024
Cell Morphology-Guided Small Molecule Generation with GFlowNets
Stephen Zhewen Lu
Ziqing Lu
Ehsan Hajiramezanali
Tommaso Biancalani
Yoshua Bengio
Gabriele Scalia
Michał Koziarski
82
3
0
09 Aug 2024
F1tenth Autonomous Racing With Offline Reinforcement Learning Methods
Prajwal Koirala
Cody Fleming
OffRL
86
1
0
08 Aug 2024
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
Heewoong Choi
Sangwon Jung
Hongjoon Ahn
Taesup Moon
OffRL
122
4
0
08 Aug 2024
HDPlanner: Advancing Autonomous Deployments in Unknown Environments through Hierarchical Decision Networks
Jingsong Liang
Yuhong Cao
Yixiao Ma
Hanqi Zhao
Guillaume Sartoretti
69
2
0
07 Aug 2024
Hierarchical learning control for autonomous robots inspired by central nervous system
Pei Zhang
Zhaobo Hua
Jinliang Ding
77
0
0
07 Aug 2024
Achieving Human Level Competitive Robot Table Tennis
David B. D'Ambrosio
Saminda Abeyruwan
L. Graesser
Atil Iscen
H. B. Amor
...
Vikas Sindhwani
Vincent Vanhoucke
Grace Vesom
P. Xu
Pannag R Sanketi
184
15
0
07 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
132
9
0
06 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
133
0
0
05 Aug 2024
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
75
2
0
04 Aug 2024
Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer
Yu Yang
Pan Xu
VLM
OffRL
90
2
0
02 Aug 2024
Deep progressive reinforcement learning-based flexible resource scheduling framework for IRS and UAV-assisted MEC system
Li Dong
Luke Robinson
Minjie Wang
Daniele De Martini
Xiaolong Li
88
15
0
02 Aug 2024