Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 1,669 papers shown
Title
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
36
8
0
02 Feb 2024
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
Chia-Cheng Chiang
Li-Cheng Lan
Wei-Fang Sun
Chien Feng
Cho-Jui Hsieh
Chun-Yi Lee
46
0
0
01 Feb 2024
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
Ryoma Furuyama
Daiki Kuyoshi
Satoshi Yamane
23
0
0
30 Jan 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
48
3
0
30 Jan 2024
Attentive Convolutional Deep Reinforcement Learning for Optimizing Solar-Storage Systems in Real-Time Electricity Markets
Jinhao Li
Changlong Wang
Hao Wang
11
3
0
29 Jan 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL
OnRL
39
41
0
29 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
42
2
0
26 Jan 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
46
0
0
24 Jan 2024
Discovering Mathematical Formulas from Data via GPT-guided Monte Carlo Tree Search
Yanjie Li
Weijun Li
Lina Yu
Min Wu
Jingyi Liu
Wenqiang Li
Meilan Hao
Shu Wei
Yusong Deng
53
9
0
24 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots
Suning Huang
Boyuan Chen
Huazhe Xu
Vincent Sitzmann
44
3
0
24 Jan 2024
A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants
Yixuan Sun
Sami Khairy
Richard B. Vilim
Rui Hu
Akshay J. Dave
29
2
0
23 Jan 2024
Adaptive Motion Planning for Multi-fingered Functional Grasp via Force Feedback
Dongying Tian
Xiangbo Lin
Yi Sun
31
3
0
22 Jan 2024
Efficient and Generalized end-to-end Autonomous Driving System with Latent Deep Reinforcement Learning and Demonstrations
Zuojin Tang
Xiaoyu Chen
YongQiang Li
Jianyu Chen
31
2
0
22 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
51
96
0
08 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
29
10
0
06 Jan 2024
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang
Zihao Sheng
Chengyuan Ma
Sikai Chen
24
30
0
06 Jan 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
38
3
0
30 Dec 2023
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
41
3
0
27 Dec 2023
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Wenzhang Liu
Wenzhe Cai
Kun Jiang
Guangran Cheng
Yuanda Wang
Changyin Sun
Jingyu Cao
Lele Xu
Chaoxu Mu
Changyin Sun
39
4
0
25 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
31
0
0
24 Dec 2023
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles
Ruoqi Wen
Jiahao Huang
Rongpeng Li
Guoru Ding
Zhifeng Zhao
42
1
0
21 Dec 2023
Solving the swing-up and balance task for the Acrobot and Pendubot with SAC
Chi Zhang
Akhil Sathuluri
Markus Zimmermann
31
3
0
18 Dec 2023
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
Wanying Wang
Yichen Zhu
Yirui Zhou
Yaxin Peng
Jian Tang
Zhiyuan Xu
Chaomin Shen
Yangchun Zhang
39
4
0
18 Dec 2023
Aligning Human Intent from Imperfect Demonstrations with Confidence-based Inverse soft-Q Learning
Xizhou Bu
Wenjuan Li
Zhengxiong Liu
Zhiqiang Ma
Panfeng Huang
27
1
0
18 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
33
10
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
34
2
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
40
8
0
15 Dec 2023
Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles
Xiaoxue Yu
Rongpeng Li
Chengchao Liang
Zhifeng Zhao
38
0
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
32
2
0
11 Dec 2023
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
45
1
0
09 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Yuanyuan Guo
Zehua Zang
Hang Gao
Xiao Xu
Rui Wang
Lixiang Liu
Jiangmeng Li
39
6
0
08 Dec 2023
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric Hanchen Jiang
Andrew Lizarraga
34
0
0
06 Dec 2023
Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning
Pathmanathan Pankayaraj
Natalia Díaz Rodríguez
Javier Del Ser
CLL
OffRL
43
0
0
05 Dec 2023
BenchMARL: Benchmarking Multi-Agent Reinforcement Learning
Matteo Bettini
Amanda Prorok
Vincent Moens
OffRL
49
15
0
03 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
32
19
0
01 Dec 2023
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications
Jun Wang
Hosein Hasanbeig
Kaiyuan Tan
Zihe Sun
Y. Kantaros
40
3
0
28 Nov 2023
Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning
Pouya Parsa
Raoof Zare Moayedi
Mohammad Bornosi
Mohammad Mahdi Bejani
27
0
0
25 Nov 2023
Offline Skill Generalization via Task and Motion Planning
Shin Watanabe
Geir Horn
J. Tørresen
K. Ellefsen
OffRL
35
0
0
24 Nov 2023
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi Ma
Sergey Levine
OffRL
38
15
0
21 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
50
4
0
20 Nov 2023
Decentralized Energy Marketplace via NFTs and AI-based Agents
Rasoul Nikbakht
Farhana Javed
Farhad Rezazadeh
N. Bartzoudis
J. Mangues-Bafalluy
25
1
0
17 Nov 2023
Interpretable Reinforcement Learning for Robotics and Continuous Control
Rohan R. Paleja
Letian Chen
Yaru Niu
Andrew Silva
Zhaoxin Li
...
K. Chang
H. E. Tseng
Yan Wang
S. Nageshrao
Matthew C. Gombolay
44
7
0
16 Nov 2023
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Andrew Zhao
Erle Zhu
Rui Lu
Matthieu Lin
Yong-Jin Liu
Gao Huang
SSL
39
1
0
16 Nov 2023
Differentiable Cloth Parameter Identification and State Estimation in Manipulation
Dongzhe Zheng
Siqiong Yao
Wenqiang Xu
Cewu Lu
30
5
0
09 Nov 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
46
2
0
09 Nov 2023
Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula
Aryaman Reddi
Maximilian Tölle
Jan Peters
Georgia Chalvatzaki
Carlo DÉramo
47
4
0
03 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
M. Gerstgrasser
Tom Danino
Sarah Keren
34
5
0
01 Nov 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
43
7
0
30 Oct 2023
Robot Control based on Motor Primitives -- A Comparison of Two Approaches
Moses C. Nah
Johannes Lachner
Neville Hogan
26
3
0
28 Oct 2023
Previous
1
2
3
...
7
8
9
...
32
33
34
Next