Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
100
1
0
05 Jul 2024
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao
Shengjun Fang
Chenjun Xiao
Yang Yu
Zongzhang Zhang
OffRL
62
1
0
05 Jul 2024
Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning
I. Lee
Hoang-Giang Cao
Cong-Tinh Dao
Yu-Cheng Chen
I-Chen Wu
66
0
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
182
26
0
05 Jul 2024
Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement
Jiafan Zhuang
Zihao Xia
Gaofei Han
Boxi Wang
Wenji Li
Dongliang Wang
Zhifeng Hao
Ruichu Cai
Zhun Fan
CML
102
0
0
04 Jul 2024
ROER: Regularized Optimal Experience Replay
Changling Li
Zhang-Wei Hong
Pulkit Agrawal
Divyansh Garg
Joni Pajarinen
OffRL
81
1
0
04 Jul 2024
RobocupGym: A challenging continuous control benchmark in Robocup
Michael Beukman
Branden Ingram
Geraud Nangue Tasse
Benjamin Rosman
Pravesh Ranchod
OffRL
90
2
0
03 Jul 2024
Reinforcement Learning for Sequence Design Leveraging Protein Language Models
Jithendaraa Subramanian
Shivakanth Sujit
Niloy Irtisam
Umong Sain
Derek Nowrouzezahrai
Samira Ebrahimi Kahou
Riashat Islam
71
0
0
03 Jul 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
Asaf B. Cassel
Aviv A. Rosenberg
96
1
0
03 Jul 2024
Solving Motion Planning Tasks with a Scalable Generative Model
Yihan Hu
Siqi Chai
Zhening Yang
Jingyu Qian
Kun Li
Wenxin Shao
Haichao Zhang
Wei Xu
Qiang Liu
100
21
0
03 Jul 2024
Safe CoR: A Dual-Expert Approach to Integrating Imitation Learning and Safe Reinforcement Learning Using Constraint Rewards
Hyeokjin Kwon
Gunmin Lee
Junseo Lee
Songhwai Oh
71
0
0
02 Jul 2024
Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
Kazumi Kasaura
137
0
0
02 Jul 2024
RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing
Bo Ai
Stephen Tian
Haochen Shi
Yixuan Wang
Cheston Tan
Yunzhu Li
Jiajun Wu
110
15
0
01 Jul 2024
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
163
3
0
01 Jul 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
153
0
0
01 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
80
1
0
30 Jun 2024
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutiere
OffRL
67
3
0
30 Jun 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
76
0
0
30 Jun 2024
Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models
Sangwoong Yoon
Himchan Hwang
Dohyun Kwon
Yung-Kyun Noh
Frank C. Park
94
3
0
30 Jun 2024
Deep Reinforcement Learning Strategies in Finance: Insights into Asset Holding, Trading Behavior, and Purchase Diversity
Alireza Mohammadshafie
Akram Mirzaeinia
Haseebullah Jumakhan
Amir Mirzaeinia
AIFin
28
1
0
29 Jun 2024
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
Aicheng Gong
Kai Yang
Jiafei Lyu
Xiu Li
71
9
0
29 Jun 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
117
6
0
29 Jun 2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Gautham Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
84
3
0
29 Jun 2024
Operator World Models for Reinforcement Learning
P. Novelli
Marco Prattico
Massimiliano Pontil
C. Ciliberto
OffRL
131
1
0
28 Jun 2024
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
Bernd Frauenknecht
Ramil Sabirov
Sebastian Trimpe
OffRL
OnRL
131
5
0
28 Jun 2024
Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning
Nishesh Singh
Sidharth Ramesh
Abhishek Shankar
Jyotishka Duttagupta
Leander Stephen D'Souza
Sanjay Singh
41
0
0
27 Jun 2024
Combining Automated Optimisation of Hyperparameters and Reward Shape
Julian Dierkes
Emma Cramer
Holger Hoos
Sebastian Trimpe
102
1
0
26 Jun 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
61
0
0
26 Jun 2024
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
78
2
0
26 Jun 2024
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
140
1
0
26 Jun 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
127
9
0
25 Jun 2024
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
85
0
0
24 Jun 2024
Probabilistic Subgoal Representations for Hierarchical Reinforcement learning
V. Wang
Tinghuai Wang
Wenyan Yang
Joni-Kristian Kämäräinen
Joni Pajarinen
BDL
62
4
0
24 Jun 2024
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
Yuxin Chen
Chen Tang
Chenran Li
Ran Tian
Peter Stone
Masayoshi Tomizuka
Wei Zhan
62
1
0
24 Jun 2024
Position: Benchmarking is Limited in Reinforcement Learning Research
Scott M. Jordan
Adam White
Bruno Castro da Silva
Martha White
Philip S. Thomas
OffRL
65
8
0
23 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
116
5
0
23 Jun 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
69
1
0
21 Jun 2024
Learning Autonomous Race Driving with Action Mapping Reinforcement Learning
Yuanda Wang
Xin Yuan
Changyin Sun
76
2
0
21 Jun 2024
Using Multimodal Foundation Models and Clustering for Improved Style Ambiguity Loss
James Baker
DiffM
29
1
0
20 Jun 2024
REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability
Shuang Ao
Simon Khan
Haris Aziz
Flora D. Salim
145
0
0
20 Jun 2024
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
Xinbo Zhao
Yingxue Zhang
Xin Zhang
Yu Yang
Yiqun Xie
Yanhua Li
Jun Luo
OffRL
78
2
0
20 Jun 2024
Equivariant Offline Reinforcement Learning
Arsh Tangri
Ondrej Biza
Dian Wang
David Klee
Owen Howell
Robert Platt
OffRL
122
4
0
20 Jun 2024
A Decision-Making GPT Model Augmented with Entropy Regularization for Autonomous Vehicles
Jiaqi Liu
Shiyu Fang
Xuekai Liu
Lulu Guo
Peng Hang
Jian Sun
74
3
0
20 Jun 2024
SRL-VIC: A Variable Stiffness-Based Safe Reinforcement Learning for Contact-Rich Robotic Tasks
Heng Zhang
Gokhan Solak
G. J. G. Lahr
Arash Ajoudani
62
14
0
19 Jun 2024
Improving GFlowNets with Monte Carlo Tree Search
Nikita Morozov
D. Tiapkin
S. Samsonov
Alexey Naumov
Dmitry Vetrov
107
2
0
19 Jun 2024
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
Trevor A. McInroe
Sam Devlin
Amos Storkey
OffRL
100
1
0
19 Jun 2024
Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning
Harry Robertshaw
Lennart Karstensen
Benjamin Jackson
Alejandro Granados
Thomas C Booth
51
7
0
18 Jun 2024
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
Menglong Zhang
Fuyuan Qian
Quanying Liu
96
1
0
18 Jun 2024
BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning
Yi Liu
Cong Wang
Lizhen Qu
AAML
99
3
0
18 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
Yingnan Zhao
118
1
0
17 Jun 2024
Previous
1
2
3
...
14
15
16
...
81
82
83
Next