Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,044 papers shown
Title
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
42
2
0
11 Jul 2024
Gradient Boosting Reinforcement Learning
Benjamin Fuhrer
Chen Tessler
Gal Dalal
OffRL
AI4CE
75
3
0
11 Jul 2024
RoboMorph: Evolving Robot Morphology using Large Language Models
Kevin Qiu
Krzysztof Ciebiera
Krzysztof Ciebiera
Marek Cygan
Marek Cygan
Łukasz Kuciński
LM&Ro
66
0
0
11 Jul 2024
Intercepting Unauthorized Aerial Robots in Controlled Airspace Using Reinforcement Learning
Francisco Giral
Ignacio Gómez
S. L. Clainche
29
0
0
09 Jul 2024
Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning
Augustine N. Mavor-Parker
Matthew J. Sargent
Caswell Barry
Lewis D. Griffin
Clare Lyle
59
2
0
09 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
51
3
0
09 Jul 2024
Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation
Detian Chu
Linyuan Bai
Jianuo Huang
Zhenlong Fang
Peng Zhang
Wei Kang
Haifeng Lin
106
2
0
08 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
59
0
0
08 Jul 2024
A Novel Bifurcation Method for Observation Perturbation Attacks on Reinforcement Learning Agents: Load Altering Attacks on a Cyber Physical Power System
Kiernan Broda-Milian
Ranwa Al-Mallah
H. Dagdougui
AAML
51
0
0
06 Jul 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
85
1
0
06 Jul 2024
Augmented Bayesian Policy Search
Mahdi Kallel
Debabrota Basu
R. Akrour
Carlo DÉramo
58
2
0
05 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
47
1
0
05 Jul 2024
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao
Shengjun Fang
Chenjun Xiao
Yang Yu
Zongzhang Zhang
OffRL
40
1
0
05 Jul 2024
Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning
I. Lee
Hoang-Giang Cao
Cong-Tinh Dao
Yu-Cheng Chen
I-Chen Wu
35
0
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
64
19
0
05 Jul 2024
Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement
Jiafan Zhuang
Zihao Xia
Gaofei Han
Boxi Wang
Wenji Li
Dongliang Wang
Zhifeng Hao
Ruichu Cai
Zhun Fan
CML
47
0
0
04 Jul 2024
ROER: Regularized Optimal Experience Replay
Changling Li
Zhang-Wei Hong
Pulkit Agrawal
Divyansh Garg
Joni Pajarinen
OffRL
60
1
0
04 Jul 2024
RobocupGym: A challenging continuous control benchmark in Robocup
Michael Beukman
Branden Ingram
Geraud Nangue Tasse
Benjamin Rosman
Pravesh Ranchod
OffRL
55
1
0
03 Jul 2024
Reinforcement Learning for Sequence Design Leveraging Protein Language Models
Jithendaraa Subramanian
Shivakanth Sujit
Niloy Irtisam
Umong Sain
Derek Nowrouzezahrai
Samira Ebrahimi Kahou
Riashat Islam
53
0
0
03 Jul 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
Asaf B. Cassel
Aviv A. Rosenberg
60
1
0
03 Jul 2024
Solving Motion Planning Tasks with a Scalable Generative Model
Yihan Hu
Siqi Chai
Zhening Yang
Jingyu Qian
Kun Li
Wenxin Shao
Haichao Zhang
Wei Xu
Qiang Liu
53
18
0
03 Jul 2024
Safe CoR: A Dual-Expert Approach to Integrating Imitation Learning and Safe Reinforcement Learning Using Constraint Rewards
Hyeokjin Kwon
Gunmin Lee
Junseo Lee
Songhwai Oh
59
0
0
02 Jul 2024
Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
Kazumi Kasaura
54
0
0
02 Jul 2024
RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing
Bo Ai
Stephen Tian
Haochen Shi
Yixuan Wang
Cheston Tan
Yunzhu Li
Jiajun Wu
72
12
0
01 Jul 2024
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
50
3
0
01 Jul 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
79
0
0
01 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
59
1
0
30 Jun 2024
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutiere
OffRL
28
2
0
30 Jun 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
33
0
0
30 Jun 2024
Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models
Sangwoong Yoon
Himchan Hwang
Dohyun Kwon
Yung-Kyun Noh
Frank C. Park
44
3
0
30 Jun 2024
Deep Reinforcement Learning Strategies in Finance: Insights into Asset Holding, Trading Behavior, and Purchase Diversity
Alireza Mohammadshafie
Akram Mirzaeinia
Haseebullah Jumakhan
Amir Mirzaeinia
AIFin
21
1
0
29 Jun 2024
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
Aicheng Gong
Kai Yang
Jiafei Lyu
Xiu Li
37
7
0
29 Jun 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
75
4
0
29 Jun 2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Gautham Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
40
2
0
29 Jun 2024
Operator World Models for Reinforcement Learning
P. Novelli
Marco Prattico
Massimiliano Pontil
C. Ciliberto
OffRL
71
0
0
28 Jun 2024
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
Bernd Frauenknecht
Ramil Sabirov
Sebastian Trimpe
OffRL
OnRL
92
3
0
28 Jun 2024
Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning
Nishesh Singh
Sidharth Ramesh
Abhishek Shankar
Jyotishka Duttagupta
Leander Stephen D'Souza
Sanjay Singh
23
0
0
27 Jun 2024
Combining Automated Optimisation of Hyperparameters and Reward Shape
Julian Dierkes
Emma Cramer
Holger Hoos
Sebastian Trimpe
51
1
0
26 Jun 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
43
0
0
26 Jun 2024
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
61
2
0
26 Jun 2024
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
69
1
0
26 Jun 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
72
7
0
25 Jun 2024
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
49
0
0
24 Jun 2024
Probabilistic Subgoal Representations for Hierarchical Reinforcement learning
V. Wang
Tinghuai Wang
Wenyan Yang
Joni-Kristian Kämäräinen
Joni Pajarinen
BDL
38
3
0
24 Jun 2024
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
Yuxin Chen
Chen Tang
Chenran Li
Ran Tian
Peter Stone
Masayoshi Tomizuka
Wei Zhan
31
1
0
24 Jun 2024
Position: Benchmarking is Limited in Reinforcement Learning Research
Scott M. Jordan
Adam White
Bruno Castro da Silva
Martha White
Philip S. Thomas
OffRL
31
6
0
23 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
68
3
0
23 Jun 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
40
1
0
21 Jun 2024
Learning Autonomous Race Driving with Action Mapping Reinforcement Learning
Yuanda Wang
Xin Yuan
Changyin Sun
52
1
0
21 Jun 2024
Using Multimodal Foundation Models and Clustering for Improved Style Ambiguity Loss
James Baker
DiffM
27
1
0
20 Jun 2024
Previous
1
2
3
...
12
13
14
...
79
80
81
Next