Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
A State Representation for Diminishing Rewards
Ted Moskovitz
Samo Hromadka
Ahmed Touati
Diana Borsa
M. Sahani
55
2
0
07 Sep 2023
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Miguel Abreu
Luis Paulo Reis
Nuno Lau
124
6
0
06 Sep 2023
Efficient RL via Disentangled Environment and Agent Representations
Kevin Gmelin
Shikhar Bahl
Russell Mendonca
Deepak Pathak
DRL
73
9
0
05 Sep 2023
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
78
12
0
05 Sep 2023
Model-based Offline Policy Optimization with Adversarial Network
Junming Yang
Xingguo Chen
Shengyuan Wang
Bolei Zhang
OffRL
65
2
0
05 Sep 2023
A Survey on Physics Informed Reinforcement Learning: Review and Open Problems
C. Banerjee
Kien Nguyen
Clinton Fookes
M. Raissi
PINN
AI4CE
111
10
0
05 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Pulkit Katdare
Nan Jiang
Katherine Driggs-Campbell
OffRL
92
4
0
04 Sep 2023
Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning
Qisen Yang
Huanqian Wang
Mukun Tong
Wenjie Shi
Gao Huang
Shiji Song
72
5
0
04 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
79
8
0
04 Sep 2023
Deception Game: Closing the Safety-Learning Loop in Interactive Robot Autonomy
Haimin Hu
Zixu Zhang
Kensuke Nakamura
Andrea V. Bajcsy
J. F. Fisac
99
12
0
03 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAI
OffRL
100
17
0
02 Sep 2023
Parallel Distributional Prioritized Deep Reinforcement Learning for Unmanned Aerial Vehicles
A. H. Kolling
V. A. Kich
J. C. Jesus
Andressa Cavalcante da Silva
Ricardo B. Grando
Paulo L. J. Drews-Jr
D. T. Gamarra
77
3
0
01 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
107
14
0
31 Aug 2023
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
Alberto Dionigi
Simone Felicioni
Mirko Leomanni
G. Costante
76
10
0
31 Aug 2023
Multi-Objective Decision Transformers for Offline Reinforcement Learning
Abdelghani Ghanem
P. Ciblat
Mounir Ghogho
OffRL
63
1
0
31 Aug 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
100
0
0
31 Aug 2023
DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving
Yinda Xu
Lidong Yu
72
7
0
30 Aug 2023
Deep Inductive Logic Programming meets Reinforcement Learning
Andreas Bueff
Vaishak Belle
AI4CE
43
4
0
30 Aug 2023
Learning the References of Online Model Predictive Control for Urban Self-Driving
Yubin Wang
Zeng Peng
Yusen Xie
Yulin Li
Hakim Ghazzai
Jun Ma
81
0
0
30 Aug 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
64
4
0
29 Aug 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
115
7
0
29 Aug 2023
Stochastic Motion Planning as Gaussian Variational Inference: Theory and Algorithms
Hongzhe Yu
Yongxin Chen
98
3
0
29 Aug 2023
Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance
Mackenzie J. Meni
Ryan T. White
Michael L. Mayo
K. Pilkiewicz
BDL
78
6
0
28 Aug 2023
Tackling Diverse Minorities in Imbalanced Classification
Kwei-Herng Lai
Daochen Zha
Huiyuan Chen
M. Bendre
Yuzhong Chen
Mahashweta Das
Hao Yang
Helen Zhou
72
0
0
28 Aug 2023
Symmetric Models for Visual Force Policy Learning
Colin Kohler
A. S. Srikanth
Eshan Arora
Robert Platt
80
10
0
28 Aug 2023
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm
C. Bellinger
Laurence Lamarche-Cliche
81
0
0
28 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
220
13
0
28 Aug 2023
Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward
Taisuke Kobayashi
72
4
0
24 Aug 2023
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems
Ahmed Haj Yahmed
Rached Bouchoucha
Houssem Ben Braiek
Foutse Khomh
CLL
AI4CE
68
0
0
23 Aug 2023
A Survey on Large Language Model based Autonomous Agents
Lei Wang
Chengbang Ma
Xueyang Feng
Zeyu Zhang
Hao-ran Yang
...
Xu Chen
Yankai Lin
Wayne Xin Zhao
Zhewei Wei
Ji-Rong Wen
LLMAG
AI4CE
LM&Ro
208
1,333
0
22 Aug 2023
Careful at Estimation and Bold at Exploration
Xing Chen
Yijun Liu
Zhaogeng Liu
Hechang Chen
Hengshuai Yao
Yi-Ju Chang
21
0
0
22 Aug 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
80
7
0
22 Aug 2023
A Safe Deep Reinforcement Learning Approach for Energy Efficient Federated Learning in Wireless Communication Networks
Nikolaos Koursioumpas
Lina Magoula
Nikolaos Petropouleas
Alexandros-Ioannis Thanopoulos
Theodora Panagea
Nancy Alonistioti
M. A. Gutierrez-Estevez
R. Khalili
69
1
0
21 Aug 2023
RL-LABEL: A Deep Reinforcement Learning Approach Intended for AR Label Placement in Dynamic Scenarios
Zhutian Chen
Daniele Chiappalupi
Tica Lin
Yalong Yang
Johanna Beyer
Hanspeter Pfister
OffRL
97
5
0
20 Aug 2023
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL
Ye Zhang
Jian Sun
G. Wang
Zhuoxian Li
Wei Chen
OffRL
51
0
0
20 Aug 2023
DoCRL: Double Critic Deep Reinforcement Learning for Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Medium Transition
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
R. S. Guerra
Paulo L. J. Drews-Jr
85
4
0
18 Aug 2023
Integrating Expert Guidance for Efficient Learning of Safe Overtaking in Autonomous Driving Using Deep Reinforcement Learning
Jinxiong Lu
G. Alcan
Ville Kyrki
25
2
0
18 Aug 2023
Impression-Aware Recommender Systems
F. B. P. Maurera
Maurizio Ferrari Dacrema
P. Castells
Paolo Cremonesi
AI4TS
78
2
0
15 Aug 2023
Learning to Identify Critical States for Reinforcement Learning from Videos
Haozhe Liu
Mingchen Zhuge
Bing Li
Yu‐Han Wang
Francesco Faccio
Guohao Li
Jürgen Schmidhuber
OffRL
82
10
0
15 Aug 2023
Hierarchical generative modelling for autonomous robots
Kai Yuan
Noor Sajid
Karl J. Friston
Zhibin Li
63
14
0
15 Aug 2023
RL-based Variable Horizon Model Predictive Control of Multi-Robot Systems using Versatile On-Demand Collision Avoidance
Shreyash Gupta
Abhinav Kumar
N. S. Tripathy
S. Shah
51
0
0
14 Aug 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
85
1
0
14 Aug 2023
Value-Distributional Model-Based Reinforcement Learning
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
OffRL
70
4
0
12 Aug 2023
Reinforcement Logic Rule Learning for Temporal Point Processes
Chao Yang
Lu Wang
Kun Gao
Shuang Li
AI4TS
35
0
0
11 Aug 2023
Learning Team-Based Navigation: A Review of Deep Reinforcement Learning Techniques for Multi-Agent Pathfinding
Jaeho Chung
Jamil Fayyad
Younes Al Younes
Homayoun Najjaran
80
17
0
11 Aug 2023
A Smart Robotic System for Industrial Plant Supervision
D. A. Gómez-Rosal
M. Bergau
G. Fischer
Andreas Wachaja
Johannes Grater
...
Nikhil Gosala
Niklas Wetzel
Daniel Buscher
Abhinav Valada
Wolfram Burgard
67
2
0
10 Aug 2023
RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation
Chang Nie
Guangming Wang
Yanfeng Guo
Luca Cavalli
Marc Pollefeys
Hesheng Wang
OffRL
59
4
0
10 Aug 2023
Improving Autonomous Separation Assurance through Distributed Reinforcement Learning with Attention Networks
Marc Brittain
Luis E. Alvarez
Kara Breeden
25
5
0
09 Aug 2023
Actor-Critic with variable time discretization via sustained actions
Jakub Lyskawa
Pawel Wawrzyñski
OffRL
28
0
0
08 Aug 2023
Synthesizing Programmatic Policies with Actor-Critic Algorithms and ReLU Networks
S. Orfanos
Levi H. S. Lelis
55
6
0
04 Aug 2023
Previous
1
2
3
...
29
30
31
...
81
82
83
Next