Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning
Hector Kohler
Quentin Delfosse
R. Akrour
Kristian Kersting
Philippe Preux
141
16
0
23 May 2024
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences
Takuya Hiraoka
Guanquan Wang
Takashi Onishi
Yoshimasa Tsuruoka
109
0
0
23 May 2024
Reinforcing Language Agents via Policy Optimization with Action Decomposition
Muning Wen
Bo Liu
Weinan Zhang
Jun Wang
Ying Wen
82
10
0
23 May 2024
A Unification Between Deep-Learning Vision, Compartmental Dynamical Thermodynamics, and Robotic Manipulation for a Circular Economy
Federico Zocco
W. Haddad
Andrea Corti
Monica Malvezzi
94
5
0
23 May 2024
Doubly-Dynamic ISAC Precoding for Vehicular Networks: A Constrained Deep Reinforcement Learning (CDRL) Approach
Zonghui Yang
Shijian Gao
Xiang Cheng
18
3
0
23 May 2024
Variational Delayed Policy Optimization
Qingyuan Wu
S. Zhan
Yixuan Wang
Yuhui Wang
Chung-Wei Lin
Chen Lv
Qi Zhu
Chao Huang
OffRL
86
5
0
23 May 2024
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks
Jingchi Jiang
Rujia Shen
Boran Wang
Yi Guan
OffRL
BDL
87
1
0
23 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
113
3
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
355
54
0
23 May 2024
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles
Jiawei Zhang
Chejian Xu
Yue Liu
121
48
0
22 May 2024
Attention as an RNN
Leo Feng
Frederick Tung
Hossein Hajimirsadeghi
Mohamed Osama Ahmed
Yoshua Bengio
Greg Mori
GNN
AI4TS
105
8
0
22 May 2024
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
Chen-Hao Chao
Chien Feng
Wei-Fang Sun
Cheng-Kuang Lee
Simon See
Chun-Yi Lee
88
5
0
22 May 2024
Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training
Zhiyuan Wang
Bokui Chen
Xiaoyang Qu
Zhenhou Hong
Jing Xiao
Jianzong Wang
87
0
0
22 May 2024
Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing
Amutheezan Sivagnanam
Ava Pettet
Hunter Lee
Ayan Mukhopadhyay
Abhishek Dubey
Aron Laszka
124
0
0
21 May 2024
Deep Reinforcement Learning for Time-Critical Wilderness Search And Rescue Using Drones
Jan‐Hendrik Ewers
David Anderson
Douglas G. Thomson
93
5
0
21 May 2024
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?
Yang Dai
Oubo Ma
Longfei Zhang
Xingxing Liang
Shengchao Hu
Mengzhu Wang
Shouling Ji
Jincai Huang
Li Shen
Mamba
111
6
0
20 May 2024
Reward-Punishment Reinforcement Learning with Maximum Entropy
Jiexin Wang
E. Uchibe
58
0
0
20 May 2024
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
90
2
0
20 May 2024
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang
Boyuan Zheng
Anqi Guo
Tianying Ji
Anqi Guo
Junqiao Zhao
Lanqing Li
OffRL
145
0
0
20 May 2024
URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images
Zoey Chen
Aaron Walsman
Marius Memmel
Kaichun Mo
Alex Fang
Karthikeya Vemuri
Alan Wu
Dieter Fox
Abhishek Gupta
AI4CE
VGen
141
32
0
19 May 2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
64
1
0
19 May 2024
PDE Control Gym: A Benchmark for Data-Driven Boundary Control of Partial Differential Equations
Luke Bhan
Yuexin Bian
Miroslav Krstic
Yuanyuan Shi
OOD
AI4CE
74
6
0
18 May 2024
An Efficient Learning Control Framework With Sim-to-Real for String-Type Artificial Muscle-Driven Robotic Systems
Jiyue Tao
Yunsong Zhang
Sunil Kumar Rajendran
Feitian Zhang
206
0
0
17 May 2024
Reinforcement learning
Florentin Wörgötter
137
2,528
0
16 May 2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Yuexiang Zhai
Hao Bai
Zipeng Lin
Jiayi Pan
Shengbang Tong
...
Alane Suhr
Saining Xie
Yann LeCun
Yi-An Ma
Sergey Levine
LLMAG
LRM
143
81
0
16 May 2024
Reinformer: Max-Return Sequence Modeling for Offline RL
Zifeng Zhuang
Dengyun Peng
Jinxin Liu
Ziqi Zhang
Donglin Wang
OffRL
AI4TS
111
14
0
14 May 2024
Enhancing Reinforcement Learning in Sensor Fusion: A Comparative Analysis of Cubature and Sampling-based Integration Methods for Rover Search Planning
Jan‐Hendrik Ewers
S. Swinton
David Anderson
E. McGookin
Douglas G. Thomson
112
2
0
14 May 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
102
4
0
14 May 2024
Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning
M. Khan
Syed Hammad Ahmed
G. Sukthankar
73
0
0
14 May 2024
Adaptive Exploration for Data-Efficient General Value Function Evaluations
Arushi Jain
Josiah P. Hanna
Doina Precup
66
2
0
13 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
96
2
0
13 May 2024
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf B. Cassel
Haipeng Luo
Aviv A. Rosenberg
Dmitry Sotnikov
OffRL
83
4
0
13 May 2024
OpenBot-Fleet: A System for Collective Learning with Real Robots
Matthias M¨uller
Samarth Brahmbhatt
Ankur Deka
Quentin Leboutet
David Hafner
V. Koltun
87
0
0
13 May 2024
Intrinsic Rewards for Exploration without Harm from Observational Noise: A Simulation Study Based on the Free Energy Principle
Theodore Jerome Tinker
Kenji Doya
Jun Tani
61
0
0
13 May 2024
DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Yang Jin
Jun Lv
Shuqiang Jiang
Cewu Lu
130
1
0
12 May 2024
Learning Reward for Robot Skills Using Large Language Models via Self-Alignment
Yuwei Zeng
Yao Mu
Lin Shao
96
13
0
12 May 2024
Stealthy Imitation: Reward-guided Environment-free Policy Stealing
Zhixiong Zhuang
Maria-Irina Nicolae
Mario Fritz
AAML
80
1
0
11 May 2024
Semi-supervised Anomaly Detection via Adaptive Reinforcement Learning-Enabled Method with Causal Inference for Sensor Signals
Xiangwei Chen
Ruliang Xiao
Zhixia Zeng
Zhipeng Qiu
Shi Zhang
Xin Du
79
0
0
11 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
103
5
0
10 May 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui
Aryan Deshwal
Trong Nghia Hoang
J. Doppa
OffRL
113
14
0
08 May 2024
Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning
Haobin Zhang
Zhuang Yang
70
0
0
08 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
106
1
0
07 May 2024
Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies
Paul Templier
Emmanuel Rachelson
Antoine Cully
Dennis G. Wilson
49
0
0
07 May 2024
Improving Offline Reinforcement Learning with Inaccurate Simulators
Yiwen Hou
Haoyuan Sun
Jinming Ma
Feng Wu
OffRL
75
6
0
07 May 2024
Logic-Skill Programming: An Optimization-based Approach to Sequential Skill Planning
Teng Xue
Amirreza Razmjoo
Suhan Shetty
Sylvain Calinon
121
4
0
07 May 2024
Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning
Stone Tao
Arth Shukla
Tse-kai Chan
Hao Su
OffRL
86
6
0
06 May 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
87
4
0
06 May 2024
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
Zelei Cheng
Xian Wu
Jiahao Yu
Sabrina Yang
Gang Wang
Xinyu Xing
OffRL
83
6
0
05 May 2024
Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints
Siow Meng Low
Akshat Kumar
OffRL
86
0
0
05 May 2024
Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization
Youbang Sun
Tao-Wen Liu
P. R. Kumar
Shahin Shahrampour
72
1
0
04 May 2024
Previous
1
2
3
...
17
18
19
...
81
82
83
Next