Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,044 papers shown
Title
Adaptive Motion Planning for Multi-fingered Functional Grasp via Force Feedback
Dongying Tian
Xiangbo Lin
Yi Sun
49
3
0
22 Jan 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
59
9
0
22 Jan 2024
Efficient and Generalized end-to-end Autonomous Driving System with Latent Deep Reinforcement Learning and Demonstrations
Zuojin Tang
Xiaoyu Chen
YongQiang Li
Jianyu Chen
49
2
0
22 Jan 2024
Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning
Ge Li
Hongyi Zhou
Dominik Roth
Serge Thilges
Fabian Otto
Rudolf Lioutikov
Gerhard Neumann
OffRL
38
7
0
21 Jan 2024
Visual Imitation Learning with Calibrated Contrastive Representation
Yunke Wang
Linwei Tao
Bo Du
Yutian Lin
Chang Xu
33
0
0
21 Jan 2024
Asynchronous Parallel Reinforcement Learning for Optimizing Propulsive Performance in Fin Ray Control
Xin-Yang Liu
Dariush Bodaghi
Q. Xue
Xudong Zheng
Jian-Xun Wang
66
0
0
21 Jan 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi
Shuo Chen
Yexin Li
Xiangyu Kong
Junqi Wang
...
Zhaowei Zhang
Nian Liu
Wei Wang
Yaodong Yang
Song-Chun Zhu
AI4CE
LRM
56
20
0
19 Jan 2024
FREED++: Improving RL Agents for Fragment-Based Molecule Generation by Thorough Reproduction
Alexander Telepov
Artem Tsypin
Kuzma Khrabrov
Sergey Yakukhnov
Pavel Strashnov
...
Egor Rumiantsev
Daniel Ezhov
Manvel Avetisian
Olga Popova
Artur Kadurin
36
4
0
18 Jan 2024
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
45
15
0
18 Jan 2024
Deployable Reinforcement Learning with Variable Control Rate
Dong Wang
Giovanni Beltrame
47
4
0
17 Jan 2024
Autonomous Catheterization with Open-source Simulator and Expert Trajectory
Tudor Jianu
Baoru Huang
Tuan V. Vo
M. Vu
Jingxuan Kang
Hoan Nguyen
O. Omisore
Pierre Berthet-Rayne
S. Fichera
Anh Nguyen
47
7
0
17 Jan 2024
Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback
Teng Xiao
Suhang Wang
OffRL
48
8
0
17 Jan 2024
Open RAN LSTM Traffic Prediction and Slice Management using Deep Reinforcement Learning
Fatemeh Lotfi
Fatemeh Afghah
AI4TS
39
7
0
12 Jan 2024
Identifying Policy Gradient Subspaces
Jan Schneider-Barnes
Pierre Schumacher
Simon Guist
Tianyu Cui
Daniel Haeufle
Bernhard Scholkopf
Le Chen
49
6
0
12 Jan 2024
Spatial-Aware Deep Reinforcement Learning for the Traveling Officer Problem
Niklas Strauß
Matthias Schubert
38
0
0
11 Jan 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
OffRL
51
2
0
11 Jan 2024
An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control
Antonio Manjavacas
Alejandro Campoy-Nieves
Javier Jiménez Raboso
Miguel Molina-Solana
Juan Gómez-Romero
AI4CE
39
9
0
11 Jan 2024
ReACT: Reinforcement Learning for Controller Parametrization using B-Spline Geometries
Thomas Rudolf
Daniel Flögel
Tobias Schürmann
Simon Süß
S. Schwab
Sören Hohmann
AI4CE
49
1
0
10 Jan 2024
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
Zhaohui Jiang
Paul Weng
OffRL
42
0
0
10 Jan 2024
Autonomous Navigation of Tractor-Trailer Vehicles through Roundabout Intersections
Daniel Attard
Josef Bajada
27
2
0
10 Jan 2024
Fully Spiking Actor Network with Intra-layer Connections for Reinforcement Learning
Ding Chen
Peixi Peng
Tiejun Huang
Yonghong Tian
46
6
0
09 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
59
100
0
08 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
57
10
0
06 Jan 2024
Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process
Zhenan Fan
Bissan Ghaddar
Xinglu Wang
Linzi Xing
Yong Zhang
Zirui Zhou
AI4CE
66
11
0
06 Jan 2024
Interpreting Adaptive Gradient Methods by Parameter Scaling for Learning-Rate-Free Optimization
Min-Kook Suh
Seung-Woo Seo
ODL
41
0
0
06 Jan 2024
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang
Zihao Sheng
Chengyuan Ma
Sikai Chen
29
30
0
06 Jan 2024
Deep Reinforcement Learning for Local Path Following of an Autonomous Formula SAE Vehicle
Harvey Merton
Thomas Delamore
Karl Stol
Henry Williams
38
0
0
05 Jan 2024
XUAT-Copilot: Multi-Agent Collaborative System for Automated User Acceptance Testing with Large Language Model
Zhitao Wang
Wei Wang
Zirao Li
Long Wang
Can Yi
Xinjie Xu
Luyang Cao
Hanjing Su
Shouzhi Chen
Jun Zhou
ALM
LLMAG
37
8
0
05 Jan 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
60
3
0
30 Dec 2023
Exploring Deep Reinforcement Learning for Robust Target Tracking using Micro Aerial Vehicles
Alberto Dionigi
Mirko Leomanni
Alessandro Saviolo
Giuseppe Loianno
G. Costante
48
2
0
29 Dec 2023
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e
Chenwei Xu
Jerry Yao-Chieh Hu
A. Narayanan
M. Thieme
V. Nagaslaev
...
Rui Shi
S. Memik
A. Shuping
Kyle Hazelwood
Han Liu
32
2
0
28 Dec 2023
Gradient-based Planning with World Models
V. JyothirS
Siddhartha Jalagam
Yann LeCun
Vlad Sobal
48
4
0
28 Dec 2023
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?
Gunshi Gupta
Tim G. J. Rudner
R. McAllister
Adrien Gaidon
Y. Gal
OffRL
64
3
0
28 Dec 2023
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
46
3
0
27 Dec 2023
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
Jingpu Yang
Helin Wang
Qirui Zhao
Zhecheng Shi
Zirui Song
Miao Fang
31
0
0
26 Dec 2023
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Renzhe Zhou
Chenxiao Gao
Zongzhang Zhang
Yang Yu
OffRL
52
11
0
26 Dec 2023
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Wenzhang Liu
Wenzhe Cai
Kun Jiang
Guangran Cheng
Yuanda Wang
Changyin Sun
Jingyu Cao
Lele Xu
Chaoxu Mu
Changyin Sun
39
4
0
25 Dec 2023
BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
Lin Sun
Weijun Wang
Tingting Yuan
Liang Mi
Haipeng Dai
Yunxin Liu
Xiaoming Fu
31
4
0
25 Dec 2023
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRL
VLM
48
0
0
25 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
35
0
0
24 Dec 2023
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning
Bram Grooten
Tristan Tomilin
Gautham Vasan
Matthew E. Taylor
A. R. Mahmood
Meng Fang
Mykola Pechenizkiy
Decebal Constantin Mocanu
47
8
0
23 Dec 2023
Human-Centric Resource Allocation for the Metaverse With Multiaccess Edge Computing
Zijian Long
Haiwei Dong
Abdulmotaleb El Saddik
20
18
0
23 Dec 2023
Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism
Seyed soroush Karimi madahi
Bert Claessens
Chris Develder
42
3
0
23 Dec 2023
An investigation of belief-free DRL and MCTS for inspection and maintenance planning
Daniel Koutas
E. Bismut
Daniel Straub
26
2
0
22 Dec 2023
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He
Kai Li
Yifan Zang
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
MoE
41
5
0
22 Dec 2023
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles
Ruoqi Wen
Jiahao Huang
Rongpeng Li
Guoru Ding
Zhifeng Zhao
42
1
0
21 Dec 2023
Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator
Zichun Xu
Yuntao Li
Xiaohang Yang
Zhiyuan Zhao
Zhuang Lei
Jingdong Zhao
54
2
0
21 Dec 2023
Safe Multi-Agent Reinforcement Learning for Formation Control without Individual Reference Targets
Murad Dawood
Sicong Pan
Nils Dengler
Siqi Zhou
Angela P. Schoellig
Maren Bennewitz
OffRL
61
3
0
20 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
48
17
0
20 Dec 2023
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
61
6
0
19 Dec 2023
Previous
1
2
3
...
21
22
23
...
79
80
81
Next