Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,130 papers shown
Title
EVAN: Evolutional Video Streaming Adaptation via Neural Representation
Mufan Liu
Le Yang
Yiling Xu
Ye-Kui Wang
Lei Li
35
4
0
15 Apr 2024
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Fangwei Zhong
Kui Wu
Hai Ci
Churan Wang
Hao Chen
OffRL
103
5
0
15 Apr 2024
Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning
Tidiane Camaret Ndir
André Biedenkapp
Noor H. Awad
OffRL
85
1
0
15 Apr 2024
Hierarchical Decision Making Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
77
0
0
15 Apr 2024
Adversarial Imitation Learning via Boosting
Jonathan D. Chang
Dhruv Sreenivas
Yingbing Huang
Kianté Brantley
Wen Sun
58
3
0
12 Apr 2024
Monte Carlo Tree Search with Boltzmann Exploration
Michael Painter
Mohamed Baioumy
Nick Hawes
Bruno Lacerda
33
5
0
11 Apr 2024
Enhancing Policy Gradient with the Polyak Step-Size Adaption
Yunxiang Li
Rui Yuan
Chen Fan
Mark Schmidt
Samuel Horváth
Robert Mansel Gower
Martin Takávc
72
0
0
11 Apr 2024
Generative Probabilistic Planning for Optimizing Supply Chain Networks
Hyung-il Ahn
Santiago Olivar
Hershel Mehta
Young Chol Song
69
0
0
11 Apr 2024
Collaborative Ground-Space Communications via Evolutionary Multi-objective Deep Reinforcement Learning
Jiahui Li
Geng Sun
Qingqing Wu
Dusit Niyato
Jiawen Kang
Abbas Jamalipour
Victor C. M. Leung
89
26
0
11 Apr 2024
AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic Agent
Tongzhou Mu
Yijie Guo
Jie Xu
Ankit Goyal
Hao Su
Dieter Fox
Animesh Garg
LM&Ro
111
0
0
11 Apr 2024
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection
L. Nasvytis
Kai Sandbrink
Jakob N. Foerster
Tim Franzmeyer
Christian Schroeder de Witt
OffRL
OODD
53
9
0
10 Apr 2024
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control
Zeno Woywood
Jasper I. Wiltfang
Julius Luy
Tobias Enders
Maximilian Schiffer
74
3
0
10 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
124
8
0
09 Apr 2024
Efficient Multi-Task Reinforcement Learning via Task-Specific Action Correction
Jinyuan Feng
Min Chen
Zhiqiang Pu
Tenghai Qiu
Jianqiang Yi
80
2
0
09 Apr 2024
Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning
Bo Lin
Yangzheng Zhong
Weiqing Ren
50
0
0
08 Apr 2024
Human-Machine Interaction in Automated Vehicles: Reducing Voluntary Driver Intervention
Xinzhi Zhong
Yang Zhou
Varshini Kamaraj
Zhenhao Zhou
Wissam Kontar
Dan Negrut
John D. Lee
Soyoung Ahn
26
1
0
08 Apr 2024
Percentile Criterion Optimization in Offline Reinforcement Learning
Elita Lobo
Cyrus Cousins
Yair Zick
Marek Petrik
OffRL
102
1
0
07 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Tongzheng Ren
Bo Dai
Na Li
84
1
0
07 Apr 2024
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
Xudong Yu
Chenjia Bai
Haoran He
Changhong Wang
Xuelong Li
122
6
0
07 Apr 2024
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning
Yeda Song
Dongwook Lee
Gunhee Kim
OffRL
64
1
0
06 Apr 2024
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Corby Rosset
Ching-An Cheng
Arindam Mitra
Michael Santacroce
Ahmed Hassan Awadallah
Tengyang Xie
209
132
0
04 Apr 2024
AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale
Adam Pardyl
Michal Wronka
Maciej Wolczyk
Kamil Adamczewski
Tomasz Trzciñski
Bartosz Zieliñski
85
2
0
04 Apr 2024
REACT: Revealing Evolutionary Action Consequence Trajectories for Interpretable Reinforcement Learning
Philipp Altmann
Céline Davignon
Maximilian Zorn
Fabian Ritz
Claudia Linnhoff-Popien
Thomas Gabor
45
1
0
04 Apr 2024
Benchmarking Population-Based Reinforcement Learning across Robotic Tasks with GPU-Accelerated Simulation
Asad Ali Shahid
Yashraj S. Narang
Vincenzo Petrone
Enrico Ferrentino
Ankur Handa
Dieter Fox
Marco Pavone
L. Roveda
108
3
0
04 Apr 2024
Distributionally Robust Policy and Lyapunov-Certificate Learning
Kehan Long
Jorge Cortés
Nikolay Atanasov
102
3
0
03 Apr 2024
Unsupervised Learning of Effective Actions in Robotics
Marko Zaric
Jakob J. Hollenstein
J. Piater
Erwan Renaudo
29
0
0
03 Apr 2024
SliceIt! -- A Dual Simulator Framework for Learning Robot Food Slicing
C. C. Beltran-Hernandez
Nicolas Erbetti
Masashi Hamaya
78
5
0
03 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
85
0
0
03 Apr 2024
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
Jonathan C. Balloch
Rishav Bhagat
Geigh Zollicoffer
Ruoran Jia
Julia Kim
Mark O. Riedl
OffRL
87
1
0
02 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
95
0
0
02 Apr 2024
Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
Eric M. S. P. Veith
Torben Logemann
Aleksandr Berezin
Arlena Wellßow
Stephan Balduin
84
2
0
02 Apr 2024
Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee
Ukcheol Shin
Byeong-uk Lee
75
4
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
128
0
0
02 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
92
0
0
31 Mar 2024
Thin-Shell Object Manipulations With Differentiable Physics Simulations
Yian Wang
Juntian Zheng
Zhehuan Chen
Zhou Xian
Gu Zhang
Chao Liu
Chuang Gan
92
5
0
30 Mar 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
137
71
0
30 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
331
2
0
29 Mar 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
94
7
0
28 Mar 2024
Exploiting Symmetry in Dynamics for Model-Based Reinforcement Learning with Asymmetric Rewards
Yasin Sonmez
Neelay Junnarkar
Murat Arcak
77
1
0
27 Mar 2024
CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning
Elliot Chane-Sane
Pierre-Alexandre Leziart
T. Flayols
O. Stasse
Philippe Souères
Nicolas Mansard
131
10
0
27 Mar 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
94
2
0
26 Mar 2024
VDSC: Enhancing Exploration Timing with Value Discrepancy and State Counts
Marius Captari
Remo Sasso
M. Sabatelli
33
0
0
26 Mar 2024
Exploring CausalWorld: Enhancing robotic manipulation via knowledge transfer and curriculum learning
Xinrui Wang
Yan Jin
102
2
0
25 Mar 2024
Bridging the Sim-to-Real Gap with Bayesian Inference
Jonas Rothfuss
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
AI4CE
98
4
0
25 Mar 2024
Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm
Titouan Renard
Andreas Schlaginhaufen
Tingting Ni
Maryam Kamgarpour
89
1
0
25 Mar 2024
DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation
Mu-Yi Shen
Chia-Chi Hsu
Hao-Yu Hou
Yu-Chen Huang
Wei-Fang Sun
Chia-Che Chang
Yu-Lun Liu
Chun-Yi Lee
116
3
0
23 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies
N. Botteghi
Urban Fasel
AI4CE
108
6
0
22 Mar 2024
A Twin Delayed Deep Deterministic Policy Gradient Algorithm for Autonomous Ground Vehicle Navigation via Digital Twin Perception Awareness
K. Olayemi
Mien Van
Seán F. McLoone
Yuzhu Sun
Jack Close
Minh-Nhat Nguyen
Stephen McIlvanna
88
3
0
22 Mar 2024
Boundary-Aware Value Function Generation for Safe Stochastic Motion Planning
Junhong Xu
Kai-Li Yin
Jason M. Gregory
Kris Hauser
Lantao Liu
92
2
0
22 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
93
0
0
21 Mar 2024
Previous
1
2
3
...
19
20
21
...
81
82
83
Next