Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Empirical Design in Reinforcement Learning
Andrew Patterson
Samuel Neumann
Martha White
Adam White
112
30
0
03 Apr 2023
Generative Adversarial Neuroevolution for Control Behaviour Imitation
Maximilien Le Clei
Pierre C. Bellec
47
0
0
03 Apr 2023
Neuroevolution of Recurrent Architectures on Control Tasks
Maximilien Le Clei
Pierre C. Bellec
28
4
0
03 Apr 2023
TacGNN:Learning Tactile-based In-hand Manipulation with a Blind Robot
Linhan Yang
Bidan Huang
Qingbiao Li
Ya-Yen Tsai
Wang Wei Lee
Chaoyang Song
Jia Pan
51
23
0
03 Apr 2023
Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning
Behnaz Hadi
A. Khosravi
Pouria Sarhadi
83
20
0
01 Apr 2023
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization
M. Chadi
H. Mousannif
OffRL
43
4
0
31 Mar 2023
Learning Human-to-Robot Handovers from Point Clouds
Sammy Christen
Wei Yang
Claudia Pérez-DÁrpino
Otmar Hilliges
Dieter Fox
Yu-Wei Chao
73
43
0
30 Mar 2023
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions
Yicheng Luo
Jackie Kay
Edward Grefenstette
M. Deisenroth
OffRL
OnRL
69
16
0
30 Mar 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
90
16
0
30 Mar 2023
Dependent Task Offloading in Edge Computing Using GNN and Deep Reinforcement Learning
Zequn Cao
Xiaoheng Deng
32
12
0
30 Mar 2023
Importance Sampling for Stochastic Gradient Descent in Deep Neural Networks
Thibault Lahire
31
2
0
29 Mar 2023
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations
Li Haofeng
C. Yiwen
Tan Jiayi
Marcelo H. Ang Jr
OffRL
35
2
0
29 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
98
85
0
28 Mar 2023
The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers
Valentin Macé
Raphael Boige
Félix Chalumeau
Thomas Pierrot
Guillaume Richard
Nicolas Perrin-Gilbert
OffRL
123
13
0
27 Mar 2023
Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning
Alex Beeson
Giovanni Montana
OffRL
70
13
0
26 Mar 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
53
14
0
24 Mar 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
77
4
0
24 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
74
1
0
23 Mar 2023
EDGI: Equivariant Diffusion for Planning with Embodied Agents
Johann Brehmer
Joey Bose
P. D. Haan
Taco S. Cohen
DiffM
105
36
0
22 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
66
6
0
21 Mar 2023
Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning
Dapeng Li
Feiyang Pan
Jia He
Zhiwei Xu
Dandan Tu
Guoliang Fan
AI4TS
56
2
0
21 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
71
5
0
17 Mar 2023
Efficient Learning of High Level Plans from Play
Núria Armengol Urpí
Marco Bagatella
Otmar Hilliges
Georg Martius
Stelian Coros
OffRL
50
3
0
16 Mar 2023
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics
Baihan Lin
Guillermo Cecchi
Djallel Bouneffouf
OffRL
AI4TS
AI4MH
107
11
0
16 Mar 2023
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
Mianchu Wang
Yue Jin
Giovanni Montana
OffRL
38
3
0
16 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
69
24
0
14 Mar 2023
Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning
Bryan Lim
Manon Flageat
Antoine Cully
OnRL
81
7
0
10 Mar 2023
Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot
Arthur Flajolet
118
10
0
09 Mar 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
188
125
0
09 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
114
3
0
08 Mar 2023
A Strategy-Oriented Bayesian Soft Actor-Critic Model
Qin Yang
Ramviyas Parasuraman
73
8
0
07 Mar 2023
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
Daniel Palenicek
M. Lutter
João Carvalho
Jan Peters
79
4
0
07 Mar 2023
MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
92
19
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
142
56
0
07 Mar 2023
Dexterous In-hand Manipulation by Guiding Exploration with Simple Sub-skill Controllers
Gagan Khandate
C. Mehlman
Xingsheng Wei
M. Ciocarlie
57
3
0
06 Mar 2023
Learning to Backdoor Federated Learning
Henger Li
Chen Wu
Senchun Zhu
Zizhan Zheng
FedML
82
10
0
06 Mar 2023
Sparsity-Aware Intelligent Massive Random Access Control in Open RAN: A Reinforcement Learning Based Approach
Xiaorui Tang
Sicong Liu
Xiaojiang Du
Mohsen Guizani
57
0
0
05 Mar 2023
Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks
Maryam Abdool
Tony Dear
34
1
0
05 Mar 2023
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
91
41
0
05 Mar 2023
CFlowNets: Continuous Control with Generative Flow Networks
Yinchuan Li
Shuang Luo
Haozhi Wang
Jianye Hao
132
23
0
04 Mar 2023
Decision Transformer under Random Frame Dropping
Kaizhe Hu
Rachel Zheng
Yang Gao
Huazhe Xu
OffRL
172
13
0
03 Mar 2023
Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based Deep Reinforcement Learning
Jorge de Heuvel
Weixian Shi
Xiangyu Zeng
Maren Bennewitz
95
1
0
02 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
69
1
0
02 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
83
10
0
02 Mar 2023
A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Young-Jin Sung
53
7
0
01 Mar 2023
Human-Inspired Framework to Accelerate Reinforcement Learning
Ali Beikmohammadi
Sindri Magnússon
OffRL
86
4
0
28 Feb 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
63
0
0
28 Feb 2023
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
85
26
0
28 Feb 2023
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
58
1
0
27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
75
2
0
26 Feb 2023
Previous
1
2
3
...
18
19
20
...
42
43
44
Next