Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 841 papers shown
Title
Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization
Chenyang Miao
Yunduan Cui
Huiyun Li
Xin Wu
24
5
0
26 Sep 2023
Adapting Double Q-Learning for Continuous Reinforcement Learning
Arsenii Kuznetsov
OffRL
OnRL
24
0
0
25 Sep 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
34
9
0
25 Sep 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
Botian Xu
Feng Gao
Chao Yu
Chao Yu
Yi Wu
Yu Wang
36
28
0
22 Sep 2023
PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring
T. Shaik
Xiaohui Tao
Lin Li
Haoran Xie
Usha R. Acharya
R. Gururajan
Xujuan Zhou
OffRL
AI4TS
26
0
0
19 Sep 2023
Efficient Reinforcement Learning for Jumping Monopods
Riccardo Bussola
Michele Focchi
Andrea Del Prete
Daniele Fontanelli
Luigi Palopoli
42
2
0
13 Sep 2023
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
34
13
0
12 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
30
8
0
04 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
46
13
0
31 Aug 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
28
0
0
31 Aug 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
33
1
0
14 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
35
15
0
31 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
27
3
0
29 Jul 2023
Worrisome Properties of Neural Network Controllers and Their Symbolic Representations
J. Cyranka
Kevin E. M. Church
J. Lessard
42
0
0
28 Jul 2023
Controlling the Latent Space of GANs through Reinforcement Learning: A Case Study on Task-based Image-to-Image Translation
Mahyar Abbasian
Taha Rajabzadeh
Ahmadreza Moradipari
Seyed Amir Hossein Aqajari
Hong-ming Lu
Amir M. Rahmani
13
3
0
26 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
52
2
0
21 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
40
4
0
16 Jul 2023
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
35
5
0
13 Jul 2023
Policy Contrastive Imitation Learning
Jialei Huang
Zhao-Heng Yin
Yingdong Hu
Yang Gao
34
3
0
06 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
23
24
0
05 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Josh Bloom
Pranjal Paliwal
Apratim Mukherjee
Carlo Pinciroli
30
3
0
22 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
35
1
0
21 Jun 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
Hang Wang
Sen Lin
Junshan Zhang
23
19
0
20 Jun 2023
Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning
Tianlun Hu
Qi Liao
Li-Yu Daisy Liu
Georg Carle
27
3
0
20 Jun 2023
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication
Adam Callaghan
Karl Mason
Patrick Mannion
37
2
0
20 Jun 2023
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
26
0
0
17 Jun 2023
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction
Cheng Ruei Tang
J. Hsieh
Shin-You Teng
21
0
0
16 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning
B. D. Evans
H. Engelbrecht
H. W. Jordaan
24
19
0
12 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
25
6
0
01 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
36
1
0
28 May 2023
Probing reaction channels via reinforcement learning
Senwei Liang
Aditya Singh
Yuanran Zhu
David T. Limmer
Chao Yang
28
6
0
27 May 2023
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets
Dinghuai Zhang
H. Dai
Nikolay Malkin
Aaron Courville
Yoshua Bengio
L. Pan
30
36
0
26 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
41
40
0
22 May 2023
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search
Naman Saxena
Subhojyoti Khastagir
Shishir Kolathaya
S. Bhatnagar
OffRL
10
8
0
20 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
35
37
0
16 May 2023
A Deep RL Approach on Task Placement and Scaling of Edge Resources for Cellular Vehicle-to-Network Service Provisioning
Cyril Shih-Huan Hsu
Jorge Martín-Pérez
D. D. Vleeschauwer
K. Kondepu
L. Valcarenghi
Xi Li
Chrysa Papagianni
13
0
0
16 May 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
34
14
0
16 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
Deep Reinforcement Learning Based Resource Allocation for Cloud Native Wireless Network
L. Wang
Jiasheng Wu
Yueyuan Gao
Jingjing Zhang
12
3
0
10 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
34
5
0
09 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
35
2
0
09 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
45
3
0
08 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
35
1
0
04 May 2023
An Imitation Learning Based Algorithm Enabling Priori Knowledge Transfer in Modern Electricity Markets for Bayesian Nash Equilibrium Estimation
Ziqing Zhu
K. Chan
S. Bu
Ze Hu
S. Xia
18
2
0
04 May 2023
Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning
Muhammad Burhan Hafez
Tilman Immisch
Tom Weber
S. Wermter
CLL
23
4
0
03 May 2023
A Multi-Task Approach to Robust Deep Reinforcement Learning for Resource Allocation
Steffen Gracla
C. Bockelmann
Armin Dekorsy
28
3
0
25 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
27
2
0
21 Apr 2023
Previous
1
2
3
4
5
6
...
15
16
17
Next