ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Multimodal foundation world models for generalist embodied agents
Multimodal foundation world models for generalist embodied agents
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Rameswar Panda
Sai Rajeswar
OffRLLM&Ro
84
1
0
26 Jun 2024
When does Self-Prediction help? Understanding Auxiliary Tasks in
  Reinforcement Learning
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
C. Voelcker
Tyler Kastner
Igor Gilitschenski
Amir-massoud Farahmand
SSL
90
6
0
25 Jun 2024
BricksRL: A Platform for Democratizing Robotics and Reinforcement
  Learning Research and Education with LEGO
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
Sebastian Dittert
Vincent Moens
Gianni De Fabritiis
82
1
0
25 Jun 2024
Tolerance of Reinforcement Learning Controllers against Deviations in
  Cyber Physical Systems
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
80
0
0
24 Jun 2024
Probabilistic Subgoal Representations for Hierarchical Reinforcement
  learning
Probabilistic Subgoal Representations for Hierarchical Reinforcement learning
V. Wang
Tinghuai Wang
Wenyan Yang
Joni-Kristian Kämäräinen
Joni Pajarinen
BDL
57
4
0
24 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
106
5
0
23 Jun 2024
Learning Abstract World Model for Value-preserving Planning with Options
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
86
1
0
22 Jun 2024
Learning Autonomous Race Driving with Action Mapping Reinforcement
  Learning
Learning Autonomous Race Driving with Action Mapping Reinforcement Learning
Yuanda Wang
Xin Yuan
Changyin Sun
69
2
0
21 Jun 2024
Defending Against Sophisticated Poisoning Attacks with RL-based
  Aggregation in Federated Learning
Defending Against Sophisticated Poisoning Attacks with RL-based Aggregation in Federated Learning
Yujing Wang
Hainan Zhang
Sijia Wen
Wangjie Qiu
Binghui Guo
AAML
85
0
0
20 Jun 2024
Trapezoidal Gradient Descent for Effective Reinforcement Learning in
  Spiking Networks
Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks
Yuhao Pan
Xiucheng Wang
Nan Cheng
Qi Qiu
86
0
0
19 Jun 2024
Reinforcement Learning to improve delta robot throws for sorting scrap
  metal
Reinforcement Learning to improve delta robot throws for sorting scrap metal
Arthur Louette
Gaspard Lambrechts
Damien Ernst
Eric Pirard
G. Dislaire
28
1
0
19 Jun 2024
Efficient Offline Reinforcement Learning: The Critic is Critical
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
Trevor A. McInroe
Sam Devlin
Amos Storkey
OffRL
93
1
0
19 Jun 2024
NaviSplit: Dynamic Multi-Branch Split DNNs for Efficient Distributed
  Autonomous Navigation
NaviSplit: Dynamic Multi-Branch Split DNNs for Efficient Distributed Autonomous Navigation
Timothy K Johnsen
Ian Harshbarger
Zixia Xia
Marco Levorato
69
1
0
18 Jun 2024
Discovering Minimal Reinforcement Learning Environments
Discovering Minimal Reinforcement Learning Environments
Jarek Liesen
Chris Xiaoxuan Lu
Andrei Lupu
Jakob N. Foerster
Henning Sprekeler
R. T. Lange
OffRL
92
4
0
18 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
Yingnan Zhao
112
1
0
17 Jun 2024
Reconfigurable Intelligent Surface Assisted VEC Based on Multi-Agent Reinforcement Learning
Reconfigurable Intelligent Surface Assisted VEC Based on Multi-Agent Reinforcement Learning
Kangwei Qi
Qiong Wu
Pingyi Fan
Nan Cheng
Qiang Fan
Jiangzhou Wang
102
12
0
17 Jun 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models
  in Decision Making
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
DiffM
97
17
0
13 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep
  Reinforcement Learning Algorithms
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman S. Kozat
Ozgur S. Oguz
29
1
0
13 Jun 2024
Deep Reinforcement Learning-based Quadcopter Controller: A Practical
  Approach and Experiments
Deep Reinforcement Learning-based Quadcopter Controller: A Practical Approach and Experiments
Truong-Dong Do
Nguyen Xuan Mung
Sung Kyung Hong
57
0
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline
  Datasets
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
96
4
0
13 Jun 2024
Time-Constrained Robust MDPs
Time-Constrained Robust MDPs
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OOD
66
1
0
12 Jun 2024
Explore-Go: Leveraging Exploration for Generalisation in Deep
  Reinforcement Learning
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede
Felix Kaubek
M. Spaan
Wendelin Bohmer
100
0
0
12 Jun 2024
Discovering Multiple Solutions from a Single Task in Offline
  Reinforcement Learning
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning
Takayuki Osa
Tatsuya Harada
OffRL
69
2
0
10 Jun 2024
Strategically Conservative Q-Learning
Strategically Conservative Q-Learning
Yutaka Shimizu
Joey Hong
Sergey Levine
Masayoshi Tomizuka
OffRLOnRL
76
0
0
06 Jun 2024
Redundancy-aware Action Spaces for Robot Learning
Redundancy-aware Action Spaces for Robot Learning
Pietro Mazzaglia
Nicholas Backshall
Xiao Ma
Stephen James
78
2
0
06 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
79
0
0
06 Jun 2024
Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning
Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning
Bahareh Tasdighi
Nicklas Werge
Yi-Shan Wu
M. Kandemir
30
0
0
06 Jun 2024
Quality-Diversity with Limited Resources
Quality-Diversity with Limited Resources
Ren-Jian Wang
Ke Xue
Cong Guan
Chao Qian
84
3
0
06 Jun 2024
Excluding the Irrelevant: Focusing Reinforcement Learning through
  Continuous Action Masking
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
CLL
46
4
0
06 Jun 2024
Reflective Policy Optimization
Reflective Policy Optimization
Yaozhong Gan
Renye Yan
Zhe Wu
Junliang Xing
84
1
0
06 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function
  in Offline Reinforcement Learning
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
115
0
0
05 Jun 2024
Representation Learning For Efficient Deep Multi-Agent Reinforcement
  Learning
Representation Learning For Efficient Deep Multi-Agent Reinforcement Learning
Dom Huh
Prasant Mohapatra
91
1
0
05 Jun 2024
iQRL -- Implicitly Quantized Representations for Sample-efficient
  Reinforcement Learning
iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
Aidan Scannell
Kalle Kujanpää
Yi Zhao
Mohammadreza Nakhaei
Dieter Büchler
Joni Pajarinen
SSL
147
5
0
04 Jun 2024
Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space
  Alignment
Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space Alignment
Tianyu Wang
Dwait Bhatt
Xiaolong Wang
Nikolay Atanasov
124
5
0
04 Jun 2024
Improving Generalization in Aerial and Terrestrial Mobile Robots Control
  Through Delayed Policy Learning
Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning
Ricardo B. Grando
Raul Steinmetz
V. A. Kich
A. H. Kolling
Pablo M. Furik
J. C. Jesus
Bruna V. Guterres
D. T. Gamarra
R. S. Guerra
Paulo L. J. Drews-Jr
92
5
0
04 Jun 2024
Learning the Target Network in Function Space
Learning the Target Network in Function Space
Kavosh Asadi
Yao Liu
Shoham Sabach
Ming Yin
Rasool Fakoor
117
0
0
03 Jun 2024
MOSEAC: Streamlined Variable Time Step Reinforcement Learning
MOSEAC: Streamlined Variable Time Step Reinforcement Learning
Dong Wang
Giovanni Beltrame
75
1
0
03 Jun 2024
Learning-based legged locomotion; state of the art and future
  perspectives
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
144
20
0
03 Jun 2024
Deep reinforcement learning for weakly coupled MDP's with continuous
  actions
Deep reinforcement learning for weakly coupled MDP's with continuous actions
Francisco Robledo
U. Ayesta
Konstantin Avrachenkov
51
0
0
03 Jun 2024
REvolve: Reward Evolution with Large Language Models using Human Feedback
REvolve: Reward Evolution with Large Language Models using Human Feedback
Rishi Hazra
Alkis Sygkounas
Andreas Persson
Amy Loutfi
Pedro Zuidberg Dos Martires
101
3
0
03 Jun 2024
Value Improved Actor Critic Algorithms
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
Wendelin Bohmer
OffRL
89
0
0
03 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy
  Gradient
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
130
18
0
02 Jun 2024
Target Networks and Over-parameterization Stabilize Off-policy
  Bootstrapping with Function Approximation
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Fengdi Che
Chenjun Xiao
Jincheng Mei
Bo Dai
Ramki Gummadi
Oscar A Ramirez
Christopher K Harris
A. R. Mahmood
Dale Schuurmans
78
5
0
31 May 2024
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Haotian Hu
Yiqin Yang
Jianing Ye
Chengjie Wu
Ziqing Mai
Yujing Hu
Tangjie Lv
Changjie Fan
Qianchuan Zhao
Chongjie Zhang
OffRLOnRL
73
3
0
31 May 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
168
7
0
31 May 2024
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement
  Learning
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Davide Corsi
Davide Camponogara
Alessandro Farinelli
OffRL
77
2
0
30 May 2024
Adaptive Advantage-Guided Policy Regularization for Offline
  Reinforcement Learning
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
100
8
0
30 May 2024
Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement
  Learning
Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning
Hanye Zhao
Xiaoshen Han
Zhengbang Zhu
Minghuan Liu
Yong Yu
Weinan Zhang
OffRL
99
1
0
29 May 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
111
3
0
29 May 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
64
5
0
29 May 2024
Previous
123...789...424344
Next