Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
184
1
0
16 Dec 2024
C3: Learning Congestion Controllers with Formal Certificates
Chenxi Yang
Divyanshu Saxena
Rohit Dwivedula
Kshiteej S. Mahajan
Swarat Chaudhuri
Aditya Akella
111
1
0
14 Dec 2024
ChatDyn: Language-Driven Multi-Actor Dynamics Generation in Street Scenes
Yuxi Wei
Jingbo Wang
Yuwen Du
Dingju Wang
Liang Pan
Chenxin Xu
Yao Feng
Bo Dai
Siheng Chen
AI4CE
150
1
0
11 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL, OnRL
184
10
0
09 Dec 2024
Conformal Symplectic Optimization for Stable Reinforcement Learning
Yao Lyu
Xiangteng Zhang
Shengbo Eben Li
Jingliang Duan
Letian Tao
Qing Xu
Lei He
Keqiang Li
174
0
0
03 Dec 2024
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
100
0
0
28 Nov 2024
Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning
Neel Jawale
Byron Boots
Balakumar Sundaralingam
M. Bhardwaj
123
0
0
27 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
179
0
0
24 Nov 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
181
1
0
22 Nov 2024
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning
Zhi Luo
Xiaoyu Yang
Pan Zhou
D. Wang
AAML
110
0
0
20 Nov 2024
AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents
Kevin Godin-Dubois
Karine Miras
Anna V. Kononova
92
0
0
20 Nov 2024
Preference-Conditioned Gradient Variations for Multi-Objective Quality-Diversity
Hannah Janmohamed
Maxence Faldor
Thomas Pierrot
Antoine Cully
118
1
0
19 Nov 2024
Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning
Ting Zhu
Yue Jin
Jeremie Houssineau
Giovanni Montana
74
0
0
17 Nov 2024
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
Alex Beeson
David Ireland
Giovanni Montana
OffRL
134
2
0
17 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
141
2
0
11 Nov 2024
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Rohit Bokade
Xiaoning Jin
OffRL
194
0
0
10 Nov 2024
Deep Reinforcement Learning for Digital Twin-Oriented Complex Networked Systems
Jiaqi Wen
Bogdan Gabrys
Katarzyna Musial
AI4CE
66
0
0
09 Nov 2024
Structure Matters: Dynamic Policy Gradient
Sara Klein
Xiangyuan Zhang
Tamer Basar
Simon Weissmann
Leif Döring
59
0
0
07 Nov 2024
Harnessing the Power of Gradient-Based Simulations for Multi-Objective Optimization in Particle Accelerators
Kishansingh Rajput
Malachi Schram
Auralee Edelen
Jonathan Colen
Armen Kasparian
Ryan Roussel
Adam Carpenter
He Zhang
Jay Benesch
60
1
0
07 Nov 2024
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
105
0
0
06 Nov 2024
IRS-Enhanced Secure Semantic Communication Networks: Cross-Layer and Context-Awared Resource Allocation
Lingyi Wang
Wei Wu
Fuhui Zhou
Zhijin Qin
Qihui Wu
91
3
0
04 Nov 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
88
1
0
01 Nov 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL, OnRL
68
0
0
31 Oct 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
61
0
0
31 Oct 2024
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
Kendong Liu
Zhiyu Zhu
Chuanhao Li
Hui Liu
H. Zeng
Junhui Hou
EGVM
76
4
0
29 Oct 2024
Human-Readable Programs as Actors of Reinforcement Learning Agents Using Critic-Moderated Evolution
Senne Deproost
Denis Steckelmacher
Ann Nowé
69
0
0
29 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
168
0
0
27 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
140
7
0
25 Oct 2024
Leveraging Graph Neural Networks and Multi-Agent Reinforcement Learning for Inventory Control in Supply Chains
Niki Kotecha
Antonio del Rio Chanona
62
2
0
24 Oct 2024
Learning Versatile Skills with Curriculum Masking
Yao Tang
Zhihui Xie
Zichuan Lin
Deheng Ye
Shuai Li
OffRL
73
0
0
23 Oct 2024
Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks
Tao Li
Henger Li
Yunian Pan
Tianyi Xu
Zizhan Zheng
Quanyan Zhu
FedML
76
5
0
22 Oct 2024
Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning
Dongsu Lee
Minhae Kwon
100
1
0
22 Oct 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
107
1
0
22 Oct 2024
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Yanjun Chen
Wei Wei
Xianghui Wang
Zhiqiang Xu
Xiaoyu Shen
Wei Zhang
36
0
0
22 Oct 2024
Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning
Wenqi Bai
Xiaohui Zhang
Shiliang Zhang
Songnan Yang
Yushuai Li
Tingwen Huang
AI4CE
49
2
0
21 Oct 2024
Reinforced Imitative Trajectory Planning for Urban Automated Driving
Di Zeng
Ling Zheng
Xiantong Yang
Yinong Li
55
0
0
21 Oct 2024
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
Zhichao Wang
Xinhai Chen
Chunye Gong
Bo Yang
Liang Deng
Yufei Sun
Yufei Pang
Jie Liu
AI4CE
63
0
0
19 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL, OnRL
138
0
0
19 Oct 2024
Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution
Zijie Zhao
Roy E. Welsch
AIFin
112
1
0
19 Oct 2024
Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments
Mariusz Wisniewski
Paraskevas Chatzithanos
Weisi Guo
Antonios Tsourdos
64
3
0
18 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
72
0
0
17 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
169
19
0
17 Oct 2024
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
Jinzhu Luo
Dingyang Chen
Qi Zhang
OffRL
78
0
0
16 Oct 2024
The State of Robot Motion Generation
Kostas E. Bekris
Joe H. Doerr
Patrick Meng
Sumanth Tangirala
3DV
89
3
0
16 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
44
0
0
15 Oct 2024
Visual Manipulation with Legs
Xialin He
Chengjing Yuan
Wenxuan Zhou
Ruihan Yang
David Held
Xiaolong Wang
116
3
0
15 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
83
1
0
15 Oct 2024
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
Yunho Kim
Jaehyun Park
Heejun Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
81
1
0
15 Oct 2024
Communication-Control Codesign for Large-Scale Wireless Networked Control Systems
Gaoyang Pang
Wanchun Liu
Dusit Niyato
Branka Vucetic
Yonghui Li
AI4CE
84
0
0
15 Oct 2024
Large Language Model Evaluation via Matrix Nuclear-Norm
Yongbin Li
Tingyu Xia
Yi-Ju Chang
Yuan Wu
61
1
0
14 Oct 2024