ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09477
  4. Cited By
Addressing Function Approximation Error in Actor-Critic Methods
v1v2v3 (latest)

Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Title
Model-Based Reparameterization Policy Gradient Methods: Theory and
  Practical Algorithms
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
63
2
0
30 Oct 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
103
4
0
30 Oct 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with
  Linear Function Approximation
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu Wang
Yian Ma
88
6
0
29 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
81
9
0
28 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
100
4
0
27 Oct 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit
  Model-Free Reinforcement Learning Updates
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
Nicholas Corrado
Josiah P. Hanna
83
5
0
26 Oct 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
CQM: Curriculum Reinforcement Learning with a Quantized World Model
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
80
8
0
26 Oct 2023
Understanding and Addressing the Pitfalls of Bisimulation-based
  Representations in Offline Reinforcement Learning
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
Hongyu Zang
Xin-hui Li
Leiji Zhang
Yang Liu
Baigui Sun
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
104
5
0
26 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
50
1
0
26 Oct 2023
Model-enhanced Contrastive Reinforcement Learning for Sequential
  Recommendation
Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation
Chengpeng Li
Zhengyi Yang
Jizhi Zhang
Jiancan Wu
Dingxian Wang
Xiangnan He
Xiang Wang
OffRL
76
1
0
25 Oct 2023
State Sequences Prediction via Fourier Transform for Representation
  Learning
State Sequences Prediction via Fourier Transform for Representation Learning
Mingxuan Ye
Yufei Kuang
Jie Wang
Rui Yang
Wen-gang Zhou
Houqiang Li
Feng Wu
AI4TS
88
8
0
24 Oct 2023
Safe and Interpretable Estimation of Optimal Treatment Regimes
Safe and Interpretable Estimation of Optimal Treatment Regimes
Harsh Parikh
Quinn Lanners
Zade Akras
Sahar F. Zafar
M. P. M. Brandon Westover
Cynthia Rudin
A. Volfovsky
OffRL
31
1
0
23 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Zhongjian Qiao
Jiafei Lyu
Xiu Li
63
3
0
23 Oct 2023
Robust Visual Imitation Learning with Inverse Dynamics Representations
Robust Visual Imitation Learning with Inverse Dynamics Representations
Siyuan Li
Xun Wang
Rongchang Zuo
Kewu Sun
Lingfei Cui
Jishiyu Ding
Peng Liu
Zhe Ma
68
4
0
22 Oct 2023
Stabilizing reinforcement learning control: A modular framework for
  optimizing over all stable behavior
Stabilizing reinforcement learning control: A modular framework for optimizing over all stable behavior
Nathan P. Lawrence
Philip D. Loewen
Shuyuan Wang
M. Forbes
R. Bhushan Gopaluni
39
2
0
21 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient
  DRL
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
73
0
0
21 Oct 2023
Absolute Policy Optimization
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
128
4
0
20 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data
  Corruption
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRLOnRL
86
18
0
19 Oct 2023
CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
Linrui Zhang
Zhenghao Peng
Quanyi Li
Bolei Zhou
AAML
95
28
0
19 Oct 2023
Learning to Generate Parameters of ConvNets for Unseen Image Data
Learning to Generate Parameters of ConvNets for Unseen Image Data
Shiye Wang
Kaituo Feng
Changsheng Li
Ye Yuan
Guoren Wang
91
1
0
18 Oct 2023
Using Experience Classification for Training Non-Markovian Tasks
Using Experience Classification for Training Non-Markovian Tasks
Ruixuan Miao
Xu Lu
Cong Tian
Bin Yu
Zhenhua Duan
OffRL
54
0
0
18 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in
  Continuous Control
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li
Chen Gong
Qiang He
Xinwen Hou
64
1
0
17 Oct 2023
Sim-to-Real Transfer of Adaptive Control Parameters for AUV
  Stabilization under Current Disturbance
Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance
Thomas Chaffre
J. Wheare
A. Lammas
Paulo E. Santos
G. Chenadec
Karl Sammut
Benoit Clement
60
2
0
17 Oct 2023
End-to-end Offline Reinforcement Learning for Glycemia Control
End-to-end Offline Reinforcement Learning for Glycemia Control
Tristan Beolet
Alice Adenis
E. Huneker
Maxime Louis
OffRL
57
1
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRLLM&Ro
121
33
0
15 Oct 2023
Enhancing Task Performance of Learned Simplified Models via
  Reinforcement Learning
Enhancing Task Performance of Learned Simplified Models via Reinforcement Learning
Hien Bui
Michael Posa
76
1
0
15 Oct 2023
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch
  Constrained Off-Policy Approach
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach
Heasung Kim
S. Ankireddy
OffRL
45
0
0
12 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRLOnRL
107
2
0
12 Oct 2023
Beyond Traditional DoE: Deep Reinforcement Learning for Optimizing
  Experiments in Model Identification of Battery Dynamics
Beyond Traditional DoE: Deep Reinforcement Learning for Optimizing Experiments in Model Identification of Battery Dynamics
Gokhan Budan
Francesca Damiani
Can Kurtulus
N. K. Üre
13
0
0
12 Oct 2023
Generative Intrinsic Optimization: Intrinsic Control with Model Learning
Generative Intrinsic Optimization: Intrinsic Control with Model Learning
Jianfei Ma
58
0
0
12 Oct 2023
Accountability in Offline Reinforcement Learning: Explaining Decisions
  with a Corpus of Examples
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Hao Sun
Alihan Huyuk
Daniel Jarrett
M. Schaar
OffRL
113
8
0
11 Oct 2023
Imitation Learning from Observation with Automatic Discount Scheduling
Imitation Learning from Observation with Automatic Discount Scheduling
Yuyang Liu
Weijun Dong
Yingdong Hu
Chuan Wen
Zhao-Heng Yin
Chongjie Zhang
Yang Gao
75
8
0
11 Oct 2023
Boosting Continuous Control with Consistency Policy
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
91
27
0
10 Oct 2023
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
HyeAnn Lee
Donghwan Lee
34
0
0
10 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based
  Reinforcement Learning
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
107
7
0
10 Oct 2023
Factual and Personalized Recommendations using Language Models and
  Reinforcement Learning
Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Jihwan Jeong
Yinlam Chow
Guy Tennenholtz
Chih-Wei Hsu
Azamat Tulepbergenov
Mohammad Ghavamzadeh
Craig Boutilier
88
4
0
09 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
87
23
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRLOnRL
76
6
0
09 Oct 2023
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable
  Environments
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Xiong-Hui Chen
Junyin Ye
Hang Zhao
Yi-Chen Li
Haoran Shi
...
Si-Hang Yang
Anqi Huang
Kai Xu
Zongzhang Zhang
Yang Yu
73
0
0
09 Oct 2023
DiffCPS: Diffusion Model based Constrained Policy Search for Offline
  Reinforcement Learning
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Longxiang He
Li Shen
Linrui Zhang
Junbo Tan
Xueqian Wang
OffRL
84
12
0
09 Oct 2023
Distributional Soft Actor-Critic with Three Refinements
Distributional Soft Actor-Critic with Three Refinements
Jingliang Duan
Wenxuan Wang
Liming Xiao
Jiaxin Gao
Shengbo Eben Li
Chang Liu
Ya-Qin Zhang
Bo Cheng
Keqiang Li
OODDOffRL
80
3
0
09 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced
  Datasets
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
77
22
0
06 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
120
17
0
06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in
  Non-Visual Environments: A Comparison
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
59
1
0
06 Oct 2023
Reinforcement Learning with Fast and Forgetful Memory
Reinforcement Learning with Fast and Forgetful Memory
Steven D. Morad
Ryan Kortvelesy
Stephan Liwicki
Amanda Prorok
OffRL
47
4
0
06 Oct 2023
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
53
3
0
06 Oct 2023
Controllable Multi-document Summarization: Coverage & Coherence
  Intuitive Policy with Large Language Model Based Rewards
Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards
Litton J. Kurisinkel
Nancy F. Chen
78
1
0
05 Oct 2023
Safe Exploration in Reinforcement Learning: A Generalized Formulation
  and Algorithms
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Akifumi Wachi
Wataru Hashimoto
Xun Shen
Kazumune Hashimoto
79
11
0
05 Oct 2023
$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program
  Synthesis
B\mathcal{B}B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Zishun Yu
Yunzhe Tao
Liyu Chen
Tao Sun
Hongxia Yang
83
13
0
04 Oct 2023
Imitation Learning from Observation through Optimal Transport
Imitation Learning from Observation through Optimal Transport
Wei-Di Chang
Scott Fujimoto
David Meger
Gregory Dudek
61
4
0
02 Oct 2023
Previous
123...131415...424344
Next