Addressing Function Approximation Error in Actor-Critic Methods

26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
    OffRL

Papers citing "Addressing Function Approximation Error in Actor-Critic Methods"

50 / 2,180 papers shown
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Chengxing Jia
Fuxiang Zhang
Yi-Chen Li
Chenxiao Gao
Xu-Hui Liu
Lei Yuan
Zongzhang Zhang
Yang Yu
AAML
83
4
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
105
12
0
11 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
100
3
0
09 Mar 2024
Conservative DDPG -- Pessimistic RL without Ensemble
Nitsan Soffair
Shie Mannor
OffRL
47
0
0
08 Mar 2024
A mechanism-driven reinforcement learning framework for shape optimization of airfoils
Jingfeng Wang
Guanghui Hu
53
1
0
07 Mar 2024
Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control
Sadegh Sadeghi Tabas
Vidya Samadi
23
0
0
07 Mar 2024
Noisy Spiking Actor Network for Exploration
Ding Chen
Peixi Peng
Tiejun Huang
Yonghong Tian
45
1
0
07 Mar 2024
Cross Domain Policy Transfer with Effect Cycle-Consistency
Ruiqi Zhu
Tianhong Dai
Oya Celiktutan
83
3
0
04 Mar 2024
Towards Fair and Efficient Learning-based Congestion Control
Xudong Liao
Han Tian
Chaoliang Zeng
Xinchen Wan
Kai Chen
48
7
0
04 Mar 2024
Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy
Chenyang Cao
Zichen Yan
Renhao Lu
Junbo Tan
Xueqian Wang
OffRL
82
5
0
04 Mar 2024
Feint Behaviors and Strategies: Formalization, Implementation and Evaluation
Junyu Liu
Wangkai Jin
OffRL
55
0
0
04 Mar 2024
Barrier Functions Inspired Reward Shaping for Reinforcement Learning
Nilaksh Nilaksh
Abhishek Ranjan
Shreenabh Agrawal
Aayush Jain
Pushpak Jagtap
Shishir Kolathaya
OffRL
81
5
0
03 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
Il-Chul Moon
66
7
0
02 Mar 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
68
0
0
01 Mar 2024
SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation
Noriaki Hirose
Dhruv Shah
Kyle Stachowicz
A. Sridhar
Sergey Levine
126
5
0
01 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Miłoś
Tomasz Trzciński
M. Ostaszewski
Marek Cygan
OffRL
95
23
0
01 Mar 2024
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks
Takayuki Osa
Tatsuya Harada
126
2
0
01 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
76
0
0
01 Mar 2024
Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks
B. D. Evans
Raphael Trumpp
Marco Caccamo
Felix Jahncke
Johannes Betz
H. W. Jordaan
H. Engelbrecht
112
8
0
28 Feb 2024
Beacon, a lightweight deep reinforcement learning benchmark library for flow control
J. Viquerat
P. Meliga
Pablo Jeken
E. Hachem
AI4CE
49
1
0
27 Feb 2024
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Qifeng Li
Xiaosong Jia
Shaobo Wang
Junchi Yan
124
34
0
26 Feb 2024
Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
Lunet Yifru
Ali Baheri
OffRL
73
1
0
24 Feb 2024
Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space
Yuan Lin
Xiao Liu
Zishun Zheng
58
5
0
24 Feb 2024
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
106
30
0
23 Feb 2024
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
Yuanqing Yu
Chongming Gao
Jiawei Chen
Heng Tang
Yuefeng Sun
Qian Chen
Weizhi Ma
Min Zhang
OffRL
83
3
0
23 Feb 2024
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding
Haoming Li
Yusen Huo
Shuai Dou
Zhenzhe Zheng
Zhilin Zhang
Chuan Yu
Jian Xu
Fan Wu
OffRL
60
5
0
23 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
111
12
0
22 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
88
3
0
21 Feb 2024
Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark
Paul Daoudi
B. Mavkov
Bogdan Robu
Christophe Prieur
Emmanuel Witrant
M. Barlier
Ludovic Dos Santos
54
3
0
21 Feb 2024
Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies
Ammar N. Abbas
Chidera W. Amazu
Joseph Mietkiewicz
Houda Briwa
Andres Alonzo Perez
Gabriele Baldissone
M. Demichela
Georgios G. Chasparis
John D. Kelleher
M. Leva
130
2
0
20 Feb 2024
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
Huy Hoang
Tien Mai
Pradeep Varakantham
OffRL
77
3
0
20 Feb 2024
Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles
Dong Hu
Chao Huang
Jingda Wu
Hongbo Gao
85
6
0
20 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
96
4
0
19 Feb 2024
When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Davide Mambelli
Stephan Bongers
O. Zoeter
M. Spaan
F. Oliehoek
OffRL
30
0
0
19 Feb 2024
A novel framework for adaptive stress testing of autonomous vehicles in highways
Linh Trinh
Q. Luu
Thai M. Nguyen
Hai L. Vu
122
0
0
19 Feb 2024
Revisiting Experience Replayable Conditions
Taisuke Kobayashi
102
3
0
15 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
89
2
0
14 Feb 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
69
1
0
13 Feb 2024
Mixed Q-Functionals: Advancing Value-Based Methods in Cooperative MARL with Continuous Action Domains
Yasin Findik
S. Ahmadzadeh
OffRL
144
4
0
12 Feb 2024
Understanding Model Selection For Learning In Strategic Environments
Tinashe Handina
Eric Mazumdar
41
0
0
12 Feb 2024
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi
Guy Amir
Guy Katz
Alessandro Farinelli
AAML
63
7
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
133
20
0
05 Feb 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
89
10
0
05 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Sample Complexity of Algorithm Selection Using Neural Networks and Its Applications to Branch-and-Cut
Hongyu Cheng
Sammy Khalife
Barbara Fiedorowicz
Amitabh Basu
70
2
0
04 Feb 2024
SQT -- std $Q$-target
Nitsan Soffair
Dotan Di Castro
Orly Avner
Shie Mannor
OffRL
56
0
0
03 Feb 2024
Evolution Guided Generative Flow Networks
Zarif Ikram
Ling Pan
Dianbo Liu
156
1
0
03 Feb 2024
Learning the Market: Sentiment-Based Ensemble Trading Agents
Andrew Ye
James Xu
Yi Wang
Yifan Yu
Daniel Yan
Ryan Chen
Bosheng Dong
Vipin Chaudhary
Shuai Xu
AIFin
15
1
0
02 Feb 2024
To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko
Wendelin Böhmer
Mathijs de Weerdt
68
6
0
02 Feb 2024