Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Chengxing Jia
Fuxiang Zhang
Yi-Chen Li
Chenxiao Gao
Xu-Hui Liu
Lei Yuan
Zongzhang Zhang
Yang Yu
AAML
83
4
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
105
12
0
11 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
100
3
0
09 Mar 2024
Conservative DDPG -- Pessimistic RL without Ensemble
Nitsan Soffair
Shie Mannor
OffRL
47
0
0
08 Mar 2024
A mechanism-driven reinforcement learning framework for shape optimization of airfoils
Jingfeng Wang
Guanghui Hu
53
1
0
07 Mar 2024
Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control
Sadegh Sadeghi Tabas
Vidya Samadi
23
0
0
07 Mar 2024
Noisy Spiking Actor Network for Exploration
Ding Chen
Peixi Peng
Tiejun Huang
Yonghong Tian
45
1
0
07 Mar 2024
Cross Domain Policy Transfer with Effect Cycle-Consistency
Ruiqi Zhu
Tianhong Dai
Oya Celiktutan
83
3
0
04 Mar 2024
Towards Fair and Efficient Learning-based Congestion Control
Xudong Liao
Han Tian
Chaoliang Zeng
Xinchen Wan
Kai Chen
48
7
0
04 Mar 2024
Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy
Chenyang Cao
Zichen Yan
Renhao Lu
Junbo Tan
Xueqian Wang
OffRL
82
5
0
04 Mar 2024
Feint Behaviors and Strategies: Formalization, Implementation and Evaluation
Junyu Liu
Wangkai Jin
OffRL
55
0
0
04 Mar 2024
Barrier Functions Inspired Reward Shaping for Reinforcement Learning
Nilaksh Nilaksh
Abhishek Ranjan
Shreenabh Agrawal
Aayush Jain
Pushpak Jagtap
Shishir Kolathaya
OffRL
81
5
0
03 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
IL-Chul Moon
66
7
0
02 Mar 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
68
0
0
01 Mar 2024
SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation
Noriaki Hirose
Dhruv Shah
Kyle Stachowicz
A. Sridhar
Sergey Levine
126
5
0
01 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Milo's
Tomasz Trzciñski
M. Ostaszewski
Marek Cygan
OffRL
95
23
0
01 Mar 2024
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks
Takayuki Osa
Tatsuya Harada
126
2
0
01 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
76
0
0
01 Mar 2024
Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks
B. D. Evans
Raphael Trumpp
Marco Caccamo
Felix Jahncke
Johannes Betz
H. W. Jordaan
H. Engelbrecht
112
8
0
28 Feb 2024
Beacon, a lightweight deep reinforcement learning benchmark library for flow control
J. Viquerat
P. Meliga
Pablo Jeken
E. Hachem
AI4CE
49
1
0
27 Feb 2024
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Qifeng Li
Xiaosong Jia
Shaobo Wang
Junchi Yan
124
34
0
26 Feb 2024
Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
Lunet Yifru
Ali Baheri
OffRL
73
1
0
24 Feb 2024
Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space
Yuan Lin
Xiao Liu
Zishun Zheng
58
5
0
24 Feb 2024
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
106
30
0
23 Feb 2024
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
Yuanqing Yu
Chongming Gao
Jiawei Chen
Heng Tang
Yuefeng Sun
Qian Chen
Weizhi Ma
Min Zhang
OffRL
83
3
0
23 Feb 2024
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding
Haoming Li
Yusen Huo
Shuai Dou
Zhenzhe Zheng
Zhilin Zhang
Chuan Yu
Jian Xu
Fan Wu
OffRL
60
5
0
23 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
111
12
0
22 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
88
3
0
21 Feb 2024
Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark
Paul Daoudi
B. Mavkov
Bogdan Robu
Christophe Prieur
Emmanuel Witrant
M. Barlier
Ludovic Dos Santos
54
3
0
21 Feb 2024
Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies
Ammar N. Abbas
Chidera W. Amazu
Joseph Mietkiewicz
Houda Briwa
Andres Alonzo Perez
Gabriele Baldissone
M. Demichela
Georgios G. Chasparis
John D. Kelleher
M. Leva
130
2
0
20 Feb 2024
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
Huy Hoang
Tien Mai
Pradeep Varakantham
OffRL
77
3
0
20 Feb 2024
Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles
Dong Hu
Chao Huang
Jingda Wu
Hongbo Gao
85
6
0
20 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
96
4
0
19 Feb 2024
When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Davide Mambelli
Stephan Bongers
O. Zoeter
M. Spaan
F. Oliehoek
OffRL
30
0
0
19 Feb 2024
A novel framework for adaptive stress testing of autonomous vehicles in highways
Linh Trinh
Q. Luu
Thai M. Nguyen
Hai L. Vu
122
0
0
19 Feb 2024
Revisiting Experience Replayable Conditions
Taisuke Kobayashi
102
3
0
15 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
89
2
0
14 Feb 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
69
1
0
13 Feb 2024
Mixed Q-Functionals: Advancing Value-Based Methods in Cooperative MARL with Continuous Action Domains
Yasin Findik
S. Ahmadzadeh
OffRL
144
4
0
12 Feb 2024
Understanding Model Selection For Learning In Strategic Environments
Tinashe Handina
Eric Mazumdar
41
0
0
12 Feb 2024
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi
Guy Amir
Guy Katz
Alessandro Farinelli
AAML
63
7
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
133
20
0
05 Feb 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
89
10
0
05 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Sample Complexity of Algorithm Selection Using Neural Networks and Its Applications to Branch-and-Cut
Hongyu Cheng
Sammy Khalife
Barbara Fiedorowicz
Amitabh Basu
70
2
0
04 Feb 2024
SQT -- std
Q
Q
Q
-target
Nitsan Soffair
Dotan Di Castro
Orly Avner
Shie Mannor
OffRL
56
0
0
03 Feb 2024
Evolution Guided Generative Flow Networks
Zarif Ikram
Ling Pan
Dianbo Liu
156
1
0
03 Feb 2024
Learning the Market: Sentiment-Based Ensemble Trading Agents
Andrew Ye
James Xu
Yi Wang
Yifan Yu
Daniel Yan
Ryan Chen
Bosheng Dong
Vipin Chaudhary
Shuai Xu
AIFin
15
1
0
02 Feb 2024
To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko
Wendelin Bohmer
Mathijs de Weerdt
68
6
0
02 Feb 2024
Previous
1
2
3
...
10
11
12
...
42
43
44
Next