Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
63
2
0
30 Oct 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
103
4
0
30 Oct 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu Wang
Yian Ma
88
6
0
29 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
81
9
0
28 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
100
4
0
27 Oct 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
Nicholas Corrado
Josiah P. Hanna
83
5
0
26 Oct 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
80
8
0
26 Oct 2023
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
Hongyu Zang
Xin-hui Li
Leiji Zhang
Yang Liu
Baigui Sun
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
104
5
0
26 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
50
1
0
26 Oct 2023
Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation
Chengpeng Li
Zhengyi Yang
Jizhi Zhang
Jiancan Wu
Dingxian Wang
Xiangnan He
Xiang Wang
OffRL
76
1
0
25 Oct 2023
State Sequences Prediction via Fourier Transform for Representation Learning
Mingxuan Ye
Yufei Kuang
Jie Wang
Rui Yang
Wen-gang Zhou
Houqiang Li
Feng Wu
AI4TS
88
8
0
24 Oct 2023
Safe and Interpretable Estimation of Optimal Treatment Regimes
Harsh Parikh
Quinn Lanners
Zade Akras
Sahar F. Zafar
M. P. M. Brandon Westover
Cynthia Rudin
A. Volfovsky
OffRL
31
1
0
23 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Zhongjian Qiao
Jiafei Lyu
Xiu Li
63
3
0
23 Oct 2023
Robust Visual Imitation Learning with Inverse Dynamics Representations
Siyuan Li
Xun Wang
Rongchang Zuo
Kewu Sun
Lingfei Cui
Jishiyu Ding
Peng Liu
Zhe Ma
68
4
0
22 Oct 2023
Stabilizing reinforcement learning control: A modular framework for optimizing over all stable behavior
Nathan P. Lawrence
Philip D. Loewen
Shuyuan Wang
M. Forbes
R. Bhushan Gopaluni
39
2
0
21 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
73
0
0
21 Oct 2023
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
128
4
0
20 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
86
18
0
19 Oct 2023
CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
Linrui Zhang
Zhenghao Peng
Quanyi Li
Bolei Zhou
AAML
95
28
0
19 Oct 2023
Learning to Generate Parameters of ConvNets for Unseen Image Data
Shiye Wang
Kaituo Feng
Changsheng Li
Ye Yuan
Guoren Wang
91
1
0
18 Oct 2023
Using Experience Classification for Training Non-Markovian Tasks
Ruixuan Miao
Xu Lu
Cong Tian
Bin Yu
Zhenhua Duan
OffRL
54
0
0
18 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li
Chen Gong
Qiang He
Xinwen Hou
64
1
0
17 Oct 2023
Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance
Thomas Chaffre
J. Wheare
A. Lammas
Paulo E. Santos
G. Chenadec
Karl Sammut
Benoit Clement
60
2
0
17 Oct 2023
End-to-end Offline Reinforcement Learning for Glycemia Control
Tristan Beolet
Alice Adenis
E. Huneker
Maxime Louis
OffRL
57
1
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
121
33
0
15 Oct 2023
Enhancing Task Performance of Learned Simplified Models via Reinforcement Learning
Hien Bui
Michael Posa
76
1
0
15 Oct 2023
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach
Heasung Kim
S. Ankireddy
OffRL
45
0
0
12 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
107
2
0
12 Oct 2023
Beyond Traditional DoE: Deep Reinforcement Learning for Optimizing Experiments in Model Identification of Battery Dynamics
Gokhan Budan
Francesca Damiani
Can Kurtulus
N. K. Üre
13
0
0
12 Oct 2023
Generative Intrinsic Optimization: Intrinsic Control with Model Learning
Jianfei Ma
58
0
0
12 Oct 2023
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Hao Sun
Alihan Huyuk
Daniel Jarrett
M. Schaar
OffRL
113
8
0
11 Oct 2023
Imitation Learning from Observation with Automatic Discount Scheduling
Yuyang Liu
Weijun Dong
Yingdong Hu
Chuan Wen
Zhao-Heng Yin
Chongjie Zhang
Yang Gao
75
8
0
11 Oct 2023
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
91
27
0
10 Oct 2023
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
HyeAnn Lee
Donghwan Lee
34
0
0
10 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
107
7
0
10 Oct 2023
Factual and Personalized Recommendations using Language Models and Reinforcement Learning
Jihwan Jeong
Yinlam Chow
Guy Tennenholtz
Chih-Wei Hsu
Azamat Tulepbergenov
Mohammad Ghavamzadeh
Craig Boutilier
88
4
0
09 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
87
23
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
76
6
0
09 Oct 2023
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Xiong-Hui Chen
Junyin Ye
Hang Zhao
Yi-Chen Li
Haoran Shi
...
Si-Hang Yang
Anqi Huang
Kai Xu
Zongzhang Zhang
Yang Yu
73
0
0
09 Oct 2023
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Longxiang He
Li Shen
Linrui Zhang
Junbo Tan
Xueqian Wang
OffRL
84
12
0
09 Oct 2023
Distributional Soft Actor-Critic with Three Refinements
Jingliang Duan
Wenxuan Wang
Liming Xiao
Jiaxin Gao
Shengbo Eben Li
Chang Liu
Ya-Qin Zhang
Bo Cheng
Keqiang Li
OODD
OffRL
80
3
0
09 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
77
22
0
06 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
120
17
0
06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
59
1
0
06 Oct 2023
Reinforcement Learning with Fast and Forgetful Memory
Steven D. Morad
Ryan Kortvelesy
Stephan Liwicki
Amanda Prorok
OffRL
47
4
0
06 Oct 2023
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
53
3
0
06 Oct 2023
Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards
Litton J. Kurisinkel
Nancy F. Chen
78
1
0
05 Oct 2023
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Akifumi Wachi
Wataru Hashimoto
Xun Shen
Kazumune Hashimoto
79
11
0
05 Oct 2023
B
\mathcal{B}
B
-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Zishun Yu
Yunzhe Tao
Liyu Chen
Tao Sun
Hongxia Yang
83
13
0
04 Oct 2023
Imitation Learning from Observation through Optimal Transport
Wei-Di Chang
Scott Fujimoto
David Meger
Gregory Dudek
61
4
0
02 Oct 2023
Previous
1
2
3
...
13
14
15
...
42
43
44
Next