Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09477
Cited By
v1
v2
v3 (latest)
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
Haanvid Lee
Tri Wahyu Guntara
Jongmin Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
64
1
0
29 May 2024
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
Tianle Zhang
Jiayi Guan
Lin Zhao
Yihang Li
Dongjiang Li
...
Lei Sun
Yue Chen
Xuelong Wei
Lusong Li
Xiaodong He
92
2
0
29 May 2024
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Fengshuo Bai
Rui Zhao
Hongming Zhang
Sijia Cui
Ying Wen
Yaodong Yang
Bo Xu
Lei Han
OffRL
95
8
0
29 May 2024
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Zhiyao Luo
Mingcheng Zhu
Fenglin Liu
Jiali Li
Yangchen Pan
Jiandong Zhou
Tingting Zhu
OffRL
62
3
0
28 May 2024
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
OnRL
108
3
0
28 May 2024
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
Longxiang He
Li Shen
Junbo Tan
Xueqian Wang
113
4
0
28 May 2024
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
96
1
0
28 May 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
99
0
0
27 May 2024
Rethinking Transformers in Solving POMDPs
Chenhao Lu
Ruizhe Shi
Yuyao Liu
Kaizhe Hu
Simon S. Du
Huazhe Xu
AI4CE
117
3
0
27 May 2024
RoboArm-NMP: a Learning Environment for Neural Motion Planning
Tom Jurgenson
Matan Sudry
Gal Avineri
Aviv Tamar
59
0
0
25 May 2024
Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions
Harry Zhang
69
0
0
25 May 2024
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Shutong Ding
Ke Hu
Zhenhao Zhang
Kan Ren
Weinan Zhang
Jingyi Yu
Jingya Wang
Ye-ling Shi
111
21
0
25 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Milo's
Marek Cygan
OffRL
116
36
0
25 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
124
4
0
25 May 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
82
1
0
24 May 2024
Diffusion Actor-Critic with Entropy Regulator
Yinuo Wang
Likun Wang
Yuxuan Jiang
Wenjun Zou
Tong Liu
...
Wenxuan Wang
Liming Xiao
Jiang Wu
Jingliang Duan
Shengbo Eben Li
DiffM
135
17
0
24 May 2024
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Lea Richtmann
Viktoria-S. Schmiesing
Dennis Wilken
Jan Heine
Aaron Tranter
Avishek Anand
Tobias J. Osborne
M. Heurs
93
2
0
24 May 2024
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences
Takuya Hiraoka
Guanquan Wang
Takashi Onishi
Yoshimasa Tsuruoka
107
0
0
23 May 2024
Offline Reinforcement Learning from Datasets with Structured Non-Stationarity
Johannes Ackermann
Takayuki Osa
Masashi Sugiyama
OffRL
71
3
0
23 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
107
3
0
23 May 2024
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles
Jiawei Zhang
Chejian Xu
Yue Liu
109
48
0
22 May 2024
Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning
Arko Banerjee
Kia Rahmani
Joydeep Biswas
Işıl Dillig
82
2
0
22 May 2024
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
Chen-Hao Chao
Chien Feng
Wei-Fang Sun
Cheng-Kuang Lee
Simon See
Chun-Yi Lee
86
5
0
22 May 2024
Near-Field Spot Beamfocusing: A Correlation-Aware Transfer Learning Approach
Mohammad Amir Fallah
M. Monemi
Mehdi Rasti
Matti Latva-aho
50
2
0
21 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
91
3
0
20 May 2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
64
1
0
19 May 2024
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses
Thanh Nguyen
Tung M. Luu
Tri Ton
Chang D. Yoo
OffRL
AAML
84
0
0
18 May 2024
Reinforcement learning
Florentin Wörgötter
91
2,525
0
16 May 2024
NaviSlim: Adaptive Context-Aware Navigation and Sensing via Dynamic Slimmable Networks
Timothy K Johnsen
Marco Levorato
80
1
0
16 May 2024
Chaos-based reinforcement learning with TD3
Toshitaka Matsuki
Yusuke Sakemi
Kazuyuki Aihara
112
0
0
15 May 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
85
4
0
14 May 2024
CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning
Jingwen Wang
Dehui Du
Yida Li
Yiyang Li
Yikang Chen
AI4TS
CML
32
0
0
14 May 2024
DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Yang Jin
Jun Lv
Shuqiang Jiang
Cewu Lu
127
1
0
12 May 2024
Soft Contact Simulation and Manipulation Learning of Deformable Objects with Vision-based Tactile Sensor
Jianhua Shan
Yuhao Sun
Shixin Zhang
Gang Hua
Zixi Chen
Zirong Shen
Cesare Stefanini
Yiyong Yang
Shan Luo
Bin Fang
74
2
0
12 May 2024
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
79
1
0
12 May 2024
Value Augmented Sampling for Language Model Alignment and Personalization
Seungwook Han
Idan Shenfeld
Akash Srivastava
Yoon Kim
Pulkit Agrawal
OffRL
88
29
0
10 May 2024
Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models
Aylin Gunal
Baihan Lin
Djallel Bouneffouf
OffRL
AI4MH
LM&MA
68
1
0
08 May 2024
TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters
J. Lavington
Ke Zhang
Vasileios Lioutas
Matthew Niedoba
Yunpeng Liu
...
Xiaoxuan Liang
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank Wood
87
5
0
07 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
101
1
0
07 May 2024
Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies
Paul Templier
Emmanuel Rachelson
Antoine Cully
Dennis G. Wilson
44
0
0
07 May 2024
Improving Offline Reinforcement Learning with Inaccurate Simulators
Yiwen Hou
Haoyuan Sun
Jinming Ma
Feng Wu
OffRL
62
6
0
07 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
88
0
0
07 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
96
0
0
06 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
80
0
0
04 May 2024
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
David Valencia
Henry Williams
Trevor Gee
Bruce A MacDonaland
Minas V. Liarokapis
Minas Liarokapis
OffRL
164
2
0
04 May 2024
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Alessandro Montenegro
Marco Mussi
Alberto Maria Metelli
Matteo Papini
79
3
0
03 May 2024
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
Chengqian Gao
William de Vazelhes
Hualin Zhang
Bin Gu
Zhiqiang Xu
103
0
0
02 May 2024
Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning
Qiyuan Liu
58
0
0
02 May 2024
Markov flow policy -- deep MC
Nitsan Soffair
Gilad Katz
58
0
0
01 May 2024
Learning Tactile Insertion in the Real World
Daniel Palenicek
Theo Gruner
Tim Schneider
Alina Böhm
Janis Lenz
Inga Pfenning
Eric Krämer
Jan Peters
82
2
0
01 May 2024
Previous
1
2
3
...
8
9
10
...
42
43
44
Next