1802.09477
Addressing Function Approximation Error in Actor-Critic Methods
26 February 2018
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
Papers citing
"Addressing Function Approximation Error in Actor-Critic Methods"
50 / 2,180 papers shown
Title
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
184
1
0
16 Dec 2024
C3: Learning Congestion Controllers with Formal Certificates
Chenxi Yang
Divyanshu Saxena
Rohit Dwivedula
Kshiteej S. Mahajan
Swarat Chaudhuri
Aditya Akella
111
1
0
14 Dec 2024
ChatDyn: Language-Driven Multi-Actor Dynamics Generation in Street Scenes
Yuxi Wei
Jingbo Wang
Yuwen Du
Dingju Wang
Liang Pan
Chenxin Xu
Yao Feng
Bo Dai
Siheng Chen
AI4CE
150
1
0
11 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
184
10
0
09 Dec 2024
Conformal Symplectic Optimization for Stable Reinforcement Learning
Yao Lyu
Xiangteng Zhang
Shengbo Eben Li
Jingliang Duan
Letian Tao
Qing Xu
Lei He
Keqiang Li
174
0
0
03 Dec 2024
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
100
0
0
28 Nov 2024
Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning
Neel Jawale
Byron Boots
Balakumar Sundaralingam
M. Bhardwaj
123
0
0
27 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
179
0
0
24 Nov 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
181
1
0
22 Nov 2024
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning
Zhi Luo
Xiaoyu Yang
Pan Zhou
D. Wang
AAML
110
0
0
20 Nov 2024
AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents
Kevin Godin-Dubois
Karine Miras
Anna V. Kononova
92
0
0
20 Nov 2024
Preference-Conditioned Gradient Variations for Multi-Objective Quality-Diversity
Hannah Janmohamed
Maxence Faldor
Thomas Pierrot
Antoine Cully
118
1
0
19 Nov 2024
Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning
Ting Zhu
Yue Jin
Jeremie Houssineau
Giovanni Montana
74
0
0
17 Nov 2024
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
Alex Beeson
David Ireland
Giovanni Montana
OffRL
134
2
0
17 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
141
2
0
11 Nov 2024
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Rohit Bokade
Xiaoning Jin
OffRL
194
0
0
10 Nov 2024
Deep Reinforcement Learning for Digital Twin-Oriented Complex Networked Systems
Jiaqi Wen
Bogdan Gabrys
Katarzyna Musial
AI4CE
66
0
0
09 Nov 2024
Structure Matters: Dynamic Policy Gradient
Sara Klein
Xiangyuan Zhang
Tamer Basar
Simon Weissmann
Leif Döring
59
0
0
07 Nov 2024
Harnessing the Power of Gradient-Based Simulations for Multi-Objective Optimization in Particle Accelerators
Kishansingh Rajput
Malachi Schram
Auralee Edelen
Jonathan Colen
Armen Kasparian
Ryan Roussel
Adam Carpenter
He Zhang
Jay Benesch
60
1
0
07 Nov 2024
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
105
0
0
06 Nov 2024
IRS-Enhanced Secure Semantic Communication Networks: Cross-Layer and Context-Awared Resource Allocation
Lingyi Wang
Wei Wu
Fuhui Zhou
Zhijin Qin
Qihui Wu
91
3
0
04 Nov 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
88
1
0
01 Nov 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
OnRL
68
0
0
31 Oct 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
61
0
0
31 Oct 2024
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
Kendong Liu
Zhiyu Zhu
Chuanhao Li
Hui Liu
H. Zeng
Junhui Hou
EGVM
76
4
0
29 Oct 2024
Human-Readable Programs as Actors of Reinforcement Learning Agents Using Critic-Moderated Evolution
Senne Deproost
Denis Steckelmacher
Ann Nowé
69
0
0
29 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
168
0
0
27 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
140
7
0
25 Oct 2024
Leveraging Graph Neural Networks and Multi-Agent Reinforcement Learning for Inventory Control in Supply Chains
Niki Kotecha
Antonio del Rio Chanona
62
2
0
24 Oct 2024
Learning Versatile Skills with Curriculum Masking
Yao Tang
Zhihui Xie
Zichuan Lin
Deheng Ye
Shuai Li
OffRL
73
0
0
23 Oct 2024
Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks
Tao Li
Henger Li
Yunian Pan
Tianyi Xu
Zizhan Zheng
Quanyan Zhu
FedML
76
5
0
22 Oct 2024
Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning
Dongsu Lee
Minhae Kwon
100
1
0
22 Oct 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
107
1
0
22 Oct 2024
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Yanjun Chen
Wei Wei
Xianghui Wang
Zhiqiang Xu
Xiaoyu Shen
Wei Zhang
36
0
0
22 Oct 2024
Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning
Wenqi Bai
Xiaohui Zhang
Shiliang Zhang
Songnan Yang
Yushuai Li
Tingwen Huang
AI4CE
49
2
0
21 Oct 2024
Reinforced Imitative Trajectory Planning for Urban Automated Driving
Di Zeng
Ling Zheng
Xiantong Yang
Yinong Li
55
0
0
21 Oct 2024
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
Zhichao Wang
Xinhai Chen
Chunye Gong
Bo Yang
Liang Deng
Yufei Sun
Yufei Pang
Jie Liu
AI4CE
63
0
0
19 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
138
0
0
19 Oct 2024
Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution
Zijie Zhao
Roy E. Welsch
AIFin
112
1
0
19 Oct 2024
Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments
Mariusz Wisniewski
Paraskevas Chatzithanos
Weisi Guo
Antonios Tsourdos
64
3
0
18 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
72
0
0
17 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
169
19
0
17 Oct 2024
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
Jinzhu Luo
Dingyang Chen
Qi Zhang
OffRL
78
0
0
16 Oct 2024
The State of Robot Motion Generation
Kostas E. Bekris
Joe H. Doerr
Patrick Meng
Sumanth Tangirala
3DV
89
3
0
16 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
44
0
0
15 Oct 2024
Visual Manipulation with Legs
Xialin He
Chengjing Yuan
Wenxuan Zhou
Ruihan Yang
David Held
Xiaolong Wang
116
3
0
15 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
83
1
0
15 Oct 2024
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
Yunho Kim
Jaehyun Park
Heejun Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
81
1
0
15 Oct 2024
Communication-Control Codesign for Large-Scale Wireless Networked Control Systems
Gaoyang Pang
Wanchun Liu
Dusit Niyato
Branka Vucetic
Yonghui Li
AI4CE
84
0
0
15 Oct 2024
Large Language Model Evaluation via Matrix Nuclear-Norm
Yongbin Li
Tingyu Xia
Yi-Ju Chang
Yuan Wu
61
1
0
14 Oct 2024