Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1205.4839
Cited By
Off-Policy Actor-Critic
22 May 2012
T. Degris
Martha White
R. Sutton
OffRL
CML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Off-Policy Actor-Critic"
36 / 36 papers shown
Title
Distillation Policy Optimization
Jianfei Ma
OffRL
21
1
0
01 Feb 2023
Improved Policy Optimization for Online Imitation Learning
J. Lavington
Sharan Vaswani
Mark W. Schmidt
OffRL
18
6
0
29 Jul 2022
Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless Networks
Riccardo Marini
Sangwoo Park
Osvaldo Simeone
C. Buratti
37
7
0
13 Jul 2022
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xinyu Wang
Jing Xiao
16
0
0
11 May 2022
TASAC: a twin-actor reinforcement learning framework with stochastic policy for batch process control
Tanuja Joshi
H. Kodamana
Harikumar Kandath
N. Kaisare
OffRL
13
0
0
22 Apr 2022
Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning
Pascal Weber
Daniel Wälchli
Mustafa Zeqiri
Petros Koumoutsakos
CLL
OffRL
10
7
0
24 Mar 2022
Residual Robot Learning for Object-Centric Probabilistic Movement Primitives
João Carvalho
Dorothea Koert
Marek Daniv
Jan Peters
27
8
0
08 Mar 2022
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
19
1
0
04 Feb 2022
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning
Jingqing Ruan
Yali Du
Xuantang Xiong
Dengpeng Xing
Xiyun Li
Linghui Meng
Haifeng Zhang
Jun Wang
Bo Xu
46
29
0
17 Jan 2022
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
24
9
0
24 Nov 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
21
31
0
14 Oct 2021
Deep Reinforcement Learning for Equal Risk Pricing and Hedging under Dynamic Expectile Risk Measures
S. Marzban
Erick Delage
Jonathan Yu-Meng Li
11
4
0
09 Sep 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction
Jiawei Huang
Nan Jiang
11
5
0
02 Jun 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
63
29
0
26 May 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CML
OffRL
71
26
0
18 Feb 2021
Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer
Zohreh Raziei
Mohsen Moghaddam
26
25
0
27 Nov 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
27
174
0
24 Jul 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
14
585
0
16 Jun 2020
Regularized Off-Policy TD-Learning
Bo Liu
Sridhar Mahadevan
Ji Liu
OffRL
15
65
0
06 Jun 2020
State Action Separable Reinforcement Learning
Ziyao Zhang
Liang Ma
K. Leung
Konstantinos Poularakis
M. Srivatsa
28
2
0
05 Jun 2020
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning
G. Novati
Hugues Lascombes de Laroussilhe
Petros Koumoutsakos
AI4CE
18
15
0
18 May 2020
Learning Multi-Agent Coordination through Connectivity-driven Communication
E. Pesce
Giovanni Montana
24
15
0
12 Feb 2020
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
30
34
0
23 Dec 2019
All-Action Policy Gradient Methods: A Numerical Integration Approach
Benjamin Petit
Loren Amdahl-Culleton
Yao Liu
Jimmy T.H. Smith
Pierre-Luc Bacon
13
9
0
21 Oct 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
11
68
0
25 Sep 2019
Deep Active Inference as Variational Policy Gradients
Beren Millidge
BDL
20
103
0
08 Jul 2019
Deep Reinforcement Learning from Policy-Dependent Human Feedback
Dilip Arumugam
Jun Ki Lee
S. Saskin
Michael L. Littman
18
94
0
12 Feb 2019
Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication
E. Pesce
Giovanni Montana
17
71
0
12 Jan 2019
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
17
155
0
06 Jul 2018
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
Keith Ross
11
20
0
29 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
24
11
0
20 May 2018
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
24
37
0
21 Feb 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
21
327
0
19 Feb 2018
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
36
51
0
10 Jan 2018
Off-Policy Shaping Ensembles in Reinforcement Learning
A. Harutyunyan
Tim Brys
Peter Vrancx
A. Nowé
OffRL
39
11
0
21 May 2014
1