Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.05314
Cited By
Regularized Off-Policy TD-Learning
6 June 2020
Bo Liu
Sridhar Mahadevan
Ji Liu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regularized Off-Policy TD-Learning"
10 / 10 papers shown
Title
Does Sparsity Help in Learning Misspecified Linear Bandits?
Jialin Dong
Lin F. Yang
25
1
0
29 Mar 2023
Relative Sparsity for Medical Decision Problems
Samuel J. Weisenthal
Sally W. Thurston
Ashkan Ertefaie
27
2
0
29 Nov 2022
Gradient Descent Temporal Difference-difference Learning
Rong Zhu
James M. Murray
OffRL
16
1
0
10 Sep 2022
Causal Inference in Network Economics
Sridhar Mahadevan
CML
26
6
0
20 Sep 2021
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
19
71
0
04 Jul 2020
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
OffRL
13
94
0
17 Apr 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
26
130
0
22 Dec 2017
Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning
Yichen Chen
Mengdi Wang
24
64
0
08 Dec 2016
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize
Huizhen Yu
17
29
0
23 Nov 2015
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012
1