Regularized Off-Policy TD-Learning

6 June 2020
Bo Liu
Sridhar Mahadevan
Ji Liu
    OffRL
arXiv: 2006.05314

Papers citing "Regularized Off-Policy TD-Learning"

10 / 10 papers shown
Does Sparsity Help in Learning Misspecified Linear Bandits?
Jialin Dong
Lin F. Yang
25
1
0
29 Mar 2023
Relative Sparsity for Medical Decision Problems
Samuel J. Weisenthal
Sally W. Thurston
Ashkan Ertefaie
27
2
0
29 Nov 2022
Gradient Descent Temporal Difference-difference Learning
Rong Zhu
James M. Murray
OffRL
16
1
0
10 Sep 2022
Causal Inference in Network Economics
Sridhar Mahadevan
CML
26
6
0
20 Sep 2021
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
19
71
0
04 Jul 2020
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
OffRL
13
94
0
17 Apr 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
26
130
0
22 Dec 2017
Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning
Yichen Chen
Mengdi Wang
24
64
0
08 Dec 2016
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize
Huizhen Yu
17
29
0
23 Nov 2015
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012