ResearchTrend.AI
Statistical Linear Estimation with Penalized Estimators: an Application to Reinforcement Learning

27 June 2012
Bernardo Avila-Pires
Csaba Szepesvári
    OffRL
arXiv: 1206.6444

Papers citing "Statistical Linear Estimation with Penalized Estimators: an Application to Reinforcement Learning"

7 / 7 papers shown
Title
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
29
3
0
25 Jul 2023
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
Juan C. Perdomo
A. Krishnamurthy
Peter L. Bartlett
Sham Kakade
OffRL
27
3
0
08 Mar 2022
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
27
269
0
13 Jun 2021
Importance Weight Estimation and Generalization in Domain Adaptation under Label Shift
Kamyar Azizzadenesheli
OOD
29
13
0
29 Nov 2020
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation
Jalaj Bhandari
Daniel Russo
Raghav Singal
16
334
0
06 Jun 2018
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize
Huizhen Yu
17
29
0
23 Nov 2015
Rate of Convergence and Error Bounds for LSTD(λ)
Manel Tagorti
B. Scherrer
33
32
0
13 May 2014