ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.3653
  4. Cited By
On Minimax Optimal Offline Policy Evaluation

On Minimax Optimal Offline Policy Evaluation

12 September 2014
Lihong Li
Rémi Munos
Csaba Szepesvári
    OffRL
ArXivPDFHTML

Papers citing "On Minimax Optimal Offline Policy Evaluation"

6 / 6 papers shown
Title
Offline Primal-Dual Reinforcement Learning for Linear MDPs
Offline Primal-Dual Reinforcement Learning for Linear MDPs
Germano Gabbianelli
Gergely Neu
Nneka Okolo
Matteo Papini
OffRL
38
7
0
22 May 2023
Autoregressive Dynamics Models for Offline Policy Evaluation and
  Optimization
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
30
45
0
28 Apr 2021
Asymptotically Optimal Sequential Experimentation Under Generalized
  Ranking
Asymptotically Optimal Sequential Experimentation Under Generalized Ranking
Wesley Cowan
M. Katehakis
OffRL
22
11
0
07 Oct 2015
Asymptotically Optimal Multi-Armed Bandit Policies under a Cost
  Constraint
Asymptotically Optimal Multi-Armed Bandit Policies under a Cost Constraint
A. Burnetas
Odysseas Kanavetas
M. Katehakis
9
15
0
09 Sep 2015
An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
An Asymptotically Optimal Policy for Uniform Bandits of Unknown Support
Wesley Cowan
M. Katehakis
21
27
0
08 May 2015
Normal Bandits of Unknown Means and Variances: Asymptotic Optimality,
  Finite Horizon Regret Bounds, and a Solution to an Open Problem
Normal Bandits of Unknown Means and Variances: Asymptotic Optimality, Finite Horizon Regret Bounds, and a Solution to an Open Problem
Wesley Cowan
Junya Honda
M. Katehakis
24
22
0
22 Apr 2015
1