Doubly Robust Off-policy Value Evaluation for Reinforcement Learning


11 November 2015
Nan Jiang
Lihong Li
    OffRL

Papers citing "Doubly Robust Off-policy Value Evaluation for Reinforcement Learning"

13 / 163 papers shown
Can Neural Machine Translation be Improved with User Feedback?
Julia Kreutzer
Shahram Khadivi
E. Matusov
Stefan Riezler
19
93
0
16 Apr 2018
Active Learning with Logged Data
Songbai Yan
Kamalika Chaudhuri
T. Javidi
34
27
0
25 Feb 2018
Learning Optimal Policies from Observational Data
Onur Atan
W. Zame
M. Schaar
CML
OOD
OffRL
24
18
0
23 Feb 2018
More Robust Doubly Robust Off-policy Evaluation
Mehrdad Farajtabar
Yinlam Chow
Mohammad Ghavamzadeh
OffRL
17
264
0
10 Feb 2018
Estimation Considerations in Contextual Bandits
Maria Dimakopoulou
Zhengyuan Zhou
Susan Athey
Guido Imbens
34
69
0
19 Nov 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
61
1,302
0
30 May 2017
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach
Aniruddh Raghu
Matthieu Komorowski
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
21
192
0
23 May 2017
Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation
Z. Guo
Philip S. Thomas
Emma Brunskill
OffRL
23
2
0
09 Mar 2017
Multi-step Off-policy Learning Without Importance Sampling Ratios
A. R. Mahmood
Huizhen Yu
R. Sutton
OffRL
24
54
0
09 Feb 2017
Constructing Effective Personalized Policies Using Counterfactual Inference from Biased Data Sets with Many Features
Onur Atan
W. Zame
Qiaojun Feng
M. Schaar
OffRL
CML
22
12
0
23 Dec 2016
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Yu Wang
Alekh Agarwal
Miroslav Dudík
OffRL
24
220
0
04 Dec 2016
Importance Sampling with Unequal Support
Philip S. Thomas
Emma Brunskill
13
14
0
10 Nov 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
11
567
0
04 Apr 2016