ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1503.02834
  4. Cited By
Doubly Robust Policy Evaluation and Optimization

Doubly Robust Policy Evaluation and Optimization

10 March 2015
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
    OffRL
ArXivPDFHTML

Papers citing "Doubly Robust Policy Evaluation and Optimization"

13 / 63 papers shown
Title
Double Reinforcement Learning for Efficient Off-Policy Evaluation in
  Markov Decision Processes
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
43
183
0
22 Aug 2019
Doubly-Robust Lasso Bandit
Doubly-Robust Lasso Bandit
Gi-Soo Kim
M. Paik
24
61
0
26 Jul 2019
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for
  Reinforcement Learning
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
24
54
0
09 Jun 2019
Balanced off-policy evaluation in general action spaces
Balanced off-policy evaluation in general action spaces
A. Sondhi
David Arbour
Drew Dimmery
OffRL
29
17
0
09 Jun 2019
Learning When-to-Treat Policies
Learning When-to-Treat Policies
Xinkun Nie
Emma Brunskill
Stefan Wager
CML
OffRL
26
89
0
23 May 2019
Interval Estimation of Individual-Level Causal Effects Under Unobserved
  Confounding
Interval Estimation of Individual-Level Causal Effects Under Unobserved Confounding
Nathan Kallus
Xiaojie Mao
Angela Zhou
CML
22
91
0
05 Oct 2018
Confounding-Robust Policy Improvement
Confounding-Robust Policy Improvement
Nathan Kallus
Angela Zhou
CML
OffRL
40
152
0
22 May 2018
Policy Evaluation and Optimization with Continuous Treatments
Policy Evaluation and Optimization with Continuous Treatments
Nathan Kallus
Angela Zhou
OffRL
11
132
0
16 Feb 2018
Estimation Considerations in Contextual Bandits
Estimation Considerations in Contextual Bandits
Maria Dimakopoulou
Zhengyuan Zhou
Susan Athey
Guido Imbens
32
69
0
19 Nov 2017
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Yu Wang
Alekh Agarwal
Miroslav Dudík
OffRL
24
220
0
04 Dec 2016
Off-policy evaluation for slate recommendation
Off-policy evaluation for slate recommendation
Adith Swaminathan
A. Krishnamurthy
Alekh Agarwal
Miroslav Dudík
John Langford
Damien Jose
I. Zitouni
CML
OffRL
13
225
0
16 May 2016
Compatible Value Gradients for Reinforcement Learning of Continuous Deep
  Policies
Compatible Value Gradients for Reinforcement Learning of Continuous Deep Policies
David Balduzzi
Muhammad Ghifary
26
33
0
10 Sep 2015
A Survey on Contextual Multi-armed Bandits
A Survey on Contextual Multi-armed Bandits
Li Zhou
20
124
0
13 Aug 2015
Previous
12