ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1103.4601
  4. Cited By
Doubly Robust Policy Evaluation and Learning

Doubly Robust Policy Evaluation and Learning

23 March 2011
Miroslav Dudík
John Langford
Lihong Li
    OffRL
ArXivPDFHTML

Papers citing "Doubly Robust Policy Evaluation and Learning"

13 / 13 papers shown
Title
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
116
0
0
02 May 2025
Prompt Optimization with Logged Bandit Data
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
130
0
0
03 Apr 2025
Automatic debiasing of neural networks via moment-constrained learning
Automatic debiasing of neural networks via moment-constrained learning
Christian L. Hines
Oliver J. Hines
CML
OOD
54
0
0
29 Sep 2024
A Causal Framework for Evaluating Deferring Systems
A Causal Framework for Evaluating Deferring Systems
Filippo Palomba
Andrea Pugnana
Jose M. Alvarez
Salvatore Ruggieri
CML
69
4
0
29 May 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
78
5
0
22 Feb 2024
Evaluation of Active Feature Acquisition Methods for Time-varying Feature Settings
Evaluation of Active Feature Acquisition Methods for Time-varying Feature Settings
Henrik von Kleist
Alireza Zamanian
I. Shpitser
Narges Ahmidi
OffRL
124
2
0
03 Dec 2023
Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment
Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment
Eli Ben-Michael
D. J. Greiner
Kosuke Imai
Zhichao Jiang
OffRL
106
22
0
22 Sep 2021
Off-policy Bandits with Deficient Support
Off-policy Bandits with Deficient Support
Noveen Sachdeva
Yi-Hsun Su
Thorsten Joachims
OffRL
98
75
0
16 Jun 2020
Large-scale Causal Approaches to Debiasing Post-click Conversion Rate
  Estimation with Multi-task Learning
Large-scale Causal Approaches to Debiasing Post-click Conversion Rate Estimation with Multi-task Learning
Wenhao Zhang
Wentian Bao
Xiao-Yang Liu
Keping Yang
Quan Lin
Hong Wen
Ramin Ramezani
CML
49
104
0
16 Oct 2019
Balanced Policy Evaluation and Learning
Balanced Policy Evaluation and Learning
Nathan Kallus
CML
OffRL
188
141
0
21 May 2017
Policy Learning with Observational Data
Policy Learning with Observational Data
Susan Athey
Stefan Wager
CML
OffRL
170
182
0
09 Feb 2017
Learning from Logged Implicit Exploration Data
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
90
254
0
27 Feb 2010
The Offset Tree for Learning with Partial Labels
The Offset Tree for Learning with Partial Labels
A. Beygelzimer
John Langford
82
184
0
21 Dec 2008
1