Bandits with Partially Observable Confounded Data

11 June 2020

Papers citing "Bandits with Partially Observable Confounded Data"

5 / 5 papers shown

Title
Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decision Li Ye Yishi Lin Hong Xie John C. S. Lui CML 55 11 0 16 Jan 2020
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models Michael Oberst David Sontag CML OffRL 43 170 0 14 May 2019
Combining Parametric and Nonparametric Models for Off-Policy Evaluation Omer Gottesman Yao Liu Scott Sussex Emma Brunskill Finale Doshi-Velez OffRL 59 35 0 14 May 2019
Learning through Dialogue Interactions by Asking Questions Jiwei Li Alexander H. Miller S. Chopra MarcÁurelio Ranzato Jason Weston 40 55 0 15 Dec 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning Philip S. Thomas Emma Brunskill OffRL 212 573 0 04 Apr 2016