2006.06731
Bandits with Partially Observable Confounded Data
11 June 2020
Guy Tennenholtz
Uri Shalit
Shie Mannor
Yonathan Efroni
OffRL
Papers citing
"Bandits with Partially Observable Confounded Data"
5 / 5 papers shown
Title
Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decision
Li Ye
Yishi Lin
Hong Xie
John C. S. Lui
CML
55
11
0
16 Jan 2020
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
Michael Oberst
David Sontag
CML
OffRL
43
170
0
14 May 2019
Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Omer Gottesman
Yao Liu
Scott Sussex
Emma Brunskill
Finale Doshi-Velez
OffRL
59
35
0
14 May 2019
Learning through Dialogue Interactions by Asking Questions
Jiwei Li
Alexander H. Miller
S. Chopra
Marc'Aurelio Ranzato
Jason Weston
40
55
0
15 Dec 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
212
573
0
04 Apr 2016