Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.06731
Cited By
Bandits with Partially Observable Confounded Data
11 June 2020
Guy Tennenholtz
Uri Shalit
Shie Mannor
Yonathan Efroni
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bandits with Partially Observable Confounded Data"
4 / 4 papers shown
Title
Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decision
Li Ye
Yishi Lin
Hong Xie
John C. S. Lui
CML
53
11
0
16 Jan 2020
Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Omer Gottesman
Yao Liu
Scott Sussex
Emma Brunskill
Finale Doshi-Velez
OffRL
54
35
0
14 May 2019
Learning through Dialogue Interactions by Asking Questions
Jiwei Li
Alexander H. Miller
S. Chopra
MarcÁurelio Ranzato
Jason Weston
29
55
0
15 Dec 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
150
573
0
04 Apr 2016
1