ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.06731
  4. Cited By
Bandits with Partially Observable Confounded Data

Bandits with Partially Observable Confounded Data

11 June 2020
Guy Tennenholtz
Uri Shalit
Shie Mannor
Yonathan Efroni
    OffRL
ArXivPDFHTML

Papers citing "Bandits with Partially Observable Confounded Data"

5 / 5 papers shown
Title
Combining Offline Causal Inference and Online Bandit Learning for Data
  Driven Decision
Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decision
Li Ye
Yishi Lin
Hong Xie
John C. S. Lui
CML
55
11
0
16 Jan 2020
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal
  Models
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
Michael Oberst
David Sontag
CML
OffRL
43
170
0
14 May 2019
Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Omer Gottesman
Yao Liu
Scott Sussex
Emma Brunskill
Finale Doshi-Velez
OffRL
59
35
0
14 May 2019
Learning through Dialogue Interactions by Asking Questions
Learning through Dialogue Interactions by Asking Questions
Jiwei Li
Alexander H. Miller
S. Chopra
MarcÁurelio Ranzato
Jason Weston
40
55
0
15 Dec 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
212
573
0
04 Apr 2016
1