Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions

Papers citing "Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions"

Title