Reward Learning as Doubly Nonparametric Bandits: Optimal Design and
  Scaling Laws

Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws

Papers citing "Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws"