Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy

23 October 2020
Masahiro Kato
Yusuke Kaneko
    OffRL

Papers citing "Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy"

No citing papers are listed.