ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.05622
  4. Cited By
Regret Analysis for Hierarchical Experts Bandit Problem

Regret Analysis for Hierarchical Experts Bandit Problem

11 August 2022
Qihan Guo
Siwei Wang
Jun Zhu
ArXiv (abs)PDFHTML

Papers citing "Regret Analysis for Hierarchical Experts Bandit Problem"

1 / 1 papers shown
Title
On-line Policy Improvement using Monte-Carlo Search
On-line Policy Improvement using Monte-Carlo SearchNeural Information Processing Systems (NeurIPS), 1996
Gerald Tesauro
Gregory R. Galperin
374
275
0
09 Jan 2025
1