ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.13390
21
0
v1v2 (latest)

Experimental Design for Semiparametric Bandits

16 June 2025
Seok-Jin Kim
Gi-Soo Kim
Min-hwan Oh
ArXiv (abs)PDFHTML
Main:14 Pages
4 Figures
Bibliography:2 Pages
Appendix:22 Pages
Abstract

We study finite-armed semiparametric bandits, where each arm's reward combines a linear component with an unknown, potentially adversarial shift. This model strictly generalizes classical linear bandits and reflects complexities common in practice. We propose the first experimental-design approach that simultaneously offers a sharp regret bound, a PAC bound, and a best-arm identification guarantee. Our method attains the minimax regret O~(dT)\tilde{O}(\sqrt{dT})O~(dT​), matching the known lower bound for finite-armed linear bandits, and further achieves logarithmic regret under a positive suboptimality gap condition. These guarantees follow from our refined non-asymptotic analysis of orthogonalized regression that attains the optimal d\sqrt{d}d​ rate, paving the way for robust and efficient learning across a broad class of semiparametric bandit problems.

View on arXiv
@article{kim2025_2506.13390,
  title={ Experimental Design for Semiparametric Bandits },
  author={ Seok-Jin Kim and Gi-Soo Kim and Min-hwan Oh },
  journal={arXiv preprint arXiv:2506.13390},
  year={ 2025 }
}
Comments on this paper