ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.03932
32
0

Improved Regret of Linear Ensemble Sampling

6 November 2024
Harin Lee
Min-hwan Oh
ArXivPDFHTML
Abstract

In this work, we close the fundamental gap of theory and practice by providing an improved regret bound for linear ensemble sampling. We prove that with an ensemble size logarithmic in TTT, linear ensemble sampling can achieve a frequentist regret bound of O~(d3/2T)\tilde{\mathcal{O}}(d^{3/2}\sqrt{T})O~(d3/2T​), matching state-of-the-art results for randomized linear bandit algorithms, where ddd and TTT are the dimension of the parameter and the time horizon respectively. Our approach introduces a general regret analysis framework for linear bandit algorithms. Additionally, we reveal a significant relationship between linear ensemble sampling and Linear Perturbed-History Exploration (LinPHE), showing that LinPHE is a special case of linear ensemble sampling when the ensemble size equals TTT. This insight allows us to derive a new regret bound of O~(d3/2T)\tilde{\mathcal{O}}(d^{3/2}\sqrt{T})O~(d3/2T​) for LinPHE, independent of the number of arms. Our contributions advance the theoretical foundation of ensemble sampling, bringing its regret bounds in line with the best known bounds for other randomized exploration algorithms.

View on arXiv
@article{lee2025_2411.03932,
  title={ Improved Regret of Linear Ensemble Sampling },
  author={ Harin Lee and Min-hwan Oh },
  journal={arXiv preprint arXiv:2411.03932},
  year={ 2025 }
}
Comments on this paper