ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.15662
31
0

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

22 July 2024
Odalric-Ambrym Maillard
M. S. Talebi
ArXivPDFHTML
Abstract

We consider the situation when a learner faces a set of unknown discrete distributions (pk)k∈K(p_k)_{k\in \mathcal K}(pk​)k∈K​ defined over a common alphabet X\mathcal XX, and can build for each distribution pkp_kpk​ an individual high-probability confidence set thanks to nkn_knk​ observations sampled from pkp_kpk​. The set (pk)k∈K(p_k)_{k\in \mathcal K}(pk​)k∈K​ is structured: each distribution pkp_kpk​ is obtained from the same common, but unknown, distribution q via applying an unknown permutation to X\mathcal XX. We call this \emph{permutation-equivalence}. The goal is to build refined confidence sets \emph{exploiting} this structural property. Like other popular notions of structure (Lipschitz smoothness, Linearity, etc.) permutation-equivalence naturally appears in machine learning problems, and to benefit from its potential gain calls for a specific approach. We present a strategy to effectively exploit permutation-equivalence, and provide a finite-time high-probability bound on the size of the refined confidence sets output by the strategy. Since a refinement is not possible for too few observations in general, under mild technical assumptions, our finite-time analysis establish when the number of observations (nk)k∈K(n_k)_{k\in \mathcal K}(nk​)k∈K​ are large enough so that the output confidence sets improve over initial individual sets. We carefully characterize this event and the corresponding improvement. Further, our result implies that the size of confidence sets shrink at asymptotic rates of O(1/∑k∈Knk)O(1/\sqrt{\sum_{k\in \mathcal K} n_k})O(1/∑k∈K​nk​​) and O(1/max⁡k∈Knk)O(1/\max_{k\in K} n_{k})O(1/maxk∈K​nk​), respectively for elements inside and outside the support of q, when the size of each individual confidence set shrinks at respective rates of O(1/nk)O(1/\sqrt{n_k})O(1/nk​​) and O(1/nk)O(1/n_k)O(1/nk​). We illustrate the practical benefit of exploiting permutation equivalence on a reinforcement learning task.

View on arXiv
Comments on this paper