From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model

1 March 2019
Aadirupa Saha, Aditya Gopalan
arXiv:1903.00558

Abstract

We consider PAC-learning a good item from $k$-subsetwise feedback information sampled from a Plackett-Luce probability model, with instance-dependent sample complexity performance. In the setting where subsets of a fixed size can be tested and top-ranked feedback is made available to the learner, we give an algorithm with optimal instance-dependent sample complexity, for PAC best arm identification, of $O\Big(\frac{\theta_{[k]}}{k}\sum_{i=2}^{n}\max\big(1,\frac{1}{\Delta_i^2}\big)\ln\frac{k}{\delta}\big(\ln\frac{1}{\Delta_i}\big)\Big)$, where $\Delta_i$ is the Plackett-Luce parameter gap between the best and the $i^{th}$ best item, and $\theta_{[k]}$ is the sum of the Plackett-Luce parameters for the top-$k$ items. The algorithm is based on a wrapper around a PAC winner-finding algorithm with weaker performance guarantees, which adapts to the hardness of the input instance. The sample complexity is also shown to be multiplicatively better depending on the length of rank-ordered feedback available in each subset-wise play. We show optimality of our algorithms with matching sample complexity lower bounds. We next address the winner-finding problem in Plackett-Luce models in the fixed-budget setting, with instance-dependent upper and lower bounds on the misidentification probability of $\Omega\big(\exp(-2\tilde{\Delta} Q)\big)$ for a given budget $Q$, where $\tilde{\Delta}$ is an explicit instance-dependent problem complexity parameter. Numerical performance results are also reported.
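
The feedback model and the quantities appearing in the bound above can be made concrete with a short sketch. The Python below is an illustration only, not the paper's algorithm: the instance `theta`, the subset, and the helper names `pl_winner` and `pl_top_m` are hypothetical. It samples winner (top-1) and rank-ordered top-$m$ feedback from a Plackett-Luce model over a played subset, and evaluates the instance-dependent expression $\frac{\theta_{[k]}}{k}\sum_{i=2}^{n}\max\big(1,\frac{1}{\Delta_i^2}\big)\ln\frac{k}{\delta}\ln\frac{1}{\Delta_i}$ with constants suppressed.

```python
import numpy as np

# A minimal sketch of the feedback model described in the abstract, NOT the
# paper's algorithm: under a Plackett-Luce (PL) model with positive item
# parameters theta, the winner of a played subset S is item i in S with
# probability theta_i / sum_{j in S} theta_j, and top-m rank-ordered feedback
# is obtained by drawing winners sequentially without replacement.

def pl_winner(theta, subset, rng):
    """Sample top-1 (winner) feedback for a played subset under the PL model."""
    w = theta[subset]
    return subset[rng.choice(len(subset), p=w / w.sum())]

def pl_top_m(theta, subset, m, rng):
    """Sample rank-ordered top-m feedback by repeated PL winner draws."""
    remaining = list(subset)
    ranking = []
    for _ in range(m):
        w = theta[remaining]
        idx = rng.choice(len(remaining), p=w / w.sum())
        ranking.append(remaining.pop(idx))
    return ranking

rng = np.random.default_rng(0)

# Hypothetical instance: n items, item 0 has the largest PL parameter.
n, k, delta = 10, 4, 0.05
theta = np.array([1.0] + [1.0 / (1.5 + i) for i in range(n - 1)])

subset = np.array([0, 3, 5, 7])  # one played k-subset (k = 4)
print("winner feedback:", pl_winner(theta, subset, rng))
print("top-2 feedback :", pl_top_m(theta, subset, 2, rng))

# Instance-dependent quantities from the abstract's bound:
#   Delta_i   = gap between the best and the i-th best PL parameter,
#   theta_[k] = sum of the top-k PL parameters.
# Evaluating the O(.) expression with constants suppressed:
sorted_theta = np.sort(theta)[::-1]
gaps = sorted_theta[0] - sorted_theta[1:]          # Delta_2, ..., Delta_n
theta_top_k = sorted_theta[:k].sum()
bound = (theta_top_k / k) * np.sum(
    np.maximum(1.0, 1.0 / gaps**2) * np.log(k / delta) * np.log(1.0 / gaps)
)
print("sample-complexity scale:", bound)
```

On this toy instance the gap-dependent terms $\max(1, 1/\Delta_i^2)$ dominate when the best item's parameter is only marginally larger than the runners-up, which is exactly the notion of instance hardness the wrapper algorithm in the abstract adapts to.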
