ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.04229
11
14

Learning on the Edge: Online Learning with Stochastic Feedback Graphs

9 October 2022
Emmanuel Esposito
Federico Fusco
Dirk van der Hoeven
Nicolò Cesa-Bianchi
ArXivPDFHTML
Abstract

The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erd\H{o}s-R\ényi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge. We prove nearly optimal regret bounds of order min⁡{min⁡ε(αε/ε)T, min⁡ε(δε/ε)1/3T2/3}\min\bigl\{\min_{\varepsilon} \sqrt{(\alpha_\varepsilon/\varepsilon) T},\, \min_{\varepsilon} (\delta_\varepsilon/\varepsilon)^{1/3} T^{2/3}\bigr\}min{minε​(αε​/ε)T​,minε​(δε​/ε)1/3T2/3} (ignoring logarithmic factors), where αε\alpha_{\varepsilon}αε​ and δε\delta_{\varepsilon}δε​ are graph-theoretic quantities measured on the support of the stochastic feedback graph G\mathcal{G}G with edge probabilities thresholded at ε\varepsilonε. Our result, which holds without any preliminary knowledge about G\mathcal{G}G, requires the learner to observe only the realized out-neighborhood of the chosen action. When the learner is allowed to observe the realization of the entire graph (but only the losses in the out-neighborhood of the chosen action), we derive a more efficient algorithm featuring a dependence on weighted versions of the independence and weak domination numbers that exhibits improved bounds for some special cases.

View on arXiv
Comments on this paper