ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.03545
132
3
v1v2 (latest)

Learning Multiclass Classifier Under Noisy Bandit Feedback

5 June 2020
Mudit Agarwal
Naresh Manwani
    NoLa
ArXiv (abs)PDFHTML
Abstract

This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero probability. We propose a novel approach to deal with noisy bandit feedback based on the unbiased estimator technique. We further offer a method that can efficiently estimate the noise rates, thus providing an end-to-end framework. The proposed algorithm enjoys a mistake bound of the order of O(T)O(\sqrt{T})O(T​) in the high noise case and of the order of O(T\nicefrac23)O(T^{\nicefrac{2}{3}})O(T\nicefrac23) in the worst case. We show our approach's effectiveness using extensive experiments on several benchmark datasets.

View on arXiv
Comments on this paper