ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.09535
24
192

Learning with Biased Complementary Labels

27 November 2017
Xiyu Yu
Tongliang Liu
Biwei Huang
Dacheng Tao
ArXivPDFHTML
Abstract

In this paper, we study the classification problem in which we have access to easily obtainable surrogate for true labels, namely complementary labels, which specify classes that observations do \textbf{not} belong to. Let YYY and Yˉ\bar{Y}Yˉ be the true and complementary labels, respectively. We first model the annotation of complementary labels via transition probabilities P(Yˉ=i∣Y=j),i≠j∈{1,⋯ ,c}P(\bar{Y}=i|Y=j), i\neq j\in\{1,\cdots,c\}P(Yˉ=i∣Y=j),i=j∈{1,⋯,c}, where ccc is the number of classes. Previous methods implicitly assume that P(Yˉ=i∣Y=j),∀i≠jP(\bar{Y}=i|Y=j), \forall i\neq jP(Yˉ=i∣Y=j),∀i=j, are identical, which is not true in practice because humans are biased toward their own experience. For example, as shown in Figure 1, if an annotator is more familiar with monkeys than prairie dogs when providing complementary labels for meerkats, she is more likely to employ "monkey" as a complementary label. We therefore reason that the transition probabilities will be different. In this paper, we propose a framework that contributes three main innovations to learning with \textbf{biased} complementary labels: (1) It estimates transition probabilities with no bias. (2) It provides a general method to modify traditional loss functions and extends standard deep neural network classifiers to learn with biased complementary labels. (3) It theoretically ensures that the classifier learned with complementary labels converges to the optimal one learned with true labels. Comprehensive experiments on several benchmark datasets validate the superiority of our method to current state-of-the-art methods.

View on arXiv
Comments on this paper