Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation

10 March 2025
Xingye Fan
Zhongwen Zhang
Yuri Boykov
Abstract

This paper demonstrates a surprising result for segmentation with image-level targets: extending binary class tags to approximate relative object-size distributions allows off-the-shelf architectures to solve the segmentation problem. A straightforward zero-avoiding KL-divergence loss for average predictions produces segmentation accuracy comparable to that of standard pixel-precise supervision with full ground-truth masks. In contrast, current results based on class tags typically require complex non-reproducible architectural modifications and specialized multi-stage training procedures. Our ideas are validated on PASCAL VOC using our new human annotations of approximate object sizes. We also show results on COCO and medical data using synthetically corrupted size targets. All standard networks demonstrate robustness to errors in the size targets. For some classes, the validation accuracy is significantly better than with pixel-level supervision; the latter is not robust to errors in the masks. Our work provides new ideas and insights on image-level supervision in segmentation and may encourage other simple general solutions to the problem.
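As a rough illustration of the loss named in the abstract, the sketch below computes a KL divergence between an approximate relative size target and the average of per-pixel softmax predictions for one image. It is not the authors' implementation. The direction KL(target || average prediction) is an assumption, chosen because that direction is the one usually called zero-avoiding, and the function names, the `eps` clamp, and the toy inputs are all hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def average_prediction(pixel_logits):
    """Mean of the per-pixel class distributions over all pixels of an image."""
    probs = [softmax(row) for row in pixel_logits]
    n_classes = len(probs[0])
    return [sum(p[k] for p in probs) / len(probs) for k in range(n_classes)]

def size_target_kl(size_target, avg_pred, eps=1e-8):
    """KL(size_target || avg_pred), assumed direction (see lead-in).

    Classes with zero target mass contribute nothing; a class with
    positive target mass but near-zero predicted mass is penalized heavily,
    which is the zero-avoiding behavior.
    """
    return sum(
        v * math.log(v / max(s, eps))
        for v, s in zip(size_target, avg_pred)
        if v > 0
    )

# Toy image: 4 pixels, 3 classes (hypothetical logits).
pixel_logits = [
    [2.0, 0.0, 0.0],
    [2.0, 0.0, 0.0],
    [0.0, 2.0, 0.0],
    [0.0, 0.0, 2.0],
]
# Hypothetical annotator estimate: class 0 covers about half the image.
size_target = [0.5, 0.25, 0.25]
loss = size_target_kl(size_target, average_prediction(pixel_logits))
```

In a training setup this quantity would be computed per image with a differentiable framework and minimized with respect to the network weights; the pure-Python version here only shows the arithmetic.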

@article{fan2025_2503.06954,
  title={Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation},
  author={Xingye Fan and Zhongwen Zhang and Yuri Boykov},
  journal={arXiv preprint arXiv:2503.06954},
  year={2025}
}