
Understanding Self-supervised Contrastive Learning through Supervised Objectives

Main: 9 pages · Appendix: 7 pages · Bibliography: 5 pages · 7 figures · 6 tables
Abstract

Self-supervised representation learning has achieved impressive empirical success, yet its theoretical understanding remains limited. In this work, we provide a theoretical perspective by formulating self-supervised representation learning as an approximation to supervised representation learning objectives. Based on this formulation, we derive a loss function closely related to popular contrastive losses such as InfoNCE, offering insight into their underlying principles. Our derivation naturally introduces the concepts of prototype representation bias and a balanced contrastive loss, which help explain and improve the behavior of self-supervised learning algorithms. We further show how components of our theoretical framework correspond to established practices in contrastive learning. Finally, we empirically validate the effect of balancing positive and negative pair interactions. All theoretical proofs are provided in the appendix, and our code is included in the supplementary material.
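For readers unfamiliar with the InfoNCE loss referenced in the abstract, the sketch below shows the standard SimCLR-style formulation in PyTorch. This is not the loss derived in the paper, nor its balanced variant (whose exact form is only given in the paper itself); the function name, temperature value, and batch layout are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(z_i, z_j, temperature=0.1):
    """Standard InfoNCE loss over a batch of positive pairs (z_i[k], z_j[k]).

    z_i, z_j: (N, D) embeddings of two augmented views of the same N samples.
    For each anchor, the matching view is the positive; all other samples
    in the batch act as negatives.
    """
    z_i = F.normalize(z_i, dim=1)
    z_j = F.normalize(z_j, dim=1)
    # Cosine-similarity logits between every anchor and every candidate view.
    logits = z_i @ z_j.t() / temperature          # shape (N, N)
    # Row k's positive sits on the diagonal (column k).
    targets = torch.arange(z_i.size(0), device=z_i.device)
    return F.cross_entropy(logits, targets)
```

The paper's "balanced" contrastive loss, as described in the abstract, reweights the contribution of positive- versus negative-pair terms; that reweighting is not reproduced in this sketch.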
