Retro: Reusing teacher projection head for efficient embedding
  distillation on Lightweight Models via Self-supervised Learning

Retro: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning

Papers citing "Retro: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning"

22 / 22 papers shown
Title

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.