All You Need to Know About Training Image Retrieval Models

Abstract

Image retrieval is the task of finding the images in a database that are most similar to a given query image. The performance of an image retrieval pipeline depends on many training-time factors, including the embedding model architecture, loss function, data sampler, mining function, learning rate(s), and batch size. In this work, we perform tens of thousands of training runs to understand the effect each of these factors has on retrieval accuracy. We also discover best practices that hold across multiple datasets. The code is available at this https URL.
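To make the task definition concrete, here is a minimal sketch of the retrieval step, assuming each image has already been mapped to an embedding vector by some trained model. The function names (`cosine_similarity`, `retrieve`), the choice of cosine similarity, and the toy embedding values are illustrative assumptions, not taken from the paper or its code.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve(query, database, k=2):
    """Return the indices of the k database embeddings most similar to the query."""
    scores = [cosine_similarity(query, emb) for emb in database]
    ranked = sorted(range(len(database)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Toy 3-dimensional embeddings (made-up values, for illustration only).
database = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
query = [1.0, 0.05, 0.0]

top = retrieve(query, database, k=2)
print(top)
```

In a real pipeline the embeddings would come from the trained model, and the training-time factors listed in the abstract would determine how well this nearest-neighbor ranking matches true image similarity.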

@article{berton2025_2503.13045,
  title={All You Need to Know About Training Image Retrieval Models},
  author={Gabriele Berton and Kevin Musgrave and Carlo Masone},
  journal={arXiv preprint arXiv:2503.13045},
  year={2025}
}