28

Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching

Wonseok Choi
Sohwi Lim
Nam Hyeon-Woo
Moon Ye-Bin
Dong-Ju Jeong
Jinyoung Hwang
Tae-Hyun Oh
Main:8 Pages
18 Figures
Bibliography:2 Pages
10 Tables
Appendix:14 Pages
Abstract

Instance-level image retrieval aims to find images containing the same object as a given query, despite variations in size, position, or appearance. To address this challenging task, we propose Patchify, a simple yet effective patch-wise retrieval framework that offers high performance, scalability, and interpretability without requiring fine-tuning. Patchify divides each database image into a small number of structured patches and performs retrieval by comparing these local features with a global query descriptor, enabling accurate and spatially grounded matching. To assess not just retrieval accuracy but also spatial correctness, we introduce LocScore, a localization-aware metric that quantifies whether the retrieved region aligns with the target object. This makes LocScore a valuable diagnostic tool for understanding and improving retrieval behavior. We conduct extensive experiments across multiple benchmarks, backbones, and region selection strategies, showing that Patchify outperforms global methods and complements state-of-the-art reranking pipelines. Furthermore, we apply Product Quantization for efficient large-scale retrieval and highlight the importance of using informative features during compression, which significantly boosts performance. Project website:this https URL

View on arXiv
Comments on this paper