Towards the Influence of Text Quantity on Writer Retrieval

9 June 2025

Main:15 Pages

9 Figures

Bibliography:2 Pages

7 Tables

Abstract

This paper investigates the task of writer retrieval, which identifies documents authored by the same individual within a dataset based on handwriting similarities. While existing datasets and methodologies primarily focus on page level retrieval, we explore the impact of text quantity on writer retrieval performance by evaluating line- and word level retrieval. We examine three state-of-the-art writer retrieval systems, including both handcrafted and deep learning-based approaches, and analyze their performance using varying amounts of text. Our experiments on the CVL and IAM dataset demonstrate that while performance decreases by 20-30% when only one line of text is used as query and gallery, retrieval accuracy remains above 90% of full-page performance when at least four lines are included. We further show that text-dependent retrieval can maintain strong performance in low-text scenarios. Our findings also highlight the limitations of handcrafted features in low-text scenarios, with deep learning-based methods like NetVLAD outperforming traditional VLAD encoding.

View on arXiv

@article{peer2025_2506.07566,
  title={ Towards the Influence of Text Quantity on Writer Retrieval },
  author={ Marco Peer and Robert Sablatnig and Florian Kleber },
  journal={arXiv preprint arXiv:2506.07566},
  year={ 2025 }
}

Comments on this paper