12
0

Towards the Influence of Text Quantity on Writer Retrieval

Main:15 Pages
9 Figures
Bibliography:2 Pages
7 Tables
Abstract

This paper investigates the task of writer retrieval, which identifies documents authored by the same individual within a dataset based on handwriting similarities. While existing datasets and methodologies primarily focus on page level retrieval, we explore the impact of text quantity on writer retrieval performance by evaluating line- and word level retrieval. We examine three state-of-the-art writer retrieval systems, including both handcrafted and deep learning-based approaches, and analyze their performance using varying amounts of text. Our experiments on the CVL and IAM dataset demonstrate that while performance decreases by 20-30% when only one line of text is used as query and gallery, retrieval accuracy remains above 90% of full-page performance when at least four lines are included. We further show that text-dependent retrieval can maintain strong performance in low-text scenarios. Our findings also highlight the limitations of handcrafted features in low-text scenarios, with deep learning-based methods like NetVLAD outperforming traditional VLAD encoding.

View on arXiv
@article{peer2025_2506.07566,
  title={ Towards the Influence of Text Quantity on Writer Retrieval },
  author={ Marco Peer and Robert Sablatnig and Florian Kleber },
  journal={arXiv preprint arXiv:2506.07566},
  year={ 2025 }
}
Comments on this paper