Parameter-Efficient Fine-Tuning of DINOv2 for Large-Scale Font Classification

14 February 2026

Daniel Chen

Zaria Zinn

Marcus Lowe

Main: 8 pages; 7 figures; Bibliography: 1 page; 3 tables; Appendix: 7 pages

Abstract

We introduce GoogleFontsBench, the first public benchmark for classifying open-source web fonts, addressing a gap left by existing benchmarks that cover only commercial typefaces. GoogleFontsBench comprises 394 font variants across 32 Google Fonts families, a reproducible synthetic data generation pipeline (~575 images per variant, ~226K total), and a typographically grounded evaluation metric (SWER) that weights errors by visual severity. We establish baselines using six fine-tuning strategies on a DINOv2 Vision Transformer backbone. Parameter-efficient adaptation with LoRA achieves 99.0% top-1 accuracy while training only 1% of the model's 87.2M parameters, with errors 140x less severe than random guessing. We release the benchmark, all trained models, and the full training pipeline as open-source resources.
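As a rough sanity check on the figures quoted above, the following sketch reproduces the dataset-size arithmetic and shows how a LoRA setup could plausibly land near 1% trainable parameters. The hidden width (768), layer count (12), choice of two adapted projections per layer, rank (16), and the linear classification head are all assumptions made for illustration; the abstract does not state them, and the paper's actual configuration may differ.

```python
def lora_adapter_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters added by one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


# Figures quoted in the abstract.
NUM_VARIANTS = 394
IMAGES_PER_VARIANT = 575       # approximate ("~575")
TOTAL_PARAMS = 87_200_000      # "87.2M parameters"

# Hypothetical configuration, NOT taken from the paper.
HIDDEN_DIM = 768               # assumed ViT-B-sized backbone width
NUM_LAYERS = 12                # assumed number of transformer blocks
ADAPTED_PER_LAYER = 2          # assumed: e.g. query and value projections
RANK = 16                      # assumed LoRA rank

# Dataset size implied by the per-variant image count.
dataset_size = NUM_VARIANTS * IMAGES_PER_VARIANT

# Trainable parameters: LoRA adapters plus an assumed linear head (weights + biases).
adapter_params = NUM_LAYERS * ADAPTED_PER_LAYER * lora_adapter_params(
    HIDDEN_DIM, HIDDEN_DIM, RANK
)
head_params = HIDDEN_DIM * NUM_VARIANTS + NUM_VARIANTS
trainable = adapter_params + head_params
fraction = trainable / TOTAL_PARAMS

print(f"images in dataset:    {dataset_size:,}")
print(f"LoRA adapter params:  {adapter_params:,}")
print(f"classifier head:      {head_params:,}")
print(f"trainable fraction:   {fraction:.2%}")
```

Under these assumed settings the trainable share comes out close to the 1% reported in the abstract, and 394 variants at about 575 images each is consistent with the stated total of roughly 226K images.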