A UD Treebank for Bohairic Coptic

Main: 7 pages
2 figures
Bibliography: 3 pages
5 tables
Appendix: 1 page

Abstract

Despite recent advances in digital resources for other Coptic dialects, especially Sahidic, Bohairic Coptic remains critically under-resourced, even though it was the main Coptic dialect of pre-Mamluk, late Byzantine Egypt and is the contemporary language of the Coptic Church. This paper presents and evaluates the first syntactically annotated corpus of Bohairic Coptic, sampling data from a range of works, including Biblical text, saints' lives, and Christian ascetic writing. We also explore some of the main differences we observe compared to the existing UD treebank of Sahidic Coptic, the classical dialect of the language, and conduct joint and cross-dialect parsing experiments, which reveal the unique nature of Bohairic as a variety related to, but distinct from, the more often studied Sahidic.

@article{zeldes2025_2504.18386,
  title={A UD Treebank for Bohairic Coptic},
  author={Amir Zeldes and Nina Speransky and Nicholas Wagner and Caroline T. Schroeder},
  journal={arXiv preprint arXiv:2504.18386},
  year={2025}
}