Hierarchical Bracketing Encodings for Dependency Parsing as Tagging

Abstract
We present a family of encodings for dependency parsing as sequence labeling, based on the concept of hierarchical bracketing. We prove that the existing 4-bit projective encoding belongs to this family but is suboptimal in the number of labels used to encode a tree. We derive an optimal hierarchical bracketing, which minimizes the number of symbols used and encodes projective trees with only 12 distinct labels (vs. 16 for the 4-bit encoding). We also extend optimal hierarchical bracketing to support arbitrary non-projectivity more compactly than previous encodings. Our new encodings yield competitive accuracy on a diverse set of treebanks.
@article{ezquerro2025_2505.11693,
  title={Hierarchical Bracketing Encodings for Dependency Parsing as Tagging},
  author={Ana Ezquerro and David Vilares and Anssi Yli-Jyrä and Carlos Gómez-Rodríguez},
  journal={arXiv preprint arXiv:2505.11693},
  year={2025}
}