Homa at SemEval-2025 Task 5: Aligning Librarian Records with OntoAligner for Subject Tagging

This paper presents our system, Homa, for SemEval-2025 Task 5: Subject Tagging, which focuses on automatically assigning subject labels to technical records from TIBKAT using the Gemeinsame Normdatei (GND) taxonomy. We leverage OntoAligner, a modular ontology alignment toolkit, to address this task by integrating retrieval-augmented generation (RAG) techniques. Our approach formulates the subject tagging problem as an alignment task, where records are matched to GND categories based on semantic similarity. We evaluate OntoAligner's adaptability for subject indexing and analyze its effectiveness in handling multilingual records. Experimental results demonstrate the strengths and limitations of this method, highlighting the potential of alignment techniques for improving subject tagging in digital libraries.
View on arXiv@article{tekanlou2025_2504.21474, title={ Homa at SemEval-2025 Task 5: Aligning Librarian Records with OntoAligner for Subject Tagging }, author={ Hadi Bayrami Asl Tekanlou and Jafar Razmara and Mahsa Sanaei and Mostafa Rahgouy and Hamed Babaei Giglou }, journal={arXiv preprint arXiv:2504.21474}, year={ 2025 } }