From raw affiliations to organization identifiers

Accurate affiliation matching, which links affiliation strings to standardized organization identifiers, is critical for improving research metadata quality, facilitating comprehensive bibliometric analyses, and supporting data interoperability across scholarly knowledge bases. Existing approaches fail to handle the complexity of affiliation strings that often include mentions of multiple organizations or extraneous information. In this paper, we present AffRo, a novel approach designed to address these challenges, leveraging advanced parsing and disambiguation techniques. We also introduce AffRoDB, an expert-curated dataset to systematically evaluate affiliation matching algorithms, ensuring robust benchmarking. Results demonstrate the effectiveness of AffRp in accurately identifying organizations from complex affiliation strings.
View on arXiv@article{kallipoliti2025_2505.07577, title={ From raw affiliations to organization identifiers }, author={ Myrto Kallipoliti and Serafeim Chatzopoulos and Miriam Baglioni and Eleni Adamidi and Paris Koloveas and Thanasis Vergoulis }, journal={arXiv preprint arXiv:2505.07577}, year={ 2025 } }