CODE-ACCORD: A Corpus of building regulatory data for rule generation towards automatic compliance checking

4 March 2024
Hansi Hettiarachchi
Amna Dridi
Mohamed Medhat Gaber
Pouyan Parsafard
Nicoleta Bocaneala
Katja Breitenfelder
Gonçal Costa
Maria Hedblom
Mihaela Juganaru-Mathieu
Thamer Mecharnia
Sumee Park
He Tan
Abdel-Rahman H. Tawil
Edlira Vakaj
Abstract

Automatic Compliance Checking (ACC) within the Architecture, Engineering, and Construction (AEC) sector necessitates automating the interpretation of building regulations to achieve its full potential. Converting textual rules into machine-readable formats is challenging due to the complexities of natural language and the scarcity of resources for advanced Machine Learning (ML). Addressing these challenges, we introduce CODE-ACCORD, a dataset of 862 sentences from the building regulations of England and Finland. Only the self-contained sentences, which express complete rules without needing additional context, were considered as they are essential for ACC. Each sentence was manually annotated with entities and relations by a team of 12 annotators to facilitate machine-readable rule generation, followed by careful curation to ensure accuracy. The final dataset comprises 4,297 entities and 4,329 relations across various categories, serving as a robust ground truth. CODE-ACCORD supports a range of ML and Natural Language Processing (NLP) tasks, including text classification, entity recognition, and relation extraction. It enables applying recent trends, such as deep neural networks and large language models, to ACC.
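
To make the annotation scheme described above more concrete, the following is a minimal sketch of how one sentence annotated with entities and relations could be held in memory. It is an illustration only and not the dataset's actual file format or API: the class names, the labels `PROPERTY`, `VALUE`, and `GREATER_EQUAL`, and the example sentence are all hypothetical and are not taken from CODE-ACCORD.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    start: int   # character offset where the span begins (inclusive)
    end: int     # character offset where the span ends (exclusive)
    label: str   # hypothetical entity category


@dataclass(frozen=True)
class Relation:
    head: int    # index of the head entity in AnnotatedSentence.entities
    tail: int    # index of the tail entity in AnnotatedSentence.entities
    label: str   # hypothetical relation category


@dataclass
class AnnotatedSentence:
    text: str
    entities: list = field(default_factory=list)
    relations: list = field(default_factory=list)

    def add_entity(self, surface, label):
        # Locate the first occurrence of the surface string and store its span.
        start = self.text.index(surface)
        self.entities.append(Entity(start, start + len(surface), label))
        return len(self.entities) - 1

    def add_relation(self, head, tail, label):
        self.relations.append(Relation(head, tail, label))

    def triples(self):
        # Render each relation as (head text, relation label, tail text).
        def span(i):
            e = self.entities[i]
            return self.text[e.start:e.end]
        return [(span(r.head), r.label, span(r.tail)) for r in self.relations]


# Made-up sentence for illustration; it is not drawn from the corpus.
sentence = AnnotatedSentence("The door width shall be at least 800 mm.")
prop = sentence.add_entity("door width", "PROPERTY")
val = sentence.add_entity("800 mm", "VALUE")
sentence.add_relation(prop, val, "GREATER_EQUAL")
print(sentence.triples())
```

Storing relations as index pairs over the entity list keeps each relation tied to exact character spans, which is the usual shape of input for entity recognition and relation extraction models.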

@article{hettiarachchi2025_2403.02231,
  title={CODE-ACCORD: A Corpus of building regulatory data for rule generation towards automatic compliance checking},
  author={Hansi Hettiarachchi and Amna Dridi and Mohamed Medhat Gaber and Pouyan Parsafard and Nicoleta Bocaneala and Katja Breitenfelder and Gonçal Costa and Maria Hedblom and Mihaela Juganaru-Mathieu and Thamer Mecharnia and Sumee Park and He Tan and Abdel-Rahman H. Tawil and Edlira Vakaj},
  journal={arXiv preprint arXiv:2403.02231},
  year={2025}
}