ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.12509
52
0

LegalCore: A Dataset for Event Coreference Resolution in Legal Documents

18 February 2025
Kangda Wei
Xi Shi
Jonathan Tong
Sai Ramana Reddy
Anandhavelu Natarajan
R. Jain
Aparna Garimella
Ruihong Huang
    AILaw
ArXivPDFHTML
Abstract

Recognizing events and their coreferential mentions in a document is essential for understanding semantic meanings of text. The existing research on event coreference resolution is mostly limited to news articles. In this paper, we present the first dataset for the legal domain, LegalCore, which has been annotated with comprehensive event and event coreference information. The legal contract documents we annotated in this dataset are several times longer than news articles, with an average length of around 25k tokens per document. The annotations show that legal documents have dense event mentions and feature both short-distance and super long-distance coreference links between event mentions. We further benchmark mainstream Large Language Models (LLMs) on this dataset for both event detection and event coreference resolution tasks, and find that this dataset poses significant challenges for state-of-the-art open-source and proprietary LLMs, which perform significantly worse than a supervised baseline. We will publish the dataset as well as the code.

View on arXiv
@article{wei2025_2502.12509,
  title={ LegalCore: A Dataset for Event Coreference Resolution in Legal Documents },
  author={ Kangda Wei and Xi Shi and Jonathan Tong and Sai Ramana Reddy and Anandhavelu Natarajan and Rajiv Jain and Aparna Garimella and Ruihong Huang },
  journal={arXiv preprint arXiv:2502.12509},
  year={ 2025 }
}
Comments on this paper