arXiv:2306.01966

GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation

3 June 2023
Tatsuya Aoyama
Shabnam Behzad
Luke Gessler
Lauren Levine
Jessica Lin
Yang Liu
Siyao Peng
Yilun Zhu
Amir Zeldes
Abstract

We present GENTLE, a new mixed-genre English challenge corpus totaling 17K tokens and consisting of 8 unusual text types for out-of-domain evaluation: dictionary entries, esports commentaries, legal documents, medical notes, poetry, mathematical proofs, syllabuses, and threat letters. GENTLE is manually annotated for a variety of popular NLP tasks, including syntactic dependency parsing, entity recognition, coreference resolution, and discourse parsing. We evaluate state-of-the-art NLP systems on GENTLE and find that, on every task, their performance degrades severely for at least some genres, which indicates GENTLE's utility as an evaluation dataset for NLP systems.
