ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.19800
35
2

SemEval-2025 Task 9: The Food Hazard Detection Challenge

25 March 2025
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
Juli Bakagianni
ArXivPDFHTML
Abstract

In this challenge, we explored text-based food hazard prediction with long tail distributed classes. The task was divided into two subtasks: (1) predicting whether a web text implies one of ten food-hazard categories and identifying the associated food category, and (2) providing a more fine-grained classification by assigning a specific label to both the hazard and the product. Our findings highlight that large language model-generated synthetic data can be highly effective for oversampling long-tail distributions. Furthermore, we find that fine-tuned encoder-only, encoder-decoder, and decoder-only systems achieve comparable maximum performance across both subtasks. During this challenge, we gradually released (under CC BY-NC-SA 4.0) a novel set of 6,644 manually labeled food-incident reports.

View on arXiv
@article{randl2025_2503.19800,
  title={ SemEval-2025 Task 9: The Food Hazard Detection Challenge },
  author={ Korbinian Randl and John Pavlopoulos and Aron Henriksson and Tony Lindgren and Juli Bakagianni },
  journal={arXiv preprint arXiv:2503.19800},
  year={ 2025 }
}
Comments on this paper