ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.14695
29
0

Depth F1F_1F1​: Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability

20 June 2024
Parker Seegmiller
Joseph Gatto
S. Preum
    VLM
ArXivPDFHTML
Abstract

Recent evaluations of cross-domain text classification models aim to measure the ability of a model to obtain domain-invariant performance in a target domain given labeled samples in a source domain. The primary strategy for this evaluation relies on assumed differences between source domain samples and target domain samples in benchmark datasets. This evaluation strategy fails to account for the similarity between source and target domains, and may mask when models fail to transfer learning to specific target samples which are highly dissimilar from the source domain. We introduce Depth F1F_1F1​, a novel cross-domain text classification performance metric. Designed to be complementary to existing classification metrics such as F1F_1F1​, Depth F1F_1F1​ measures how well a model performs on target samples which are dissimilar from the source domain. We motivate this metric using standard cross-domain text classification datasets and benchmark several recent cross-domain text classification models, with the goal of enabling in-depth evaluation of the semantic generalizability of cross-domain text classification models.

View on arXiv
Comments on this paper