ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.14085
33
0

An Improved Method for Class-specific Keyword Extraction: A Case Study in the German Business Registry

19 July 2024
Stephen Meisenbacher
Tim Schopf
Weixin Yan
Patrick Holl
Florian Matthes
ArXivPDFHTML
Abstract

The task of keyword extraction\textit{keyword extraction}keyword extraction is often an important initial step in unsupervised information extraction, forming the basis for tasks such as topic modeling or document classification. While recent methods have proven to be quite effective in the extraction of keywords, the identification of class-specific\textit{class-specific}class-specific keywords, or only those pertaining to a predefined class, remains challenging. In this work, we propose an improved method for class-specific keyword extraction, which builds upon the popular KeyBERT\textbf{KeyBERT}KeyBERT library to identify only keywords related to a class described by seed keywords\textit{seed keywords}seed keywords. We test this method using a dataset of German business registry entries, where the goal is to classify each business according to an economic sector. Our results reveal that our method greatly improves upon previous approaches, setting a new standard for class-specific\textit{class-specific}class-specific keyword extraction.

View on arXiv
Comments on this paper