ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.00752
45
0

LLMs4SchemaDiscovery: A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models

1 April 2025
Sameer Sadruddin
Jennifer D’Souza
Eleni Poupaki
Alex Watkins
Hamed Babaei Giglou
Anisa Rula
Bora Karasulu
Sören Auer
Adrie Mackus
Erwin Kessels
ArXivPDFHTML
Abstract

Extracting structured information from unstructured text is crucial for modeling real-world processes, but traditional schema mining relies on semi-structured data, limiting scalability. This paper introduces schema-miner, a novel tool that combines large language models with human feedback to automate and refine schema extraction. Through an iterative workflow, it organizes properties from text, incorporates expert input, and integrates domain-specific ontologies for semantic depth. Applied to materials science--specifically atomic layer deposition--schema-miner demonstrates that expert-guided LLMs generate semantically rich schemas suitable for diverse real-world applications.

View on arXiv
@article{sadruddin2025_2504.00752,
  title={ LLMs4SchemaDiscovery: A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models },
  author={ Sameer Sadruddin and Jennifer D'Souza and Eleni Poupaki and Alex Watkins and Hamed Babaei Giglou and Anisa Rula and Bora Karasulu and Sören Auer and Adrie Mackus and Erwin Kessels },
  journal={arXiv preprint arXiv:2504.00752},
  year={ 2025 }
}
Comments on this paper