Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs

20 June 2025

Main:33 Pages

17 Figures

Bibliography:3 Pages

17 Tables

Abstract

Due to an exponential increase in published research articles, it is impossible for individual scientists to read all publications, even within their own research field. In this work, we investigate the use of large language models (LLMs) for the purpose of extracting the main concepts and semantic information from scientific abstracts in the domain of materials science to find links that were not noticed by humans and thus to suggest inspiring near/mid-term future research directions. We show that LLMs can extract concepts more efficiently than automated keyword extraction methods to build a concept graph as an abstraction of the scientific literature. A machine learning model is trained to predict emerging combinations of concepts, i.e. new research ideas, based on historical data. We demonstrate that integrating semantic concept information leads to an increased prediction performance. The applicability of our model is demonstrated in qualitative interviews with domain experts based on individualized model suggestions. We show that the model can inspire materials scientists in their creative thinking process by predicting innovative combinations of topics that have not yet been investigated.

View on arXiv

@article{marwitz2025_2506.16824,
  title={ Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs },
  author={ Thomas Marwitz and Alexander Colsmann and Ben Breitung and Christoph Brabec and Christoph Kirchlechner and Eva Blasco and Gabriel Cadilha Marques and Horst Hahn and Michael Hirtz and Pavel A. Levkin and Yolita M. Eggeler and Tobias Schlöder and Pascal Friederich },
  journal={arXiv preprint arXiv:2506.16824},
  year={ 2025 }
}

Comments on this paper