9
0

MOFGPT: Generative Design of Metal-Organic Frameworks using Language Models

Main:20 Pages
14 Figures
34 Tables
Appendix:53 Pages
Abstract

The discovery of Metal-Organic Frameworks (MOFs) with application-specific properties remains a central challenge in materials chemistry, owing to the immense size and complexity of their structural design space. Conventional computational screening techniques such as molecular simulations and density functional theory (DFT), while accurate, are computationally prohibitive at scale. Machine learning offers an exciting alternative by leveraging data-driven approaches to accelerate materials discovery. The complexity of MOFs, with their extended periodic structures and diverse topologies, creates both opportunities and challenges for generative modeling approaches. To address these challenges, we present a reinforcement learning-enhanced, transformer-based framework for the de novo design of MOFs. Central to our approach is MOFid, a chemically-informed string representation encoding both connectivity and topology, enabling scalable generative modeling. Our pipeline comprises three components: (1) a generative GPT model trained on MOFid sequences, (2) MOFormer, a transformer-based property predictor, and (3) a reinforcement learning (RL) module that optimizes generated candidates via property-guided reward functions. By integrating property feedback into sequence generation, our method drives the model toward synthesizable, topologically valid MOFs with desired functional attributes. This work demonstrates the potential of large language models, when coupled with reinforcement learning, to accelerate inverse design in reticular chemistry and unlock new frontiers in computational MOF discovery.

View on arXiv
@article{badrinarayanan2025_2506.00198,
  title={ MOFGPT: Generative Design of Metal-Organic Frameworks using Language Models },
  author={ Srivathsan Badrinarayanan and Rishikesh Magar and Akshay Antony and Radheesh Sharma Meda and Amir Barati Farimani },
  journal={arXiv preprint arXiv:2506.00198},
  year={ 2025 }
}
Comments on this paper