ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.00801
22
0

Molecular Design Based on Integer Programming and Splitting Data Sets by Hyperplanes

27 April 2023
Jianshen Zhu
Naveed Ahmed Azam
Kazuya Haraguchi
Liang Zhao
H. Nagamochi
Tatsuya Akutsu
ArXivPDFHTML
Abstract

A novel framework for designing the molecular structure of chemical compounds with a desired chemical property has recently been proposed. The framework infers a desired chemical graph by solving a mixed integer linear program (MILP) that simulates the computation process of a feature function defined by a two-layered model on chemical graphs and a prediction function constructed by a machine learning method. To improve the learning performance of prediction functions in the framework, we design a method that splits a given data set C\mathcal{C}C into two subsets C(i),i=1,2\mathcal{C}^{(i)},i=1,2C(i),i=1,2 by a hyperplane in a chemical space so that most compounds in the first (resp., second) subset have observed values lower (resp., higher) than a threshold θ\thetaθ. We construct a prediction function ψ\psiψ to the data set C\mathcal{C}C by combining prediction functions ψi,i=1,2\psi_i,i=1,2ψi​,i=1,2 each of which is constructed on C(i)\mathcal{C}^{(i)}C(i) independently. The results of our computational experiments suggest that the proposed method improved the learning performance for several chemical properties to which a good prediction function has been difficult to construct.

View on arXiv
Comments on this paper