Molecular Design Based on Integer Programming and Quadratic Descriptors
in a Two-layered Model
- URL: http://arxiv.org/abs/2209.13527v1
- Date: Tue, 13 Sep 2022 08:27:25 GMT
- Title: Molecular Design Based on Integer Programming and Quadratic Descriptors
in a Two-layered Model
- Authors: Jianshen Zhu, Naveed Ahmed Azam, Shengjuan Cao, Ryota Ido, Kazuya
Haraguchi, Liang Zhao, Hiroshi Nagamochi and Tatsuya Akutsu
- Abstract summary: The framework infers a desired chemical graph by solving a mixed integer linear program (MILP)
A set of graph theoretical descriptors in the feature function plays a key role to derive a compact formulation of such an MILP.
The results of our computational experiments suggest that the proposed method can infer a chemical structure with up to 50 non-hydrogen atoms.
- Score: 5.845754795753478
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A novel framework has recently been proposed for designing the molecular
structure of chemical compounds with a desired chemical property, where design
of novel drugs is an important topic in bioinformatics and chemo-informatics.
The framework infers a desired chemical graph by solving a mixed integer linear
program (MILP) that simulates the computation process of a feature function
defined by a two-layered model on chemical graphs and a prediction function
constructed by a machine learning method. A set of graph theoretical
descriptors in the feature function plays a key role to derive a compact
formulation of such an MILP. To improve the learning performance of prediction
functions in the framework maintaining the compactness of the MILP, this paper
utilizes the product of two of those descriptors as a new descriptor and then
designs a method of reducing the number of descriptors. The results of our
computational experiments suggest that the proposed method improved the
learning performance for many chemical properties and can infer a chemical
structure with up to 50 non-hydrogen atoms.
Related papers
- GraphXForm: Graph transformer for computer-aided molecular design with application to extraction [73.1842164721868]
We present GraphXForm, a decoder-only graph transformer architecture, which is pretrained on existing compounds and then fine-tuned.
We evaluate it on two solvent design tasks for liquid-liquid extraction, showing that it outperforms four state-of-the-art molecular design techniques.
arXiv Detail & Related papers (2024-11-03T19:45:15Z) - Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - Cycle-Configuration: A Novel Graph-theoretic Descriptor Set for Molecular Inference [6.170488239124818]
Cycle-configuration (CC) descriptors can be used in the standard "two-layered (2L) model of mol-infer.
Proposed descriptors capture the notion of ortho/meta/para patterns that appear in aromatic rings.
arXiv Detail & Related papers (2024-08-09T15:45:41Z) - A Gaussian Process Model for Ordinal Data with Applications to Chemoinformatics [0.0]
We present conditional Gaussian process models to predict ordinal outcomes from chemical experiments.
A novel aspect of our model is that the kernel contains a scaling parameter, that controls the strength of the correlation between elements of the chemical space.
Using molecular fingerprints, a numerical representation of a compound's location within the chemical space, we show that accounting for correlation amongst chemical compounds improves predictive performance.
arXiv Detail & Related papers (2024-05-16T11:18:32Z) - Beyond Chemical Language: A Multimodal Approach to Enhance Molecular
Property Prediction [2.1202329976106924]
We present a novel multimodal language model approach for predicting molecular properties by combining chemical language representation with physicochemical features.
Our approach, MULTIMODAL-MOLFORMER, utilizes a causal multistage feature selection method that identifies physicochemical features based on their direct causal effect on a specific target property.
Our results demonstrate a superior performance compared to existing state-of-the-art algorithms, including the chemical language-based MOLFORMER and graph neural networks.
arXiv Detail & Related papers (2023-06-22T13:28:59Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - Molecular Design Based on Integer Programming and Splitting Data Sets by
Hyperplanes [6.504869613326338]
We propose a framework for designing the molecular structure of chemical compounds with a desired chemical property.
The proposed framework infers a desired chemical graph by solving a mixed integer linear program (MILP) and a prediction function constructed by a machine learning method.
The results of our computational experiments suggest that the proposed method improved the learning performance for several chemical properties to which a good prediction function has been difficult to construct.
arXiv Detail & Related papers (2023-04-27T04:18:41Z) - An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design [54.92529253182004]
We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
arXiv Detail & Related papers (2023-04-12T13:34:22Z) - Molecular Design Based on Artificial Neural Networks, Integer
Programming and Grid Neighbor Search [6.519339570726759]
In this paper, we propose a procedure for generating other feasible solutions of a mixed integer linear program.
The results of our computational experiments suggest that the proposed method can generate an additional number of new chemical graphs with up to 50 non-hydrogen atoms.
arXiv Detail & Related papers (2021-08-23T16:10:57Z) - Structure-Enhanced Meta-Learning For Few-Shot Graph Classification [53.54066611743269]
This work explores the potential of metric-based meta-learning for solving few-shot graph classification.
An implementation upon GIN, named SMFGIN, is tested on two datasets, Chembl and TRIANGLES.
arXiv Detail & Related papers (2021-03-05T09:03:03Z) - Energy-based View of Retrosynthesis [70.66156081030766]
We propose a framework that unifies sequence- and graph-based methods as energy-based models.
We present a novel dual variant within the framework that performs consistent training over Bayesian forward- and backward-prediction.
This model improves state-of-the-art performance by 9.6% for template-free approaches where the reaction type is unknown.
arXiv Detail & Related papers (2020-07-14T18:51:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.