SE(3)-Invariant Multiparameter Persistent Homology for Chiral-Sensitive
  Molecular Property Prediction
        - URL: http://arxiv.org/abs/2312.07633v1
- Date: Tue, 12 Dec 2023 09:33:54 GMT
- Title: SE(3)-Invariant Multiparameter Persistent Homology for Chiral-Sensitive
  Molecular Property Prediction
- Authors: Andac Demir, Francis Prael III, Bulent Kiziltan
- Abstract summary: We present a novel method for generating molecular fingerprints using multi parameter persistent homology (MPPH)
This technique holds considerable significance for drug discovery and materials science, where precise molecular property prediction is vital.
We demonstrate its superior performance over existing state-of-the-art methods in predicting molecular properties through extensive evaluations on the MoleculeNet benchmark.
- Score: 1.534667887016089
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract:   In this study, we present a novel computational method for generating
molecular fingerprints using multiparameter persistent homology (MPPH). This
technique holds considerable significance for drug discovery and materials
science, where precise molecular property prediction is vital. By integrating
SE(3)-invariance with Vietoris-Rips persistent homology, we effectively capture
the three-dimensional representations of molecular chirality. This
non-superimposable mirror image property directly influences the molecular
interactions, serving as an essential factor in molecular property prediction.
We explore the underlying topologies and patterns in molecular structures by
applying Vietoris-Rips persistent homology across varying scales and parameters
such as atomic weight, partial charge, bond type, and chirality. Our method's
efficacy can be improved by incorporating additional parameters such as
aromaticity, orbital hybridization, bond polarity, conjugated systems, as well
as bond and torsion angles. Additionally, we leverage Stochastic Gradient
Langevin Boosting in a Bayesian ensemble of GBDTs to obtain aleatoric and
epistemic uncertainty estimates for gradient boosting models. With these
uncertainty estimates, we prioritize high-uncertainty samples for active
learning and model fine-tuning, benefiting scenarios where data labeling is
costly or time consuming. Compared to conventional GNNs which usually suffer
from oversmoothing and oversquashing, MPPH provides a more comprehensive and
interpretable characterization of molecular data topology. We substantiate our
approach with theoretical stability guarantees and demonstrate its superior
performance over existing state-of-the-art methods in predicting molecular
properties through extensive evaluations on the MoleculeNet benchmark datasets.
 
      
        Related papers
        - Aligned Manifold Property and Topology Point Clouds for Learning   Molecular Properties [55.2480439325792]
 This work introduces AMPTCR, a molecular surface representation that combines local quantum-derived scalar fields and custom topological descriptors within an aligned point cloud format.<n>For molecular weight, results confirm that AMPTCR encodes physically meaningful data, with a validation R2 of 0.87.<n>In the bacterial inhibition task, AMPTCR enables both classification and direct regression of E. coli inhibition values.
 arXiv  Detail & Related papers  (2025-07-22T04:35:50Z)
- Structure-Aware Compound-Protein Affinity Prediction via Graph Neural   Network with Group Lasso Regularization [11.595051456139021]
 We build end-to-end explainable machine learning models for structure-activity relationship (SAR) modeling for compound property prediction.<n>We implement graph neural network (GNN) methods to obtain atom-level feature information and predict compound-protein affinity.<n>We also utilize group lasso and sparse group lasso to prune and highlight molecular subgraphs and enhance the structure-specific model explainability.
 arXiv  Detail & Related papers  (2025-07-04T06:12:18Z)
- Knowledge-aware contrastive heterogeneous molecular graph learning [77.94721384862699]
 We propose a paradigm shift by encoding molecular graphs into Heterogeneous Molecular Graph Learning (KCHML)
KCHML conceptualizes molecules through three distinct graph views-molecular, elemental, and pharmacological-enhanced by heterogeneous molecular graphs and a dual message-passing mechanism.
This design offers a comprehensive representation for property prediction, as well as for downstream tasks such as drug-drug interaction (DDI) prediction.
 arXiv  Detail & Related papers  (2025-02-17T11:53:58Z)
- Conditional Synthesis of 3D Molecules with Time Correction Sampler [58.0834973489875]
 Time-Aware Conditional Synthesis (TACS) is a novel approach to conditional generation on diffusion models.
It integrates adaptively controlled plug-and-play "online" guidance into a diffusion model, driving samples toward the desired properties.
 arXiv  Detail & Related papers  (2024-11-01T12:59:25Z)
- Data-Driven Parametrization of Molecular Mechanics Force Fields for   Expansive Chemical Space Coverage [16.745564099126575]
 We develop ByteFF, an Amber-compatible force field for drug-like molecules.
Our model predicts all bonded and non-bonded MM force field parameters for drug-like molecules simultaneously across a broad chemical space.
 arXiv  Detail & Related papers  (2024-08-23T03:37:06Z)
- Molecule Design by Latent Prompt Transformer [76.2112075557233]
 This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task.
We propose a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.
 arXiv  Detail & Related papers  (2024-02-27T03:33:23Z)
- Field-based Molecule Generation [50.124402120798365]
 We show how the flexibility of this method provides crucial advantages over the prevalent, point-cloud based methods.
We tackle optical isomerism (enantiomers), a previously omitted molecular property that is crucial for drug safety and effectiveness.
 arXiv  Detail & Related papers  (2024-02-24T17:13:58Z)
- Multiparameter Persistent Homology for Molecular Property Prediction [1.8130068086063336]
 This approach reveals the latent structures and relationships within molecular geometry.
We have conducted extensive experiments on the Lipophilicity, FreeSolv, and ESOL datasets to demonstrate its effectiveness in predicting molecular properties.
 arXiv  Detail & Related papers  (2023-11-17T17:57:56Z)
- From Peptides to Nanostructures: A Euclidean Transformer for Fast and
  Stable Machine Learned Force Fields [5.013279299982324]
 We propose a transformer architecture called SO3krates that combines sparse equivariant representations with a self-attention mechanism.
 SO3krates achieves a unique combination of accuracy, stability, and speed that enables insightful analysis of quantum properties of matter on extended time and system size scales.
 arXiv  Detail & Related papers  (2023-09-21T09:22:05Z)
- Molecular Conformation Generation via Shifting Scores [21.986775283620883]
 We propose a novel molecular conformation generation approach driven by the observation that the disintegration of a molecule can be viewed as casting increasing force fields to its composing atoms.
The corresponding generative modeling ensures a feasible inter-atomic distance geometry and exhibits time reversibility.
 arXiv  Detail & Related papers  (2023-09-12T07:39:43Z)
- Molecule Design by Latent Space Energy-Based Modeling and Gradual
  Distribution Shifting [53.44684898432997]
 Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
 arXiv  Detail & Related papers  (2023-06-09T03:04:21Z)
- Atomic and Subgraph-aware Bilateral Aggregation for Molecular
  Representation Learning [57.670845619155195]
 We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
 arXiv  Detail & Related papers  (2023-05-22T00:56:00Z)
- Graph neural networks for the prediction of molecular structure-property
  relationships [59.11160990637615]
 Graph neural networks (GNNs) are a novel machine learning method that directly work on the molecular graph.
GNNs allow to learn properties in an end-to-end fashion, thereby avoiding the need for informative descriptors.
We describe the fundamentals of GNNs and demonstrate the application of GNNs via two examples for molecular property prediction.
 arXiv  Detail & Related papers  (2022-07-25T11:30:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.