Atom-Motif Contrastive Transformer for Molecular Property Prediction
- URL: http://arxiv.org/abs/2310.07351v1
- Date: Wed, 11 Oct 2023 10:03:10 GMT
- Title: Atom-Motif Contrastive Transformer for Molecular Property Prediction
- Authors: Wentao Yu, Shuo Chen, Chen Gong, Gang Niu, Masashi Sugiyama
- Abstract summary: Graph Transformer (GT) models have been widely used in the task of Molecular Property Prediction (MPP)
We propose a novel Atom-Motif Contrastive Transformer (AMCT) which explores atom-level interactions and considers motif-level interactions.
Our proposed AMCT is extensively evaluated on seven popular benchmark datasets, and both quantitative and qualitative results firmly demonstrate its effectiveness.
- Score: 68.85399466928976
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, Graph Transformer (GT) models have been widely used in the task of
Molecular Property Prediction (MPP) due to their high reliability in
characterizing the latent relationship among graph nodes (i.e., the atoms in a
molecule). However, most existing GT-based methods usually explore the basic
interactions between pairwise atoms, and thus they fail to consider the
important interactions among critical motifs (e.g., functional groups consisted
of several atoms) of molecules. As motifs in a molecule are significant
patterns that are of great importance for determining molecular properties
(e.g., toxicity and solubility), overlooking motif interactions inevitably
hinders the effectiveness of MPP. To address this issue, we propose a novel
Atom-Motif Contrastive Transformer (AMCT), which not only explores the
atom-level interactions but also considers the motif-level interactions. Since
the representations of atoms and motifs for a given molecule are actually two
different views of the same instance, they are naturally aligned to generate
the self-supervisory signals for model training. Meanwhile, the same motif can
exist in different molecules, and hence we also employ the contrastive loss to
maximize the representation agreement of identical motifs across different
molecules. Finally, in order to clearly identify the motifs that are critical
in deciding the properties of each molecule, we further construct a
property-aware attention mechanism into our learning framework. Our proposed
AMCT is extensively evaluated on seven popular benchmark datasets, and both
quantitative and qualitative results firmly demonstrate its effectiveness when
compared with the state-of-the-art methods.
Related papers
- Multi-Modal Representation Learning for Molecular Property Prediction:
Sequence, Graph, Geometry [6.049566024728809]
Deep learning-based molecular property prediction has emerged as a solution to the resource-intensive nature of traditional methods.
In this paper, we propose a novel multi-modal representation learning model, called SGGRL, for molecular property prediction.
To ensure consistency across modalities, SGGRL is trained to maximize the similarity of representations for the same molecule while minimizing similarity for different molecules.
arXiv Detail & Related papers (2024-01-07T02:18:00Z) - Diffusing on Two Levels and Optimizing for Multiple Properties: A Novel
Approach to Generating Molecules with Desirable Properties [33.2976176283611]
We present a novel approach to generating molecules with desirable properties, which expands the diffusion model framework with multiple innovative designs.
To get desirable molecular fragments, we develop a novel electronic effect based fragmentation method.
We show that the molecules generated by our proposed method have better validity, uniqueness, novelty, Fr'echet ChemNet Distance (FCD), QED, and PlogP than those generated by current SOTA models.
arXiv Detail & Related papers (2023-10-05T11:43:21Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - MolCPT: Molecule Continuous Prompt Tuning to Generalize Molecular
Representation Learning [77.31492888819935]
We propose a novel paradigm of "pre-train, prompt, fine-tune" for molecular representation learning, named molecule continuous prompt tuning (MolCPT)
MolCPT defines a motif prompting function that uses the pre-trained model to project the standalone input into an expressive prompt.
Experiments on several benchmark datasets show that MolCPT efficiently generalizes pre-trained GNNs for molecular property prediction.
arXiv Detail & Related papers (2022-12-20T19:32:30Z) - DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding [51.970607704953096]
Previous works usually generate atoms in an auto-regressive way, where element types and 3D coordinates of atoms are generated one by one.
In real-world molecular systems, the interactions among atoms in an entire molecule are global, leading to the energy function pair-coupled among atoms.
In this work, a generative diffusion model for molecular 3D structures based on target proteins is established, at a full-atom level in a non-autoregressive way.
arXiv Detail & Related papers (2022-11-21T07:02:15Z) - A Molecular Multimodal Foundation Model Associating Molecule Graphs with
Natural Language [63.60376252491507]
We propose a molecular multimodal foundation model which is pretrained from molecular graphs and their semantically related textual data.
We believe that our model would have a broad impact on AI-empowered fields across disciplines such as biology, chemistry, materials, environment, and medicine.
arXiv Detail & Related papers (2022-09-12T00:56:57Z) - Interpretable Molecular Graph Generation via Monotonic Constraints [19.401468196146336]
Deep graph generative models treat molecule design as graph generation problems.
Existing models have many shortcomings, including poor interpretability and controllability toward desired molecular properties.
This paper proposes new methodologies for molecule generation with interpretable and deep controllable models.
arXiv Detail & Related papers (2022-02-28T08:35:56Z) - Property-aware Adaptive Relation Networks for Molecular Property
Prediction [34.13439007658925]
We propose a property-aware adaptive relation networks (PAR) for the few-shot molecular property prediction problem.
Our PAR is compatible with existing graph-based molecular encoders, and are further equipped with the ability to obtain property-aware molecular embedding and model molecular relation graph.
arXiv Detail & Related papers (2021-07-16T16:22:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.