Dual-Modality Representation Learning for Molecular Property Prediction
- URL: http://arxiv.org/abs/2501.06608v1
- Date: Sat, 11 Jan 2025 18:15:37 GMT
- Title: Dual-Modality Representation Learning for Molecular Property Prediction
- Authors: Anyin Zhao, Zuquan Chen, Zhengyu Fang, Xiaoge Zhang, Jing Li,
- Abstract summary: Accurate prediction of drug properties relies heavily on effective molecular representations.
Recent advances in learning drug properties commonly employ Graph Neural Networks (GNNs) based on the graph representation.
We propose a method named Dual-Modality Cross-Attention (DMCA) that can effectively combine the strengths of two representations.
- Score: 3.0953718537420545
- License:
- Abstract: Molecular property prediction has attracted substantial attention recently. Accurate prediction of drug properties relies heavily on effective molecular representations. The structures of chemical compounds are commonly represented as graphs or SMILES sequences. Recent advances in learning drug properties commonly employ Graph Neural Networks (GNNs) based on the graph representation. For the SMILES representation, Transformer-based architectures have been adopted by treating each SMILES string as a sequence of tokens. Because each representation has its own advantages and disadvantages, combining both representations in learning drug properties is a promising direction. We propose a method named Dual-Modality Cross-Attention (DMCA) that can effectively combine the strengths of two representations by employing the cross-attention mechanism. DMCA was evaluated across eight datasets including both classification and regression tasks. Results show that our method achieves the best overall performance, highlighting its effectiveness in leveraging the complementary information from both graph and SMILES modalities.
Related papers
- UniMAP: Universal SMILES-Graph Representation Learning [21.25038529787392]
We propose a universal SMILE-graph representation learning model, namely UniMAP.
Four kinds of pre-training tasks are designed for UniMAP, including Multi-Level Cross-Modality Masking (CMM), SMILES-Graph Matching (SGM), Fragment-Level Alignment (FLA), and Domain Knowledge Learning (DKL)
Experimental results show that UniMAP outperforms current state-of-the-art pre-training methods.
arXiv Detail & Related papers (2023-10-22T07:48:33Z) - GraphCL-DTA: a graph contrastive learning with molecular semantics for
drug-target binding affinity prediction [2.523552067304274]
GraphCL-DTA is a graph contrastive learning framework for molecular graphs to learn drug representations.
Next, we design a new loss function that can be directly used to adjust the uniformity of drug and target representations.
The effectiveness of the above innovative elements is verified on two real datasets.
arXiv Detail & Related papers (2023-07-18T06:01:37Z) - Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations [68.32093648671496]
We introduce GODE, which accounts for the dual-level structure inherent in molecules.
Molecules possess an intrinsic graph structure and simultaneously function as nodes within a broader molecular knowledge graph.
By pre-training two GNNs on different graph structures, GODE effectively fuses molecular structures with their corresponding knowledge graph substructures.
arXiv Detail & Related papers (2023-06-02T15:49:45Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - Molecular Joint Representation Learning via Multi-modal Information [11.493011069441188]
We propose a novel framework of molecular joint representation learning via Multi-Modal information of SMILES and molecular Graphs, called MMSG.
We improve the self-attention mechanism by introducing bond level graph representation as attention bias in Transformer.
We further propose a Bidirectional Message Communication Graph Neural Network (BMC GNN) to strengthen the information flow aggregated from graphs for further combination.
arXiv Detail & Related papers (2022-11-25T11:53:23Z) - Graph neural networks for the prediction of molecular structure-property
relationships [59.11160990637615]
Graph neural networks (GNNs) are a novel machine learning method that directly work on the molecular graph.
GNNs allow to learn properties in an end-to-end fashion, thereby avoiding the need for informative descriptors.
We describe the fundamentals of GNNs and demonstrate the application of GNNs via two examples for molecular property prediction.
arXiv Detail & Related papers (2022-07-25T11:30:44Z) - Chemical-Reaction-Aware Molecule Representation Learning [88.79052749877334]
We propose using chemical reactions to assist learning molecule representation.
Our approach is proven effective to 1) keep the embedding space well-organized and 2) improve the generalization ability of molecule embeddings.
Experimental results demonstrate that our method achieves state-of-the-art performance in a variety of downstream tasks.
arXiv Detail & Related papers (2021-09-21T00:08:43Z) - Learning Attributed Graph Representations with Communicative Message
Passing Transformer [3.812358821429274]
We propose a Communicative Message Passing Transformer (CoMPT) neural network to improve the molecular graph representation.
Unlike the previous transformer-style GNNs that treat molecules as fully connected graphs, we introduce a message diffusion mechanism to leverage the graph connectivity inductive bias.
arXiv Detail & Related papers (2021-07-19T11:58:32Z) - Dual-view Molecule Pre-training [186.07333992384287]
Dual-view molecule pre-training can effectively combine the strengths of both types of molecule representations.
DMP is tested on nine molecular property prediction tasks and achieves state-of-the-art performances on seven of them.
arXiv Detail & Related papers (2021-06-17T03:58:38Z) - MoCL: Contrastive Learning on Molecular Graphs with Multi-level Domain
Knowledge [28.386302970315736]
We propose a novel framework called MoCL, which utilizes domain knowledge at both local- and global-level to assist representation learning.
We evaluate MoCL on various molecular datasets under both linear and semi-supervised settings.
arXiv Detail & Related papers (2021-06-05T18:00:51Z) - Multi-View Graph Neural Networks for Molecular Property Prediction [67.54644592806876]
We present Multi-View Graph Neural Network (MV-GNN), a multi-view message passing architecture.
In MV-GNN, we introduce a shared self-attentive readout component and disagreement loss to stabilize the training process.
We further boost the expressive power of MV-GNN by proposing a cross-dependent message passing scheme.
arXiv Detail & Related papers (2020-05-17T04:46:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.