GraphGANFed: A Federated Generative Framework for Graph-Structured
Molecules Towards Efficient Drug Discovery
- URL: http://arxiv.org/abs/2304.05498v1
- Date: Tue, 11 Apr 2023 21:15:28 GMT
- Title: GraphGANFed: A Federated Generative Framework for Graph-Structured
Molecules Towards Efficient Drug Discovery
- Authors: Daniel Manu, Jingjing Yao, Wuji Liu, and Xiang Sun
- Abstract summary: We propose a Graph convolutional network in Generative Adversarial Networks via Federated learning (GraphGANFed) framework to generate novel molecules without sharing local data sets.
The molecules generated by GraphGANFed can achieve high novelty (=100) and diversity (> 0.9)
- Score: 2.309914459672556
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advances in deep learning have accelerated its use in various
applications, such as cellular image analysis and molecular discovery. In
molecular discovery, a generative adversarial network (GAN), which comprises a
discriminator to distinguish generated molecules from existing molecules and a
generator to generate new molecules, is one of the premier technologies due to
its ability to learn from a large molecular data set efficiently and generate
novel molecules that preserve similar properties. However, different
pharmaceutical companies may be unwilling or unable to share their local data
sets due to the geo-distributed and sensitive nature of molecular data sets,
making it impossible to train GANs in a centralized manner. In this paper, we
propose a Graph convolutional network in Generative Adversarial Networks via
Federated learning (GraphGANFed) framework, which integrates graph
convolutional neural Network (GCN), GAN, and federated learning (FL) as a whole
system to generate novel molecules without sharing local data sets. In
GraphGANFed, the discriminator is implemented as a GCN to better capture
features from molecules represented as molecular graphs, and FL is used to
train both the discriminator and generator in a distributive manner to preserve
data privacy. Extensive simulations are conducted based on the three bench-mark
data sets to demonstrate the feasibility and effectiveness of GraphGANFed. The
molecules generated by GraphGANFed can achieve high novelty (=100) and
diversity (> 0.9). The simulation results also indicate that 1) a lower
complexity discriminator model can better avoid mode collapse for a smaller
data set, 2) there is a tradeoff among different evaluation metrics, and 3)
having the right dropout ratio of the generator and discriminator can avoid
mode collapse.
Related papers
- Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - Bi-level Contrastive Learning for Knowledge-Enhanced Molecule
Representations [55.42602325017405]
We propose a novel method called GODE, which takes into account the two-level structure of individual molecules.
By pre-training two graph neural networks (GNNs) on different graph structures, combined with contrastive learning, GODE fuses molecular structures with their corresponding knowledge graph substructures.
When fine-tuned across 11 chemical property tasks, our model outperforms existing benchmarks, registering an average ROC-AUC uplift of 13.8% for classification tasks and an average RMSE/MAE enhancement of 35.1% for regression tasks.
arXiv Detail & Related papers (2023-06-02T15:49:45Z) - Conditional Diffusion Based on Discrete Graph Structures for Molecular
Graph Generation [32.66694406638287]
We propose a Conditional Diffusion model based on discrete Graph Structures (CDGS) for molecular graph generation.
Specifically, we construct a forward graph diffusion process on both graph structures and inherent features through differential equations (SDE)
We present a specialized hybrid graph noise prediction model that extracts the global context and the local node-edge dependency from intermediate graph states.
arXiv Detail & Related papers (2023-01-01T15:24:15Z) - HiGNN: Hierarchical Informative Graph Neural Networks for Molecular
Property Prediction Equipped with Feature-Wise Attention [5.735627221409312]
We propose a well-designed hierarchical informative graph neural networks framework (termed HiGNN) for predicting molecular property.
Experiments demonstrate that HiGNN achieves state-of-the-art predictive performance on many challenging drug discovery-associated benchmark datasets.
arXiv Detail & Related papers (2022-08-30T05:16:15Z) - Molecular Graph Generation via Geometric Scattering [7.796917261490019]
Graph neural networks (GNNs) have been used extensively for addressing problems in drug design and discovery.
We propose a representation-first approach to molecular graph generation.
We show that our architecture learns meaningful representations of drug datasets and provides a platform for goal-directed drug synthesis.
arXiv Detail & Related papers (2021-10-12T18:00:23Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z) - ASGN: An Active Semi-supervised Graph Neural Network for Molecular
Property Prediction [61.33144688400446]
We propose a novel framework called Active Semi-supervised Graph Neural Network (ASGN) by incorporating both labeled and unlabeled molecules.
In the teacher model, we propose a novel semi-supervised learning method to learn general representation that jointly exploits information from molecular structure and molecular distribution.
At last, we proposed a novel active learning strategy in terms of molecular diversities to select informative data during the whole framework learning.
arXiv Detail & Related papers (2020-07-07T04:22:39Z) - Self-Supervised Graph Transformer on Large-Scale Molecular Data [73.3448373618865]
We propose a novel framework, GROVER, for molecular representation learning.
GROVER can learn rich structural and semantic information of molecules from enormous unlabelled molecular data.
We pre-train GROVER with 100 million parameters on 10 million unlabelled molecules -- the biggest GNN and the largest training dataset in molecular representation learning.
arXiv Detail & Related papers (2020-06-18T08:37:04Z) - Uncovering the Folding Landscape of RNA Secondary Structure with Deep
Graph Embeddings [71.20283285671461]
We propose a geometric scattering autoencoder (GSAE) network for learning such graph embeddings.
Our embedding network first extracts rich graph features using the recently proposed geometric scattering transform.
We show that GSAE organizes RNA graphs both by structure and energy, accurately reflecting bistable RNA structures.
arXiv Detail & Related papers (2020-06-12T00:17:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.