De novo PROTAC design using graph-based deep generative models
- URL: http://arxiv.org/abs/2211.02660v1
- Date: Fri, 4 Nov 2022 15:34:45 GMT
- Title: De novo PROTAC design using graph-based deep generative models
- Authors: Divya Nori, Connor W. Coley, Roc\'io Mercado
- Abstract summary: We show that a graph-based generative model can be used to propose PROTAC-like structures from empty graphs.
We steer the generative model towards compounds with higher likelihoods of predicted degradation activity.
After fine-tuning, predicted activity against a challenging POI increases from 50% to >80% with near-perfect chemical validity.
- Score: 2.566673015346446
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: PROteolysis TArgeting Chimeras (PROTACs) are an emerging therapeutic modality
for degrading a protein of interest (POI) by marking it for degradation by the
proteasome. Recent developments in artificial intelligence (AI) suggest that
deep generative models can assist with the de novo design of molecules with
desired properties, and their application to PROTAC design remains largely
unexplored. We show that a graph-based generative model can be used to propose
novel PROTAC-like structures from empty graphs. Our model can be guided towards
the generation of large molecules (30--140 heavy atoms) predicted to degrade a
POI through policy-gradient reinforcement learning (RL). Rewards during RL are
applied using a boosted tree surrogate model that predicts a molecule's
degradation potential for each POI. Using this approach, we steer the
generative model towards compounds with higher likelihoods of predicted
degradation activity. Despite being trained on sparse public data, the
generative model proposes molecules with substructures found in known
degraders. After fine-tuning, predicted activity against a challenging POI
increases from 50% to >80% with near-perfect chemical validity for sampled
compounds, suggesting this is a promising approach for the optimization of
large, PROTAC-like molecules for targeted protein degradation.
Related papers
- Cell Morphology-Guided Small Molecule Generation with GFlowNets [41.8027680592766]
We propose an unsupervised multimodal joint embedding to define a latent similarity as a reward for GFlowNets.
The proposed model learns to generate new molecules that could produce phenotypic effects similar to those of the given image target.
We demonstrate that the proposed method generates molecules with high morphological and structural similarity to the target, increasing the likelihood of similar biological activity.
arXiv Detail & Related papers (2024-08-09T17:40:35Z) - Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization [147.7899503829411]
AliDiff is a novel framework to align pretrained target diffusion models with preferred functional properties.
It can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score.
arXiv Detail & Related papers (2024-07-01T06:10:29Z) - UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment [51.49238426241974]
This paper introduces UAlign, a template-free graph-to-sequence pipeline for retrosynthesis prediction.
By combining graph neural networks and Transformers, our method can more effectively leverage the inherent graph structure of molecules.
arXiv Detail & Related papers (2024-03-25T03:23:03Z) - Molecule Design by Latent Prompt Transformer [76.2112075557233]
This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task.
We propose a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.
arXiv Detail & Related papers (2024-02-27T03:33:23Z) - FREED++: Improving RL Agents for Fragment-Based Molecule Generation by
Thorough Reproduction [33.57089414199478]
Reinforcement Learning (RL) has emerged as a promising approach to generating molecules with the docking score (DS) as a reward.
We reproduce, scrutinize and improve the recent model for molecule generation called FREED (arXiv:2110.01219)
arXiv Detail & Related papers (2024-01-18T09:54:19Z) - Distributed Reinforcement Learning for Molecular Design: Antioxidant
case [0.20971479389679337]
DA-MolDQN is a distributed reinforcement learning algorithm for antioxidants.
It is 100x faster than previous algorithms and can discover new optimized molecules from proprietary and public antioxidants.
arXiv Detail & Related papers (2023-12-03T03:23:13Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - CELLS: Cost-Effective Evolution in Latent Space for Goal-Directed
Molecular Generation [23.618366377098614]
We propose a cost-effective evolution strategy in latent space, which optimize the molecular latent representation vectors.
We adopt a pre-trained molecular generative model to map the latent and observation spaces.
We conduct extensive experiments on multiple optimization tasks comparing the proposed framework to several advanced techniques.
arXiv Detail & Related papers (2021-11-30T11:02:18Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z) - The Synthesizability of Molecules Proposed by Generative Models [3.032184156362992]
Discovery of functional molecules is an expensive and time-consuming process.
One class of techniques of growing interest for early-stage drug discovery is de novo molecular generation and optimization.
These techniques can suggest novel molecular structures intended to maximize a multi-objective function.
However, the utility of these approaches is stymied by ignorance of synthesizability.
arXiv Detail & Related papers (2020-02-17T15:41:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.