Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning
- URL: http://arxiv.org/abs/2012.01736v1
- Date: Thu, 3 Dec 2020 07:35:38 GMT
- Title: Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning
- Authors: Marcin J. Skwark, Nicol\'as L\'opez Carranza, Thomas Pierrot, Joe
Phillips, Slim Said, Alexandre Laterre, Amine Kerkeni, U\u{g}ur \c{S}ahin,
Karim Beguir
- Abstract summary: SARS-CoV-2 pandemic has created a global race for a cure.
One approach focuses on designing a novel variant of the human angiotensin-converting enzyme 2 (ACE2)
We formulate a novel protein design framework as a reinforcement learning problem.
- Score: 50.57291257437373
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The SARS-CoV-2 pandemic has created a global race for a cure. One approach
focuses on designing a novel variant of the human angiotensin-converting enzyme
2 (ACE2) that binds more tightly to the SARS-CoV-2 spike protein and diverts it
from human cells. Here we formulate a novel protein design framework as a
reinforcement learning problem. We generate new designs efficiently through the
combination of a fast, biologically-grounded reward function and sequential
action-space formulation. The use of Policy Gradients reduces the compute
budget needed to reach consistent, high-quality designs by at least an order of
magnitude compared to standard methods. Complexes designed by this method have
been validated by molecular dynamics simulations, confirming their increased
stability. This suggests that combining leading protein design methods with
modern deep reinforcement learning is a viable path for discovering a Covid-19
cure and may accelerate design of peptide-based therapeutics for other
diseases.
Related papers
- A Comprehensive Review of Emerging Approaches in Machine Learning for De Novo PROTAC Design [1.534667887016089]
Targeted protein degradation (TPD) aims to regulate the intracellular levels of proteins by harnessing the cell's innate degradation pathways.
Proteolysis-targeting chimeras (PROTACs) are at the heart of TPD strategies.
Traditional methodologies for designing such complex molecules have limitations.
arXiv Detail & Related papers (2024-06-24T14:42:27Z) - A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design [54.30457372514873]
We propose a hierarchical training paradigm (HTP) for the antibody sequence-structure co-design.
HTP consists of four levels of training stages, each corresponding to a specific protein modality.
Empirical experiments show that HTP sets the new state-of-the-art performance in the co-design problem.
arXiv Detail & Related papers (2023-10-30T02:39:15Z) - Protein Design with Guided Discrete Diffusion [67.06148688398677]
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling.
We propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models.
NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods.
arXiv Detail & Related papers (2023-05-31T16:31:24Z) - Improving few-shot learning-based protein engineering with evolutionary
sampling [0.0]
We propose a few-shot learning approach to novel protein design that aims to accelerate the expensive wet lab testing cycle.
Our approach is composed of two parts: a semi-supervised transfer learning approach to generate a discrete fitness landscape for a desired protein function and a novel evolutionary Monte Carlo Chain sampling algorithm.
We demonstrate the performance of our approach by experimentally screening predicted high fitness gene activators, resulting in a dramatically improved hit rate compared to existing methods.
arXiv Detail & Related papers (2023-05-23T23:07:53Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - Structured Q-learning For Antibody Design [82.78798397798533]
One of the crucial steps involved in antibody design is to find an arrangement of amino acids in a protein sequence that improves its binding with a pathogen.
Combinatorial optimization of antibodies is difficult due to extremely large search spaces and non-linear objectives.
Applying traditional Reinforcement Learning to antibody design optimization results in poor performance.
We propose Q-learning, an extension of Q-learning that incorporates structural priors for optimization.
arXiv Detail & Related papers (2022-09-10T15:36:55Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Evening the Score: Targeting SARS-CoV-2 Protease Inhibition in Graph
Generative Models for Therapeutic Candidates [11.853524110656991]
We look at the therapeutic design of novel drug candidates targeting SARS-CoV-2 viral proteins.
We use an autoencoder that generates molecules with similar structures to a dataset of drugs with anti-SARS activity.
During generation, we explore optimization toward several design targets to balance druglikeness, synthetic accessability, and anti-SARS activity.
arXiv Detail & Related papers (2021-05-07T18:39:25Z) - High-Throughput Virtual Screening of Small Molecule Inhibitors for
SARS-CoV-2 Protein Targets with Deep Fusion Models [3.8075853084146023]
Over 500 million small molecules were screened against four protein structures from the novel coronavirus (SARS-CoV-2), which causes COVID-19.
Three enhancements to Deep Fusion were made in order to evaluate more than 5 billion docked poses on SARS-CoV-2 protein targets.
arXiv Detail & Related papers (2021-04-09T18:18:26Z) - PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with
conditional generative models [2.0750380105212116]
With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents.
We propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets.
arXiv Detail & Related papers (2020-05-27T11:30:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.