Random Copolymer inverse design system orienting on Accurate discovering
of Antimicrobial peptide-mimetic copolymers
- URL: http://arxiv.org/abs/2212.00023v1
- Date: Wed, 30 Nov 2022 14:29:50 GMT
- Title: Random Copolymer inverse design system orienting on Accurate discovering
of Antimicrobial peptide-mimetic copolymers
- Authors: Tianyu Wu, Yang Tang
- Abstract summary: We develop a universal random copolymer inverse design system via multi-model copolymer representation learning, knowledge distillation and reinforcement learning.
By pre-training a scaffold-decorator generative model via knowledge distillation, copolymer space are greatly contracted to the near space of existing data for exploration.
Our reinforcement learning algorithm can be adaptive for customized generation on specific scaffolds and requirements on property or structures.
- Score: 9.416757363901295
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Antimicrobial resistance is one of the biggest health problem, especially in
the current period of COVID-19 pandemic. Due to the unique membrane-destruction
bactericidal mechanism, antimicrobial peptide-mimetic copolymers are paid more
attention and it is urgent to find more potential candidates with
broad-spectrum antibacterial efficacy and low toxicity. Artificial intelligence
has shown significant performance on small molecule or biotech drugs, however,
the higher-dimension of polymer space and the limited experimental data
restrict the application of existing methods on copolymer design. Herein, we
develop a universal random copolymer inverse design system via multi-model
copolymer representation learning, knowledge distillation and reinforcement
learning. Our system realize a high-precision antimicrobial activity prediction
with few-shot data by extracting various chemical information from multi-modal
copolymer representations. By pre-training a scaffold-decorator generative
model via knowledge distillation, copolymer space are greatly contracted to the
near space of existing data for exploration. Thus, our reinforcement learning
algorithm can be adaptive for customized generation on specific scaffolds and
requirements on property or structures. We apply our system on collected
antimicrobial peptide-mimetic copolymers data, and we discover candidate
copolymers with desired properties.
Related papers
- Active learning for affinity prediction of antibodies [45.58662352490961]
For large molecules such as antibodies, identifying mutations that enhance antibody affinity is challenging.
FERB methods can offer valuable insights into how different mutations will impact the potency and selectivity of a drug candidate.
We present an active learning framework that iteratively proposes sequences for simulators to evaluate, thereby accelerating the search for improved binders.
arXiv Detail & Related papers (2024-06-11T13:42:49Z) - Active Causal Learning for Decoding Chemical Complexities with Targeted Interventions [0.0]
We introduce an active learning approach that discerns underlying cause-effect relationships through strategic sampling.
This method identifies the smallest subset of the dataset capable of encoding the most information representative of a much larger chemical space.
The identified causal relations are then leveraged to conduct systematic interventions, optimizing the design task within a chemical space that the models have not encountered previously.
arXiv Detail & Related papers (2024-04-05T17:15:48Z) - Hybrid quantum cycle generative adversarial network for small molecule
generation [0.0]
This work introduces several new generative adversarial network models based on engineering integration of parametrized quantum circuits into known molecular generative adversarial networks.
The introduced machine learning models incorporate a new multi- parameter reward function grounded in reinforcement learning principles.
arXiv Detail & Related papers (2023-12-28T14:10:26Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Copolymer Informatics with Multi-Task Deep Neural Networks [0.0]
We address the property prediction challenge for copolymers, extending the polymer informatics framework beyond homopolymers.
A large data set containing over 18,000 data points of glass transition, melting, and degradation temperature of homopolymers and copolymers of up to two monomers is used.
The developed models are accurate, fast, flexible, and scalable to more copolymer properties when suitable data become available.
arXiv Detail & Related papers (2021-03-25T23:28:20Z) - Optimizing Molecules using Efficient Queries from Property Evaluations [66.66290256377376]
We propose QMO, a generic query-based molecule optimization framework.
QMO improves the desired properties of an input molecule based on efficient queries.
We show that QMO outperforms existing methods in the benchmark tasks of optimizing small organic molecules.
arXiv Detail & Related papers (2020-11-03T18:51:18Z) - Polymer Informatics: Current Status and Critical Next Steps [1.3238373064156097]
Surrogate models are trained on available polymer data for instant property prediction.
Data-driven strategies to tackle unique challenges resulting from the extraordinary chemical and physical diversity of polymers at small and large scales are being explored.
Methods to solve inverse problems, wherein polymer recommendations are made using advanced AI algorithms that meet application targets, are being investigated.
arXiv Detail & Related papers (2020-11-01T14:17:22Z) - Accelerating Antimicrobial Discovery with Controllable Deep Generative
Models and Molecular Dynamics [109.70543391923344]
CLaSS (Controlled Latent attribute Space Sampling) is an efficient computational method for attribute-controlled generation of molecules.
We screen the generated molecules for additional key attributes by using deep learning classifiers in conjunction with novel features derived from atomistic simulations.
The proposed approach is demonstrated for designing non-toxic antimicrobial peptides (AMPs) with strong broad-spectrum potency.
arXiv Detail & Related papers (2020-05-22T15:57:58Z) - CogMol: Target-Specific and Selective Drug Design for COVID-19 Using
Deep Generative Models [74.58583689523999]
We propose an end-to-end framework, named CogMol, for designing new drug-like small molecules targeting novel viral proteins.
CogMol combines adaptive pre-training of a molecular SMILES Variational Autoencoder (VAE) and an efficient multi-attribute controlled sampling scheme.
CogMol handles multi-constraint design of synthesizable, low-toxic, drug-like molecules with high target specificity and selectivity.
arXiv Detail & Related papers (2020-04-02T18:17:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.