Peptide Vaccine Design by Evolutionary Multi-Objective Optimization
- URL: http://arxiv.org/abs/2406.05743v1
- Date: Sun, 9 Jun 2024 11:28:55 GMT
- Title: Peptide Vaccine Design by Evolutionary Multi-Objective Optimization
- Authors: Dan-Xuan Liu, Yi-Heng Xu, Chao Qian,
- Abstract summary: The main challenge of peptide vaccine design is selecting an effective subset of peptides due to the allelic diversity among individuals.
Previous works mainly formulated this task as a constrained optimization problem, aiming to maximize the expected number of peptide-Major Histocompatibility Complex (peptide-MHC) bindings.
We propose a new framework-EMO based on Multi-objective Optimization, which reformulates peptide Vaccine Design as a bi-objective optimization problem.
- Score: 34.83487850400559
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Peptide vaccines are growing in significance for fighting diverse diseases. Machine learning has improved the identification of peptides that can trigger immune responses, and the main challenge of peptide vaccine design now lies in selecting an effective subset of peptides due to the allelic diversity among individuals. Previous works mainly formulated this task as a constrained optimization problem, aiming to maximize the expected number of peptide-Major Histocompatibility Complex (peptide-MHC) bindings across a broad range of populations by selecting a subset of diverse peptides with limited size; and employed a greedy algorithm, whose performance, however, may be limited due to the greedy nature. In this paper, we propose a new framework PVD-EMO based on Evolutionary Multi-objective Optimization, which reformulates Peptide Vaccine Design as a bi-objective optimization problem that maximizes the expected number of peptide-MHC bindings and minimizes the number of selected peptides simultaneously, and employs a Multi-Objective Evolutionary Algorithm (MOEA) to solve it. We also incorporate warm-start and repair strategies into MOEAs to improve efficiency and performance. We prove that the warm-start strategy ensures that PVD-EMO maintains the same worst-case approximation guarantee as the previous greedy algorithm, and meanwhile, the EMO framework can help avoid local optima. Experiments on a peptide vaccine design for COVID-19, caused by the SARS-CoV-2 virus, demonstrate the superiority of PVD-EMO.
Related papers
- A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer [53.85265022754878]
We propose a lightweight DDG predictor (Light-DDG) for fast mutation screening.
We also release a large-scale dataset containing millions of mutation data for pre-training Light-DDG.
For the target antibody, we propose a novel Mutation Explainer to learn mutation preferences.
arXiv Detail & Related papers (2025-02-10T09:26:57Z) - PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion [2.6668932659159905]
We present PepTune, a multi-objective discrete diffusion model for the simultaneous generation and optimization of therapeutic peptide SMILES.
We generate diverse, chemically-modified peptides optimized for multiple therapeutic properties, including target binding affinity, membrane permeability, solubility, hemolysis, and non-fouling characteristics.
In total, our results demonstrate that PepTune is a powerful and modular approach for multi-objective sequence design in discrete state spaces.
arXiv Detail & Related papers (2024-12-23T18:38:49Z) - Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension [34.5830296205857]
PepHAR is a hot-spot-driven autoregressive generative model for designing peptides targeting specific proteins.
By combining hot spot sampling with fragment-based extension, our approach enables de novo peptide design tailored to a target protein.
arXiv Detail & Related papers (2024-11-26T15:13:17Z) - Active learning for affinity prediction of antibodies [45.58662352490961]
For large molecules such as antibodies, identifying mutations that enhance antibody affinity is challenging.
FERB methods can offer valuable insights into how different mutations will impact the potency and selectivity of a drug candidate.
We present an active learning framework that iteratively proposes sequences for simulators to evaluate, thereby accelerating the search for improved binders.
arXiv Detail & Related papers (2024-06-11T13:42:49Z) - MoFormer: Multi-objective Antimicrobial Peptide Generation Based on Conditional Transformer Joint Multi-modal Fusion Descriptor [15.98003148948758]
We establish a multi-objective AMP synthesis pipeline (MoFormer) for the simultaneous optimization of multi-attributes of AMPs.
MoFormer improves the desired attributes of AMP sequences in a highly structured latent space, guided by conditional constraints and fine-grained multi-descriptor.
We show that MoFormer outperforms existing methods in the generation task of enhanced antimicrobial activity and minimal hemolysis.
arXiv Detail & Related papers (2024-06-03T07:17:18Z) - LightCPPgen: An Explainable Machine Learning Pipeline for Rational Design of Cell Penetrating Peptides [0.32985979395737786]
We introduce an innovative approach for the de novo design of CPPs, leveraging the strengths of machine learning (ML) and optimization algorithms.
Our strategy, named Light CPPgen, integrates a LightGBM-based predictive model with a genetic algorithm (GA)
The GA solutions specifically target the candidate sequences' penetrability score, while trying to maximize similarity with the original non-penetrating peptide.
arXiv Detail & Related papers (2024-05-31T10:57:25Z) - Efficient Prediction of Peptide Self-assembly through Sequential and
Graphical Encoding [57.89530563948755]
This work provides a benchmark analysis of peptide encoding with advanced deep learning models.
It serves as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.
arXiv Detail & Related papers (2023-07-17T00:43:33Z) - Designing Biological Sequences via Meta-Reinforcement Learning and
Bayesian Optimization [68.28697120944116]
We train an autoregressive generative model via Meta-Reinforcement Learning to propose promising sequences for selection.
We pose this problem as that of finding an optimal policy over a distribution of MDPs induced by sampling subsets of the data.
Our in-silico experiments show that meta-learning over such ensembles provides robustness against reward misspecification and achieves competitive results.
arXiv Detail & Related papers (2022-09-13T18:37:27Z) - Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning [50.57291257437373]
SARS-CoV-2 pandemic has created a global race for a cure.
One approach focuses on designing a novel variant of the human angiotensin-converting enzyme 2 (ACE2)
We formulate a novel protein design framework as a reinforcement learning problem.
arXiv Detail & Related papers (2020-12-03T07:35:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.