Targeted AMP generation through controlled diffusion with efficient embeddings
- URL: http://arxiv.org/abs/2504.17247v1
- Date: Thu, 24 Apr 2025 04:53:04 GMT
- Title: Targeted AMP generation through controlled diffusion with efficient embeddings
- Authors: Diogo Soares, Leon Hetzel, Paulina Szymczak, Fabian Theis, Stephan Günnemann, Ewa Szczurek,
- Abstract summary: Deep learning-based antimicrobial peptide (AMP) discovery faces critical challenges such as low experimental hit rates.<n>We introduce OmegAMP, a framework that leverages a diffusion-based generative model with efficient low-dimensional embeddings.<n>We demonstrate that OmegAMP achieves state-of-the-art performance across all stages of the AMP discovery pipeline.
- Score: 38.37635965727843
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning-based antimicrobial peptide (AMP) discovery faces critical challenges such as low experimental hit rates as well as the need for nuanced controllability and efficient modeling of peptide properties. To address these challenges, we introduce OmegAMP, a framework that leverages a diffusion-based generative model with efficient low-dimensional embeddings, precise controllability mechanisms, and novel classifiers with drastically reduced false positive rates for candidate filtering. OmegAMP enables the targeted generation of AMPs with specific physicochemical properties, activity profiles, and species-specific effectiveness. Moreover, it maximizes sample diversity while ensuring faithfulness to the underlying data distribution during generation. We demonstrate that OmegAMP achieves state-of-the-art performance across all stages of the AMP discovery pipeline, significantly advancing the potential of computational frameworks in combating antimicrobial resistance.
Related papers
- Adaptive teachers for amortized samplers [76.88721198565861]
We propose an adaptive training distribution (the teacher) to guide the training of the primary amortized sampler (the student)<n>We validate the effectiveness of this approach in a synthetic environment designed to present an exploration challenge.
arXiv Detail & Related papers (2024-10-02T11:33:13Z) - HMAMP: Hypervolume-Driven Multi-Objective Antimicrobial Peptides Design [11.891046340221735]
This paper introduces a paradigm shift by considering multiple attributes in Antimicrobial peptides (AMPs) design.
By synergizing reinforcement learning and a descent algorithm rooted in the hypervolume of AMP concept, HMAMP effectively expands exploration space and mitigates the issue of pattern collapse.
A detailed analysis of the helical structures and molecular dynamics simulations for ten potential candidate AMPs validates the superiority of HMAMP in the realm of multi-objective AMP design.
arXiv Detail & Related papers (2024-05-01T07:17:59Z) - AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides [4.826446796830595]
This study introduces a quantitative definition and benchmarking framework AMPCliff for the AC phenomenon in antimicrobial peptides (AMPs) composed by canonical amino acids.
AMPCliff quantifies the activities of AMPs by the MIC, and defines 0.9 as the minimum threshold for the normalized BLOSUM62 similarity score between a pair of aligned peptides with at least two-fold MIC changes.
Our analysis reveals that these models are capable of detecting AMP AC events and the pre-trained protein language model ESM2 demonstrates superior performance across the evaluations.
arXiv Detail & Related papers (2024-04-15T12:40:12Z) - Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation [53.27596811146316]
Diffusion models operate over a sequence of timesteps instead of instantaneous input-output relationships in previous contexts.
We present Diffusion-TracIn that incorporates this temporal dynamics and observe that samples' loss gradient norms are highly dependent on timestep.
We introduce Diffusion-ReTrac as a re-normalized adaptation that enables the retrieval of training samples more targeted to the test sample of interest.
arXiv Detail & Related papers (2024-01-17T07:58:18Z) - Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood [64.95663299945171]
Training energy-based models (EBMs) on high-dimensional data can be both challenging and time-consuming.
There exists a noticeable gap in sample quality between EBMs and other generative frameworks like GANs and diffusion models.
We propose cooperative diffusion recovery likelihood (CDRL), an effective approach to tractably learn and sample from a series of EBMs.
arXiv Detail & Related papers (2023-09-10T22:05:24Z) - Artificial intelligence-driven antimicrobial peptide discovery [0.0]
Antimicrobial peptides (AMPs) emerge as promising agents against antimicrobial resistance.
AMPs provide an alternative to conventional antibiotics.
Artificial intelligence (AI) revolutionized AMP discovery through both discrimination and generation approaches.
arXiv Detail & Related papers (2023-08-21T14:02:14Z) - Protein Design with Guided Discrete Diffusion [67.06148688398677]
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling.
We propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models.
NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods.
arXiv Detail & Related papers (2023-05-31T16:31:24Z) - Accelerating Antimicrobial Peptide Discovery with Latent Structure [33.288514128470425]
We propose a latent sequence-structure model for designing AMPs (LSSAMP)
LSSAMP exploits multi-scale vector quantization in the latent space to represent secondary structures.
Experimental results show that the peptides generated by LSSAMP have a high probability of antimicrobial activity.
arXiv Detail & Related papers (2022-11-28T06:43:32Z) - Graph-Based Active Machine Learning Method for Diverse and Novel
Antimicrobial Peptides Generation and Selection [57.131117785001194]
Large-scale screening of new AMP candidates is expensive, time-consuming, and now affordable in developing countries.
We propose a novel active machine learning-based framework that statistically minimizes the number of wet-lab experiments needed to design new AMPs.
arXiv Detail & Related papers (2022-09-18T14:30:48Z) - Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited
Data [125.7135706352493]
Generative adversarial networks (GANs) typically require ample data for training in order to synthesize high-fidelity images.
Recent studies have shown that training GANs with limited data remains formidable due to discriminator overfitting.
This paper introduces a novel strategy called Adaptive Pseudo Augmentation (APA) to encourage healthy competition between the generator and the discriminator.
arXiv Detail & Related papers (2021-11-12T18:13:45Z) - Accelerating Antimicrobial Discovery with Controllable Deep Generative
Models and Molecular Dynamics [109.70543391923344]
CLaSS (Controlled Latent attribute Space Sampling) is an efficient computational method for attribute-controlled generation of molecules.
We screen the generated molecules for additional key attributes by using deep learning classifiers in conjunction with novel features derived from atomistic simulations.
The proposed approach is demonstrated for designing non-toxic antimicrobial peptides (AMPs) with strong broad-spectrum potency.
arXiv Detail & Related papers (2020-05-22T15:57:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.