OmegAMP: Targeted AMP Discovery through Biologically Informed Generation
- URL: http://arxiv.org/abs/2504.17247v2
- Date: Wed, 29 Oct 2025 11:30:12 GMT
- Title: OmegAMP: Targeted AMP Discovery through Biologically Informed Generation
- Authors: Diogo Soares, Leon Hetzel, Paulina Szymczak, Marcelo Der Torossian Torres, Johanna Sommer, Cesar de la Fuente-Nunez, Fabian Theis, Stephan Günnemann, Ewa Szczurek,
- Abstract summary: Deep learning-based antimicrobial peptide (AMP) discovery faces critical challenges such as limited controllability.<n>We introduce OmegAMP, a framework designed for reliable AMP generation with increased controllability.<n>Our in silico experiments demonstrate that OmegAMP delivers state-of-the-art performance across key stages of the AMP discovery pipeline.
- Score: 37.08970479806285
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning-based antimicrobial peptide (AMP) discovery faces critical challenges such as limited controllability, lack of representations that efficiently model antimicrobial properties, and low experimental hit rates. To address these challenges, we introduce OmegAMP, a framework designed for reliable AMP generation with increased controllability. Its diffusion-based generative model leverages a novel conditioning mechanism to achieve fine-grained control over desired physicochemical properties and to direct generation towards specific activity profiles, including species-specific effectiveness. This is further enhanced by a biologically informed encoding space that significantly improves overall generative performance. Complementing these generative capabilities, OmegAMP leverages a novel synthetic data augmentation strategy to train classifiers for AMP filtering, drastically reducing false positive rates and thereby increasing the likelihood of experimental success. Our in silico experiments demonstrate that OmegAMP delivers state-of-the-art performance across key stages of the AMP discovery pipeline, enabling us to achieve an unprecedented success rate in wet lab experiments. We tested 25 candidate peptides, 24 of them (96%) demonstrated antimicrobial activity, proving effective even against multi-drug resistant strains. Our findings underscore OmegAMP's potential to significantly advance computational frameworks in the fight against antimicrobial resistance.
Related papers
- MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design [2.624902795082451]
antimicrobial peptides (AMP) are being explored for their potent and promising ability to fight resistant pathogens.<n>Most AMP design models struggle to balance key goals like activity, toxicity, and novelty.<n>We introduce MAC-AMP, a closed-loop multi-agent collaboration system for multi-objective AMP design.
arXiv Detail & Related papers (2026-02-16T17:01:47Z) - A deep reinforcement learning platform for antibiotic discovery [101.30486136547285]
Antimicrobial resistance (AMR) is projected to cause up to 10 million deaths annually by 2050.<n>We present ApexAmphion, a deep-learning framework for de novo design of antibiotics.
arXiv Detail & Related papers (2025-09-16T18:21:42Z) - Iterative Distillation for Reward-Guided Fine-Tuning of Diffusion Models in Biomolecular Design [53.93023688824764]
We address the problem of fine-tuning diffusion models for reward-guided generation in biomolecular design.<n>We propose an iterative distillation-based fine-tuning framework that enables diffusion models to optimize for arbitrary reward functions.<n>Our off-policy formulation, combined with KL divergence minimization, enhances training stability and sample efficiency compared to existing RL-based methods.
arXiv Detail & Related papers (2025-07-01T05:55:28Z) - Progressive Tempering Sampler with Diffusion [50.06039228068602]
We propose a neural sampler that trains diffusion models sequentially across temperatures.<n>We also introduce a novel method to combine high-temperature diffusion models to generate approximate lower-temperature samples.<n>Our method significantly improves target evaluation efficiency, outperforming diffusion-based neural samplers.
arXiv Detail & Related papers (2025-06-05T16:46:04Z) - A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation [55.53426007439564]
Estimating individualized treatment effects from observational data is a central challenge in causal inference.<n>In inverse probability weighting (IPW) is a well-established solution to this problem, but its integration into modern deep learning frameworks remains limited.<n>We propose Importance-Weighted Diffusion Distillation (IWDD), a novel generative framework that combines the pretraining of diffusion models with importance-weighted score distillation.
arXiv Detail & Related papers (2025-05-16T17:00:52Z) - Adaptive teachers for amortized samplers [76.88721198565861]
We propose an adaptive training distribution (the teacher) to guide the training of the primary amortized sampler (the student)<n>We validate the effectiveness of this approach in a synthetic environment designed to present an exploration challenge.
arXiv Detail & Related papers (2024-10-02T11:33:13Z) - Regressor-free Molecule Generation to Support Drug Response Prediction [83.25894107956735]
Conditional generation based on the target IC50 score can obtain a more effective sampling space.
Regressor-free guidance combines a diffusion model's score estimation with a regression controller model's gradient based on number labels.
arXiv Detail & Related papers (2024-05-23T13:22:17Z) - HMAMP: Hypervolume-Driven Multi-Objective Antimicrobial Peptides Design [11.891046340221735]
This paper introduces a paradigm shift by considering multiple attributes in Antimicrobial peptides (AMPs) design.
By synergizing reinforcement learning and a descent algorithm rooted in the hypervolume of AMP concept, HMAMP effectively expands exploration space and mitigates the issue of pattern collapse.
A detailed analysis of the helical structures and molecular dynamics simulations for ten potential candidate AMPs validates the superiority of HMAMP in the realm of multi-objective AMP design.
arXiv Detail & Related papers (2024-05-01T07:17:59Z) - AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides [4.826446796830595]
This study introduces a quantitative definition and benchmarking framework AMPCliff for the AC phenomenon in antimicrobial peptides (AMPs) composed by canonical amino acids.
AMPCliff quantifies the activities of AMPs by the MIC, and defines 0.9 as the minimum threshold for the normalized BLOSUM62 similarity score between a pair of aligned peptides with at least two-fold MIC changes.
Our analysis reveals that these models are capable of detecting AMP AC events and the pre-trained protein language model ESM2 demonstrates superior performance across the evaluations.
arXiv Detail & Related papers (2024-04-15T12:40:12Z) - Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation [53.27596811146316]
Diffusion models operate over a sequence of timesteps instead of instantaneous input-output relationships in previous contexts.
We present Diffusion-TracIn that incorporates this temporal dynamics and observe that samples' loss gradient norms are highly dependent on timestep.
We introduce Diffusion-ReTrac as a re-normalized adaptation that enables the retrieval of training samples more targeted to the test sample of interest.
arXiv Detail & Related papers (2024-01-17T07:58:18Z) - Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood [64.95663299945171]
Training energy-based models (EBMs) on high-dimensional data can be both challenging and time-consuming.
There exists a noticeable gap in sample quality between EBMs and other generative frameworks like GANs and diffusion models.
We propose cooperative diffusion recovery likelihood (CDRL), an effective approach to tractably learn and sample from a series of EBMs.
arXiv Detail & Related papers (2023-09-10T22:05:24Z) - Artificial intelligence-driven antimicrobial peptide discovery [0.0]
Antimicrobial peptides (AMPs) emerge as promising agents against antimicrobial resistance.
AMPs provide an alternative to conventional antibiotics.
Artificial intelligence (AI) revolutionized AMP discovery through both discrimination and generation approaches.
arXiv Detail & Related papers (2023-08-21T14:02:14Z) - Protein Design with Guided Discrete Diffusion [67.06148688398677]
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling.
We propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models.
NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods.
arXiv Detail & Related papers (2023-05-31T16:31:24Z) - Accelerating Antimicrobial Peptide Discovery with Latent Structure [33.288514128470425]
We propose a latent sequence-structure model for designing AMPs (LSSAMP)
LSSAMP exploits multi-scale vector quantization in the latent space to represent secondary structures.
Experimental results show that the peptides generated by LSSAMP have a high probability of antimicrobial activity.
arXiv Detail & Related papers (2022-11-28T06:43:32Z) - Graph-Based Active Machine Learning Method for Diverse and Novel
Antimicrobial Peptides Generation and Selection [57.131117785001194]
Large-scale screening of new AMP candidates is expensive, time-consuming, and now affordable in developing countries.
We propose a novel active machine learning-based framework that statistically minimizes the number of wet-lab experiments needed to design new AMPs.
arXiv Detail & Related papers (2022-09-18T14:30:48Z) - Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited
Data [125.7135706352493]
Generative adversarial networks (GANs) typically require ample data for training in order to synthesize high-fidelity images.
Recent studies have shown that training GANs with limited data remains formidable due to discriminator overfitting.
This paper introduces a novel strategy called Adaptive Pseudo Augmentation (APA) to encourage healthy competition between the generator and the discriminator.
arXiv Detail & Related papers (2021-11-12T18:13:45Z) - Accelerating Antimicrobial Discovery with Controllable Deep Generative
Models and Molecular Dynamics [109.70543391923344]
CLaSS (Controlled Latent attribute Space Sampling) is an efficient computational method for attribute-controlled generation of molecules.
We screen the generated molecules for additional key attributes by using deep learning classifiers in conjunction with novel features derived from atomistic simulations.
The proposed approach is demonstrated for designing non-toxic antimicrobial peptides (AMPs) with strong broad-spectrum potency.
arXiv Detail & Related papers (2020-05-22T15:57:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.