Accurate and Efficient Structural Ensemble Generation of Macrocyclic Peptides using Internal Coordinate Diffusion
- URL: http://arxiv.org/abs/2305.19800v2
- Date: Tue, 13 Aug 2024 18:19:21 GMT
- Title: Accurate and Efficient Structural Ensemble Generation of Macrocyclic Peptides using Internal Coordinate Diffusion
- Authors: Colin A. Grambow, Hayley Weir, Nathaniel L. Diamant, Gabriele Scalia, Tommaso Biancalani, Kangway V. Chuang,
- Abstract summary: RINGER is a diffusion-based transformer model that generates 3D conformational ensembles of macrocyclic peptides from their 2D representations.
We show how RINGER generates both high-quality and diverse geometries at a fraction of the computational cost.
- Score: 0.5475672579692472
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Macrocyclic peptides are an emerging therapeutic modality, yet computational approaches for accurately sampling their diverse 3D ensembles remain challenging due to their conformational diversity and geometric constraints. Here, we introduce RINGER, a diffusion-based transformer model using a redundant internal coordinate representation that generates three-dimensional conformational ensembles of macrocyclic peptides from their 2D representations. RINGER provides fast backbone and side-chain sampling while respecting key structural invariances of cyclic peptides. Through extensive benchmarking and analysis against gold-standard conformer ensembles of cyclic peptides generated with metadynamics, we demonstrate how RINGER generates both high-quality and diverse geometries at a fraction of the computational cost. Our work lays the foundation for improved sampling of cyclic geometries and the development of geometric learning methods for peptides.
Related papers
- Zero-Shot Cyclic Peptide Design via Composable Geometric Constraints [65.77915791312634]
We propose CP-Composer, a novel generative framework that enables zero-shot cyclic peptide generation.<n>Our approach decomposes complex cyclization patterns into unit constraints, which are incorporated into a diffusion model.<n>Our model, despite trained with linear peptides, is capable of generating diverse target-binding cyclic peptides, reaching success rates from 38% to 84%.
arXiv Detail & Related papers (2025-07-06T03:30:45Z) - Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling [58.384011300570585]
We introduce CpSDE, which consists of two key components: AtomSDE, a generative structure prediction model based on harmonic SDE, and Res, a residue type predictor.<n>CpSDE overcomes existing data limitations and is proficient in designing a wide variety of cyclic peptides.<n>Our experimental results demonstrate that the cyclic peptides designed by our method exhibit reliable stability and affinity.
arXiv Detail & Related papers (2025-05-27T17:24:12Z) - CLR-Wire: Towards Continuous Latent Representations for 3D Curve Wireframe Generation [11.447223770747051]
CLR ContinuousWire encodes curves as Parametric Curves along with their Parametric Curves into a continuous and fixed latent space.
This unified approach generates both geometry and topology.
arXiv Detail & Related papers (2025-04-27T09:32:42Z) - EquiCPI: SE(3)-Equivariant Geometric Deep Learning for Structure-Aware Prediction of Compound-Protein Interactions [0.0]
EquiCPI is an end-to-end geometric deep learning framework that synergizes first-principles structural modeling with SE(3)-equivariant neural networks.
At its core, EquiCPI employs SE(3)-equivariant message passing over atomic point clouds, preserving symmetry under rotations, translations, and reflections.
The proposed model is evaluated on BindingDB (affinity prediction) and DUD-E (virtual screening), EquiCPI performance on par with or exceeding the state-of-the-art deep learning competitors.
arXiv Detail & Related papers (2025-04-07T00:57:08Z) - Structure Language Models for Protein Conformation Generation [66.42864253026053]
Traditional physics-based simulation methods often struggle with sampling equilibrium conformations.
Deep generative models have shown promise in generating protein conformations as a more efficient alternative.
We introduce Structure Language Modeling as a novel framework for efficient protein conformation generation.
arXiv Detail & Related papers (2024-10-24T03:38:51Z) - DPLM-2: A Multimodal Diffusion Protein Language Model [75.98083311705182]
We introduce DPLM-2, a multimodal protein foundation model that extends discrete diffusion protein language model (DPLM) to accommodate both sequences and structures.
DPLM-2 learns the joint distribution of sequence and structure, as well as their marginals and conditionals.
Empirical evaluation shows that DPLM-2 can simultaneously generate highly compatible amino acid sequences and their corresponding 3D structures.
arXiv Detail & Related papers (2024-10-17T17:20:24Z) - Renormalized Internally-Contracted Multireference Coupled Cluster with Perturbative Triples [0.0]
We combine the many-body formulation of the internally contracted multireference coupled cluster (ic-MRCC) method with Evangelista's multireference formulation of the driven similarity renormalization group (DSRG)
We denote the new approach, the renormalized ic-MRCC (ric-MRCC) method.
We demonstrate the accuracy of our approaches in comparison to advanced multireference methods for the potential energy curves of H8, F2, H2O, N2, and Cr2.
arXiv Detail & Related papers (2024-05-25T09:15:41Z) - Denoising diffusion-based synthetic generation of three-dimensional (3D)
anisotropic microstructures from two-dimensional (2D) micrographs [0.0]
We present a framework for reconstruction of anisotropic microstructures based on conditional diffusion-based generative models (DGMs)
The proposed framework involves spatial connection of multiple 2D conditional DGMs, each trained to generate 2D microstructure samples for three different planes.
The results demonstrate that the framework is capable of reproducing not only the statistical distribution of material phases but also the material properties in 3D space.
arXiv Detail & Related papers (2023-12-13T01:36:37Z) - CREMP: Conformer-rotamer ensembles of macrocyclic peptides for machine learning [0.1747623282473278]
We introduce CREMP, a resource for the rapid development and evaluation of machine learning models for macrocyclic peptides.
CREMP contains 36,198 unique macrocyclic peptides and their high-quality structural ensembles generated using the Conformer-Rotamer Ensemble Sampling Tool (CREST)
Altogether, this new dataset contains nearly 31.3 million unique macrocycle geometries, each annotated with energies derived from semi-empirical extended tight-binding (xTB) DFT calculations.
arXiv Detail & Related papers (2023-05-14T03:50:46Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes [62.45415756883437]
Jointly matching multiple, non-rigidly deformed 3D shapes is a challenging, $mathcalNP$-hard problem.
Existing quantum shape-matching methods do not support multiple shapes and even less cycle consistency.
This paper introduces the first quantum-hybrid approach for 3D shape multi-matching.
arXiv Detail & Related papers (2023-03-28T17:59:55Z) - Entropic trust region for densest crystallographic symmetry group
packings [0.8399688944263843]
Molecular crystal structure prediction seeks the most stable periodic structure given the chemical composition of a molecule and pressure-temperature conditions.
Modern CSP solvers use global optimization methods to search for structures with minimal free energy within a complex energy landscape induced by intermolecular potentials.
We propose a class of periodic packings restricted to crystallographic symmetry groups (CSG) and design a search method for the densest CSG packings in an information-geometric framework.
arXiv Detail & Related papers (2022-02-24T08:39:07Z) - Three-fold way of entanglement dynamics in monitored quantum circuits [68.8204255655161]
We investigate the measurement-induced entanglement transition in quantum circuits built upon Dyson's three circular ensembles.
We obtain insights into the interplay between the local entanglement generation by the gates and the entanglement reduction by the measurements.
arXiv Detail & Related papers (2022-01-28T17:21:15Z) - A deep learning driven pseudospectral PCE based FFT homogenization
algorithm for complex microstructures [68.8204255655161]
It is shown that the proposed method is able to predict central moments of interest while being magnitudes faster to evaluate than traditional approaches.
It is shown, that the proposed method is able to predict central moments of interest while being magnitudes faster to evaluate than traditional approaches.
arXiv Detail & Related papers (2021-10-26T07:02:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.