Related papers: NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models

NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models

URL: http://arxiv.org/abs/2412.10743v2
Date: Wed, 18 Dec 2024 21:35:10 GMT
Title: NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models
Authors: Zhuoran Qiao, Feizhi Ding, Thomas Dresselhaus, Mia A. Rosenfeld, Xiaotian Han, Owen Howell, Aniketh Iyengar, Stephen Opalenski, Anders S. Christensen, Sai Krishna Sirumalla, Frederick R. Manby, Thomas F. Miller III, Matthew Welborn,
Abstract summary: We present NeuralPLexer3, a flow-based generative model that achieves state-of-the-art prediction accuracy on key biomolecular interaction types.<n> Examined through newly developed benchmarking strategies, NeuralPLexer3 excels in vital areas that are crucial to structure-based drug design.
Score: 6.75152379258166
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Structure determination is essential to a mechanistic understanding of diseases and the development of novel therapeutics. Machine-learning-based structure prediction methods have made significant advancements by computationally predicting protein and bioassembly structures from sequences and molecular topology alone. Despite substantial progress in the field, challenges remain to deliver structure prediction models to real-world drug discovery. Here, we present NeuralPLexer3 -- a physics-inspired flow-based generative model that achieves state-of-the-art prediction accuracy on key biomolecular interaction types and improves training and sampling efficiency compared to its predecessors and alternative methodologies. Examined through newly developed benchmarking strategies, NeuralPLexer3 excels in vital areas that are crucial to structure-based drug design, such as physical validity and ligand-induced conformational changes.

Related papers

DISPROTBENCH: A Disorder-Aware, Task-Rich Benchmark for Evaluating Protein Structure Prediction in Realistic Biological Contexts [76.59606029593085]
DisProtBench is a benchmark for evaluating protein structure prediction models (PSPMs) under structural disorder and complex biological conditions.<n>DisProtBench spans three key axes: data complexity, task diversity, and Interpretability.<n>Results reveal significant variability in model robustness under disorder, with low-confidence regions linked to functional prediction failures.
arXiv Detail & Related papers (2025-06-18T23:58:22Z)
Aligning Protein Conformation Ensemble Generation with Physical Feedback [29.730515284798397]
Energy-based Alignment (EBA) is a method that aligns generative models with feedback from physical models.<n>EBA achieves state-of-the-art performance in generating high-quality protein ensembles.
arXiv Detail & Related papers (2025-05-30T04:33:39Z)
A Symbolic and Statistical Learning Framework to Discover Bioprocessing Regulatory Mechanism: Cell Culture Example [2.325005809983534]
This paper introduces a symbolic and statistical learning framework to identify key regulatory mechanisms and model uncertainty.<n>A Metropolis-adjusted Langevin algorithm with adjoint sensitivity analysis is developed for posterior exploration.<n>An empirical study demonstrates its ability to recover missing regulatory mechanisms and improve model fidelity under datalimited conditions.
arXiv Detail & Related papers (2025-05-06T04:39:34Z)
PharMolixFM: All-Atom Foundation Models for Molecular Modeling and Generation [4.402280157389038]
We propose PharMolixFM, a unified framework for constructing all-atom foundation models. Our framework includes three variants using state-of-the-art multi-modal generative models. PharMolixFM-Diff achieves competitive prediction accuracy in protein-small-molecule docking.
arXiv Detail & Related papers (2025-03-12T12:53:43Z)
UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion [61.690978792873196]
Existing approaches rely on either autoregressive sequence models or diffusion models. We propose UniGenX, a unified framework that combines autoregressive next-token prediction with conditional diffusion models. We validate the effectiveness of UniGenX on material and small molecule generation tasks.
arXiv Detail & Related papers (2025-03-09T16:43:07Z)
GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present GENERator, a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters. Trained on an expansive dataset comprising 386B bp of DNA, the GENERator demonstrates state-of-the-art performance across both established and newly proposed benchmarks. It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of enhancer sequences with specific activity profiles.
arXiv Detail & Related papers (2025-02-11T05:39:49Z)
No Foundations without Foundations -- Why semi-mechanistic models are essential for regulatory biology [5.925258390690544]
We argue that genuine "foundation models" of regulatory biology will remain out of reach unless guided by frameworks that integrate mechanistic insight with principled experimental design. We present one such ground-up, semi-mechanistic framework that unifies perturbation-based experimental designs.
arXiv Detail & Related papers (2025-01-31T14:43:16Z)
Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms. This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z)
RNA Secondary Structure Prediction Using Transformer-Based Deep Learning Models [13.781096813376145]
The Human Genome Project has led to an exponential increase in data related to the sequence, structure, and function of biomolecules. This paper discusses the fundamental concepts of RNA, RNA secondary structure, and its prediction. The application of machine learning technologies in predicting the structure of biological macromolecules is explored.
arXiv Detail & Related papers (2024-04-14T08:36:14Z)
A Data-Driven Approach to Morphogenesis under Structural Instability [1.223779595809275]
We propose a data-driven approach to understand and predict morphological complexities. A machine-learning framework is proposed based on the physical modeling of morphogenesis triggered by internal or external forcing.
arXiv Detail & Related papers (2023-08-23T00:51:43Z)
Towards Predicting Equilibrium Distributions for Molecular Systems with Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems. DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z)
Modeling Molecular Structures with Intrinsic Diffusion Models [2.487445341407889]
This thesis proposes Intrinsic Diffusion Modeling. It combines diffusion generative models with scientific knowledge about the flexibility of biological complexes. We demonstrate the effectiveness of this approach on two fundamental tasks at the basis of computational chemistry and biology.
arXiv Detail & Related papers (2023-02-23T03:26:48Z)
Bidirectional Generation of Structure and Properties Through a Single Molecular Foundation Model [44.60174246341653]
We present a novel multimodal molecular pre-trained model that incorporates the modalities of structure and biochemical properties. Our proposed model pipeline of data handling and training objectives aligns the structure/property features in a common embedding space. These contributions emerge synergistic knowledge, allowing us to tackle both multimodal and unimodal downstream tasks through a single model.
arXiv Detail & Related papers (2022-11-19T05:16:08Z)
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures. Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z)
Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation. We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z)
Transfer Learning for Protein Structure Classification at Low Resolution [124.5573289131546]
We show that it is possible to make accurate ($geq$80%) predictions of protein class and architecture from structures determined at low ($leq$3A) resolution. We provide proof of concept for high-speed, low-cost protein structure classification at low resolution, and a basis for extension to prediction of function.
arXiv Detail & Related papers (2020-08-11T15:01:32Z)
Generating Tertiary Protein Structures via an Interpretative Variational Autoencoder [16.554053012204182]
This paper proposes and evaluates an alternative approach to generating functionally-relevant three-dimensional structures of a protein. A comprehensive evaluation of several deep architectures shows the promise of generative models in directly revealing the latent space for sampling novel tertiary structures.
arXiv Detail & Related papers (2020-04-08T17:40:21Z)

This list is automatically generated from the titles and abstracts of the papers in this site.