NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models
- URL: http://arxiv.org/abs/2412.10743v2
- Date: Wed, 18 Dec 2024 21:35:10 GMT
- Title: NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models
- Authors: Zhuoran Qiao, Feizhi Ding, Thomas Dresselhaus, Mia A. Rosenfeld, Xiaotian Han, Owen Howell, Aniketh Iyengar, Stephen Opalenski, Anders S. Christensen, Sai Krishna Sirumalla, Frederick R. Manby, Thomas F. Miller III, Matthew Welborn,
- Abstract summary: We present NeuralPLexer3, a flow-based generative model that achieves state-of-the-art prediction accuracy on key biomolecular interaction types.
Examined through newly developed benchmarking strategies, NeuralPLexer3 excels in vital areas that are crucial to structure-based drug design.
- Score: 6.75152379258166
- License:
- Abstract: Structure determination is essential to a mechanistic understanding of diseases and the development of novel therapeutics. Machine-learning-based structure prediction methods have made significant advancements by computationally predicting protein and bioassembly structures from sequences and molecular topology alone. Despite substantial progress in the field, challenges remain to deliver structure prediction models to real-world drug discovery. Here, we present NeuralPLexer3 -- a physics-inspired flow-based generative model that achieves state-of-the-art prediction accuracy on key biomolecular interaction types and improves training and sampling efficiency compared to its predecessors and alternative methodologies. Examined through newly developed benchmarking strategies, NeuralPLexer3 excels in vital areas that are crucial to structure-based drug design, such as physical validity and ligand-induced conformational changes.
Related papers
- GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters.
The model adheres to the central dogma of molecular biology, accurately generating protein-coding sequences.
It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of promoter sequences.
arXiv Detail & Related papers (2025-02-11T05:39:49Z) - No Foundations without Foundations -- Why semi-mechanistic models are essential for regulatory biology [5.925258390690544]
We argue that genuine "foundation models" of regulatory biology will remain out of reach unless guided by frameworks that integrate mechanistic insight with principled experimental design.
We present one such ground-up, semi-mechanistic framework that unifies perturbation-based experimental designs.
arXiv Detail & Related papers (2025-01-31T14:43:16Z) - PhyloGen: Language Model-Enhanced Phylogenetic Inference via Graph Structure Generation [50.80441546742053]
Phylogenetic trees elucidate evolutionary relationships among species.
Traditional Markov Chain Monte Carlo methods face slow convergence and computational burdens.
We propose PhyloGen, a novel method leveraging a pre-trained genomic language model.
arXiv Detail & Related papers (2024-12-25T08:33:05Z) - Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - RNA Secondary Structure Prediction Using Transformer-Based Deep Learning Models [13.781096813376145]
The Human Genome Project has led to an exponential increase in data related to the sequence, structure, and function of biomolecules.
This paper discusses the fundamental concepts of RNA, RNA secondary structure, and its prediction.
The application of machine learning technologies in predicting the structure of biological macromolecules is explored.
arXiv Detail & Related papers (2024-04-14T08:36:14Z) - Diffusion-Driven Generative Framework for Molecular Conformation
Prediction [0.66567375919026]
The rapid advancement of machine learning has revolutionized the precision of predictive modeling in this context.
This research introduces a cutting-edge generative framework named method.
Method views atoms as discrete entities and excels in guiding the reversal of diffusion.
arXiv Detail & Related papers (2023-12-22T11:49:39Z) - A Data-Driven Approach to Morphogenesis under Structural Instability [1.223779595809275]
We propose a data-driven approach to understand and predict morphological complexities.
A machine-learning framework is proposed based on the physical modeling of morphogenesis triggered by internal or external forcing.
arXiv Detail & Related papers (2023-08-23T00:51:43Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - Transfer Learning for Protein Structure Classification at Low Resolution [124.5573289131546]
We show that it is possible to make accurate ($geq$80%) predictions of protein class and architecture from structures determined at low ($leq$3A) resolution.
We provide proof of concept for high-speed, low-cost protein structure classification at low resolution, and a basis for extension to prediction of function.
arXiv Detail & Related papers (2020-08-11T15:01:32Z) - Generating Tertiary Protein Structures via an Interpretative Variational
Autoencoder [16.554053012204182]
This paper proposes and evaluates an alternative approach to generating functionally-relevant three-dimensional structures of a protein.
A comprehensive evaluation of several deep architectures shows the promise of generative models in directly revealing the latent space for sampling novel tertiary structures.
arXiv Detail & Related papers (2020-04-08T17:40:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.