Navigating protein landscapes with a machine-learned transferable
coarse-grained model
- URL: http://arxiv.org/abs/2310.18278v1
- Date: Fri, 27 Oct 2023 17:10:23 GMT
- Title: Navigating protein landscapes with a machine-learned transferable
coarse-grained model
- Authors: Nicholas E. Charron, Felix Musil, Andrea Guljas, Yaoyi Chen, Klara
Bonneau, Aldo S. Pasos-Trejo, Jacopo Venturin, Daria Gusew, Iryna
Zaporozhets, Andreas Kr\"amer, Clark Templeton, Atharva Kelkar, Aleksander E.
P. Durumeric, Simon Olsson, Adri\`a P\'erez, Maciej Majewski, Brooke E.
Husic, Ankit Patel, Gianni De Fabritiis, Frank No\'e, Cecilia Clementi
- Abstract summary: coarse-grained (CG) model with similar prediction performance has been a long-standing challenge.
We develop a bottom-up CG force field with chemical transferability, which can be used for extrapolative molecular dynamics on new sequences.
We demonstrate that the model successfully predicts folded structures, intermediates, metastable folded and unfolded basins, and the fluctuations of intrinsically disordered proteins.
- Score: 29.252004942896875
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The most popular and universally predictive protein simulation models employ
all-atom molecular dynamics (MD), but they come at extreme computational cost.
The development of a universal, computationally efficient coarse-grained (CG)
model with similar prediction performance has been a long-standing challenge.
By combining recent deep learning methods with a large and diverse training set
of all-atom protein simulations, we here develop a bottom-up CG force field
with chemical transferability, which can be used for extrapolative molecular
dynamics on new sequences not used during model parametrization. We demonstrate
that the model successfully predicts folded structures, intermediates,
metastable folded and unfolded basins, and the fluctuations of intrinsically
disordered proteins while it is several orders of magnitude faster than an
all-atom model. This showcases the feasibility of a universal and
computationally efficient machine-learned CG model for proteins.
Related papers
- Generative Modeling of Molecular Dynamics Trajectories [12.255021091552441]
We introduce generative modeling of molecular trajectories as a paradigm for learning flexible multi-task surrogate models of MD from data.
We show such generative models can be adapted to diverse tasks such as forward simulation, transition path sampling, and trajectory upsampling.
arXiv Detail & Related papers (2024-09-26T13:02:28Z) - A Microstructure-based Graph Neural Network for Accelerating Multiscale
Simulations [0.0]
We introduce an alternative surrogate modeling strategy that allows for keeping the multiscale nature of the problem.
We achieve this by predicting full-field microscopic strains using a graph neural network (GNN) while retaining the microscopic material model.
We demonstrate for several challenging scenarios that the surrogate can predict complex macroscopic stress-strain paths.
arXiv Detail & Related papers (2024-02-20T15:54:24Z) - A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics [73.35846234413611]
In drug discovery, molecular dynamics (MD) simulation provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites.
We propose NeuralMD, the first machine learning (ML) surrogate that can facilitate numerical MD and provide accurate simulations in protein-ligand binding dynamics.
We demonstrate the efficiency and effectiveness of NeuralMD, achieving over 1K$times$ speedup compared to standard numerical MD simulations.
arXiv Detail & Related papers (2024-01-26T09:35:17Z) - Top-down machine learning of coarse-grained protein force-fields [2.1485350418225244]
Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential.
Remarkably, this method requires only the native conformation of proteins, eliminating the need for labeled data.
By applying Markov State Models, native-like conformations of the simulated proteins can be predicted from the coarse-grained simulations.
arXiv Detail & Related papers (2023-06-20T08:31:24Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Statistically Optimal Force Aggregation for Coarse-Graining Molecular
Dynamics [55.41644538483948]
coarse-grained (CG) models have the potential to simulate large molecular complexes beyond what is possible with atomistic molecular dynamics.
A widely used methodology for learning CG force-fields maps forces from all-atom molecular dynamics to the CG representation and matches them with a CG force-field on average.
We show that there is flexibility in how to map all-atom forces to the CG representation, and that the most commonly used mapping methods are statistically inefficient and potentially even incorrect in the presence of constraints in the all-atom simulation.
arXiv Detail & Related papers (2023-02-14T14:35:39Z) - Two for One: Diffusion Models and Force Fields for Coarse-Grained
Molecular Dynamics [15.660348943139711]
We leverage connections between score-based generative models, force fields and molecular dynamics to learn a CG force field without requiring any force inputs during training.
While having a vastly simplified training setup compared to previous work, we demonstrate that our approach leads to improved performance across several small- to medium-sized protein simulations.
arXiv Detail & Related papers (2023-02-01T17:09:46Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Learning Neural Generative Dynamics for Molecular Conformation
Generation [89.03173504444415]
We study how to generate molecule conformations (textiti.e., 3D structures) from a molecular graph.
We propose a novel probabilistic framework to generate valid and diverse conformations given a molecular graph.
arXiv Detail & Related papers (2021-02-20T03:17:58Z) - Embedded-physics machine learning for coarse-graining and collective
variable discovery without data [3.222802562733787]
We present a novel learning framework that consistently embeds underlying physics.
We propose a novel objective based on reverse Kullback-Leibler divergence that fully incorporates the available physics in the form of the atomistic force field.
We demonstrate the algorithmic advances in terms of predictive ability and the physical meaning of the revealed CVs for a bimodal potential energy function and the alanine dipeptide.
arXiv Detail & Related papers (2020-02-24T10:28:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.