Machine Learning Coarse-Grained Potentials of Protein Thermodynamics
- URL: http://arxiv.org/abs/2212.07492v1
- Date: Wed, 14 Dec 2022 20:23:11 GMT
- Title: Machine Learning Coarse-Grained Potentials of Protein Thermodynamics
- Authors: Maciej Majewski, Adri\`a P\'erez, Philipp Th\"olke, Stefan Doerr,
Nicholas E. Charron, Toni Giorgino, Brooke E. Husic, Cecilia Clementi, Frank
No\'e and Gianni De Fabritiis
- Abstract summary: We build coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics.
For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins.
The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems.
- Score: 1.3543803103181613
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A generalized understanding of protein dynamics is an unsolved scientific
problem, the solution of which is critical to the interpretation of the
structure-function relationships that govern essential biological processes.
Here, we approach this problem by constructing coarse-grained molecular
potentials based on artificial neural networks and grounded in statistical
mechanics. For training, we build a unique dataset of unbiased all-atom
molecular dynamics simulations of approximately 9 ms for twelve different
proteins with multiple secondary structure arrangements. The coarse-grained
models are capable of accelerating the dynamics by more than three orders of
magnitude while preserving the thermodynamics of the systems. Coarse-grained
simulations identify relevant structural states in the ensemble with comparable
energetics to the all-atom systems. Furthermore, we show that a single
coarse-grained potential can integrate all twelve proteins and can capture
experimental structural features of mutated proteins. These results indicate
that machine learning coarse-grained potentials could provide a feasible
approach to simulate and understand protein dynamics.
Related papers
- A DNN Biophysics Model with Topological and Electrostatic Features [0.0]
The model uses multi-scale and uniform topological and electrostatic features generated with protein structural information and force field.
The machine learning simulation on over 4000 protein structures shows the efficiency and fidelity of these features.
This model shows its potential as a general tool in assisting biophysical properties and function prediction for the broad biomolecules.
arXiv Detail & Related papers (2024-09-05T16:11:40Z) - Protein Conformation Generation via Force-Guided SE(3) Diffusion Models [48.48934625235448]
Deep generative modeling techniques have been employed to generate novel protein conformations.
We propose a force-guided SE(3) diffusion model, ConfDiff, for protein conformation generation.
arXiv Detail & Related papers (2024-03-21T02:44:08Z) - Top-down machine learning of coarse-grained protein force-fields [2.1485350418225244]
Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential.
Remarkably, this method requires only the native conformation of proteins, eliminating the need for labeled data.
By applying Markov State Models, native-like conformations of the simulated proteins can be predicted from the coarse-grained simulations.
arXiv Detail & Related papers (2023-06-20T08:31:24Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - A Latent Diffusion Model for Protein Structure Generation [50.74232632854264]
We propose a latent diffusion model that can reduce the complexity of protein modeling.
We show that our method can effectively generate novel protein backbone structures with high designability and efficiency.
arXiv Detail & Related papers (2023-05-06T19:10:19Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - Protein Structure and Sequence Generation with Equivariant Denoising
Diffusion Probabilistic Models [3.5450828190071646]
An important task in bioengineering is designing proteins with specific 3D structures and chemical properties which enable targeted functions.
We introduce a generative model of both protein structure and sequence that can operate at significantly larger scales than previous molecular generative modeling approaches.
arXiv Detail & Related papers (2022-05-26T16:10:09Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Transferring Chemical and Energetic Knowledge Between Molecular Systems
with Machine Learning [5.27145343046974]
We propose a novel methodology for transferring knowledge obtained from simple molecular systems to a more complex one.
We focus on the classification of high and low free-energy states.
Our results show a remarkable AUC of 0.92 for transfer learning from tri-alanine to the deca-alanine system.
arXiv Detail & Related papers (2022-05-06T16:21:00Z) - BERTology Meets Biology: Interpreting Attention in Protein Language
Models [124.8966298974842]
We demonstrate methods for analyzing protein Transformer models through the lens of attention.
We show that attention captures the folding structure of proteins, connecting amino acids that are far apart in the underlying sequence, but spatially close in the three-dimensional structure.
We also present a three-dimensional visualization of the interaction between attention and protein structure.
arXiv Detail & Related papers (2020-06-26T21:50:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.