Top-down machine learning of coarse-grained protein force-fields
- URL: http://arxiv.org/abs/2306.11375v3
- Date: Tue, 27 Jun 2023 12:02:21 GMT
- Title: Top-down machine learning of coarse-grained protein force-fields
- Authors: Carles Navarro, Maciej Majewski and Gianni de Fabritiis
- Abstract summary: Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential.
Remarkably, this method requires only the native conformation of proteins, eliminating the need for labeled data.
By applying Markov State Models, native-like conformations of the simulated proteins can be predicted from the coarse-grained simulations.
- Score: 2.1485350418225244
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Developing accurate and efficient coarse-grained representations of proteins
is crucial for understanding their folding, function, and interactions over
extended timescales. Our methodology involves simulating proteins with
molecular dynamics and utilizing the resulting trajectories to train a neural
network potential through differentiable trajectory reweighting. Remarkably,
this method requires only the native conformation of proteins, eliminating the
need for labeled data derived from extensive simulations or memory-intensive
end-to-end differentiable simulations. Once trained, the model can be employed
to run parallel molecular dynamics simulations and sample folding events for
proteins both within and beyond the training distribution, showcasing its
extrapolation capabilities. By applying Markov State Models, native-like
conformations of the simulated proteins can be predicted from the
coarse-grained simulations. Owing to its theoretical transferability and
ability to use solely experimental static structures as training data, we
anticipate that this approach will prove advantageous for developing new
protein force fields and further advancing the study of protein dynamics,
folding, and interactions.
Related papers
- SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z) - EquiJump: Protein Dynamics Simulation via SO(3)-Equivariant Stochastic Interpolants [13.493198442811865]
We introduce EquiJump, a transferable SO(3)-equivariant model that bridges all-atom protein dynamics simulation time steps directly.
Our approach achieves diverse sampling methods and is benchmarked against existing models on trajectory data of fast folding proteins.
arXiv Detail & Related papers (2024-10-12T23:22:49Z) - Protein Conformation Generation via Force-Guided SE(3) Diffusion Models [48.48934625235448]
Deep generative modeling techniques have been employed to generate novel protein conformations.
We propose a force-guided SE(3) diffusion model, ConfDiff, for protein conformation generation.
arXiv Detail & Related papers (2024-03-21T02:44:08Z) - Navigating protein landscapes with a machine-learned transferable
coarse-grained model [29.252004942896875]
coarse-grained (CG) model with similar prediction performance has been a long-standing challenge.
We develop a bottom-up CG force field with chemical transferability, which can be used for extrapolative molecular dynamics on new sequences.
We demonstrate that the model successfully predicts folded structures, intermediates, metastable folded and unfolded basins, and the fluctuations of intrinsically disordered proteins.
arXiv Detail & Related papers (2023-10-27T17:10:23Z) - A Latent Diffusion Model for Protein Structure Generation [50.74232632854264]
We propose a latent diffusion model that can reduce the complexity of protein modeling.
We show that our method can effectively generate novel protein backbone structures with high designability and efficiency.
arXiv Detail & Related papers (2023-05-06T19:10:19Z) - Machine Learning Coarse-Grained Potentials of Protein Thermodynamics [1.3543803103181613]
We build coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics.
For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins.
The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems.
arXiv Detail & Related papers (2022-12-14T20:23:11Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.