Fusing Neural and Physical: Augment Protein Conformation Sampling with
Tractable Simulations
- URL: http://arxiv.org/abs/2402.10433v2
- Date: Mon, 11 Mar 2024 20:20:16 GMT
- Title: Fusing Neural and Physical: Augment Protein Conformation Sampling with
Tractable Simulations
- Authors: Jiarui Lu, Zuobai Zhang, Bozitao Zhong, Chence Shi, Jian Tang
- Abstract summary: generative models have been leveraged as a surrogate sampler to obtain conformation ensembles with orders of magnitude faster.
In this work, we explore the few-shot setting of such pre-trained generative sampler which incorporates MD simulations in a tractable manner.
- Score: 27.984190594059868
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The protein dynamics are common and important for their biological functions
and properties, the study of which usually involves time-consuming molecular
dynamics (MD) simulations in silico. Recently, generative models has been
leveraged as a surrogate sampler to obtain conformation ensembles with orders
of magnitude faster and without requiring any simulation data (a "zero-shot"
inference). However, being agnostic of the underlying energy landscape, the
accuracy of such generative model may still be limited. In this work, we
explore the few-shot setting of such pre-trained generative sampler which
incorporates MD simulations in a tractable manner. Specifically, given a target
protein of interest, we first acquire some seeding conformations from the
pre-trained sampler followed by a number of physical simulations in parallel
starting from these seeding samples. Then we fine-tuned the generative model
using the simulation trajectories above to become a target-specific sampler.
Experimental results demonstrated the superior performance of such few-shot
conformation sampler at a tractable computational cost.
Related papers
- EquiJump: Protein Dynamics Simulation via SO(3)-Equivariant Stochastic Interpolants [13.493198442811865]
We introduce EquiJump, a transferable SO(3)-equivariant model that bridges all-atom protein dynamics simulation time steps directly.
Our approach achieves diverse sampling methods and is benchmarked against existing models on trajectory data of fast folding proteins.
arXiv Detail & Related papers (2024-10-12T23:22:49Z) - Generative Modeling of Molecular Dynamics Trajectories [12.255021091552441]
We introduce generative modeling of molecular trajectories as a paradigm for learning flexible multi-task surrogate models of MD from data.
We show such generative models can be adapted to diverse tasks such as forward simulation, transition path sampling, and trajectory upsampling.
arXiv Detail & Related papers (2024-09-26T13:02:28Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - A Multi-Grained Symmetric Differential Equation Model for Learning
Protein-Ligand Binding Dynamics [74.93549765488103]
In drug discovery, molecular dynamics simulation provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites.
We propose NeuralMD, the first machine learning surrogate that can facilitate numerical MD and provide accurate simulations in protein-ligand binding.
We show the efficiency and effectiveness of NeuralMD, with a 2000$times$ speedup over standard numerical MD simulation and outperforming all other ML approaches by up to 80% under the stability metric.
arXiv Detail & Related papers (2024-01-26T09:35:17Z) - Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of
Protein Simulators [29.22292758901411]
We adapt the soft prompt-based learning method to molecular dynamics tasks.
Our framework excels in accuracy for in-domain data and demonstrates strong generalization capabilities for unseen and out-of-distribution samples.
arXiv Detail & Related papers (2023-08-29T08:29:08Z) - Str2Str: A Score-based Framework for Zero-shot Protein Conformation
Sampling [23.74897713386661]
The dynamic nature of proteins is crucial for determining their biological functions and properties.
Existing learning-based approaches perform direct sampling yet heavily rely on target-specific simulation data for training.
We propose Str2Str, a novel structure-to-structure translation framework capable of zero-shot conformation sampling.
arXiv Detail & Related papers (2023-06-05T15:19:06Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Neural Posterior Estimation with Differentiable Simulators [58.720142291102135]
We present a new method to perform Neural Posterior Estimation (NPE) with a differentiable simulator.
We demonstrate how gradient information helps constrain the shape of the posterior and improves sample-efficiency.
arXiv Detail & Related papers (2022-07-12T16:08:04Z) - Deep Bayesian Active Learning for Accelerating Stochastic Simulation [74.58219903138301]
Interactive Neural Process (INP) is a deep active learning framework for simulations and with active learning approaches.
For active learning, we propose a novel acquisition function, Latent Information Gain (LIG), calculated in the latent space of NP based models.
The results demonstrate STNP outperforms the baselines in the learning setting and LIG achieves the state-of-the-art for active learning.
arXiv Detail & Related papers (2021-06-05T01:31:51Z) - Simulating the Time Projection Chamber responses at the MPD detector
using Generative Adversarial Networks [0.0]
In this work, we demonstrate a novel approach to speed up the simulation of the Time Projection Chamber tracker of the MPD experiment at the NICA accelerator complex.
Our method is based on a Generative Adrial Network - a deep learning technique allowing for implicit non-parametric estimation of the population distribution for a given set of objects.
To evaluate the quality of the proposed model, we integrate it into the MPD software stack and demonstrate that it produces high-quality events similar to the detailed simulator.
arXiv Detail & Related papers (2020-12-08T17:57:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.