May the Force be with You: Unified Force-Centric Pre-Training for 3D
Molecular Conformations
- URL: http://arxiv.org/abs/2308.14759v1
- Date: Thu, 24 Aug 2023 01:54:02 GMT
- Title: May the Force be with You: Unified Force-Centric Pre-Training for 3D
Molecular Conformations
- Authors: Rui Feng, Qi Zhu, Huan Tran, Binghong Chen, Aubrey Toland, Rampi
Ramprasad, Chao Zhang
- Abstract summary: We propose a force-centric pretraining model for 3D molecular conformations covering both equilibrium and off-equilibrium data.
For equilibrium data, we introduce zero-force regularization and forced-based denoising techniques to approximate near-equilibrium forces.
Experiments show that, with our pre-training objective, we increase forces accuracy by around 3 times compared to the un-pre-trained Equivariant Transformer model.
- Score: 19.273404278711794
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent works have shown the promise of learning pre-trained models for 3D
molecular representation. However, existing pre-training models focus
predominantly on equilibrium data and largely overlook off-equilibrium
conformations. It is challenging to extend these methods to off-equilibrium
data because their training objective relies on assumptions of conformations
being the local energy minima. We address this gap by proposing a force-centric
pretraining model for 3D molecular conformations covering both equilibrium and
off-equilibrium data. For off-equilibrium data, our model learns directly from
their atomic forces. For equilibrium data, we introduce zero-force
regularization and forced-based denoising techniques to approximate
near-equilibrium forces. We obtain a unified pre-trained model for 3D molecular
representation with over 15 million diverse conformations. Experiments show
that, with our pre-training objective, we increase forces accuracy by around 3
times compared to the un-pre-trained Equivariant Transformer model. By
incorporating regularizations on equilibrium data, we solved the problem of
unstable MD simulations in vanilla Equivariant Transformers, achieving
state-of-the-art simulation performance with 2.45 times faster inference time
than NequIP. As a powerful molecular encoder, our pre-trained model achieves
on-par performance with state-of-the-art property prediction tasks.
Related papers
- EquiFlow: Equivariant Conditional Flow Matching with Optimal Transport for 3D Molecular Conformation Prediction [14.664189950787739]
We propose EquiFlow, an equivariant conditional flow matching model with optimal transport.
It uses a modified Equiformer model to encode molecular conformations along with their atomic and bond properties into higher-degree embeddings.
Experiments on the QM9 dataset show that EquiFlow predicts small molecule conformations more accurately than current state-of-the-art models.
arXiv Detail & Related papers (2024-12-15T06:48:22Z) - Conditional Synthesis of 3D Molecules with Time Correction Sampler [58.0834973489875]
Time-Aware Conditional Synthesis (TACS) is a novel approach to conditional generation on diffusion models.
It integrates adaptively controlled plug-and-play "online" guidance into a diffusion model, driving samples toward the desired properties.
arXiv Detail & Related papers (2024-11-01T12:59:25Z) - Towards Theoretical Understandings of Self-Consuming Generative Models [56.84592466204185]
This paper tackles the emerging challenge of training generative models within a self-consuming loop.
We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models.
We present results for kernel density estimation, delivering nuanced insights such as the impact of mixed data training on error propagation.
arXiv Detail & Related papers (2024-02-19T02:08:09Z) - Highly Accurate Quantum Chemical Property Prediction with Uni-Mol+ [36.61144345418364]
We propose a novel approach called Uni-Mol+ to speed up the prediction of quantum chemical (QC) properties.
We introduce a two-track Transformer model backbone and train it with the QC property prediction task.
Our benchmarking results demonstrate that the proposed Uni-Mol+ significantly improves the accuracy of QC property prediction.
arXiv Detail & Related papers (2023-03-16T07:51:34Z) - Stabilizing Machine Learning Prediction of Dynamics: Noise and
Noise-inspired Regularization [58.720142291102135]
Recent has shown that machine learning (ML) models can be trained to accurately forecast the dynamics of chaotic dynamical systems.
In the absence of mitigating techniques, this technique can result in artificially rapid error growth, leading to inaccurate predictions and/or climate instability.
We introduce Linearized Multi-Noise Training (LMNT), a regularization technique that deterministically approximates the effect of many small, independent noise realizations added to the model input during training.
arXiv Detail & Related papers (2022-11-09T23:40:52Z) - Learning Physical Dynamics with Subequivariant Graph Neural Networks [99.41677381754678]
Graph Neural Networks (GNNs) have become a prevailing tool for learning physical dynamics.
Physical laws abide by symmetry, which is a vital inductive bias accounting for model generalization.
Our model achieves on average over 3% enhancement in contact prediction accuracy across 8 scenarios on Physion and 2X lower rollout MSE on RigidFall.
arXiv Detail & Related papers (2022-10-13T10:00:30Z) - 3D Equivariant Molecular Graph Pretraining [42.957880677779556]
We tackle 3D molecular pretraining in a complete and novel sense.
We first propose to adopt an equivariant energy-based model as the backbone for pretraining, which enjoys the merit of fulfilling the symmetry of 3D space.
We evaluate our model pretrained from a large-scale 3D dataset GEOM-QM9 on two challenging 3D benchmarks: MD17 and QM9.
arXiv Detail & Related papers (2022-07-18T16:26:24Z) - Pre-training via Denoising for Molecular Property Prediction [53.409242538744444]
We describe a pre-training technique that utilizes large datasets of 3D molecular structures at equilibrium.
Inspired by recent advances in noise regularization, our pre-training objective is based on denoising.
arXiv Detail & Related papers (2022-05-31T22:28:34Z) - Uncertainty quantification and inverse modeling for subsurface flow in
3D heterogeneous formations using a theory-guided convolutional
encoder-decoder network [5.018057056965207]
We build surrogate models for dynamic 3D subsurface single-phase flow problems with multiple vertical producing wells.
The surrogate model provides efficient pressure estimation of the entire formation at any timestep.
The well production rate or bottom hole pressure can then be determined based on Peaceman's formula.
arXiv Detail & Related papers (2021-11-14T10:11:46Z) - Equivariant vector field network for many-body system modeling [65.22203086172019]
Equivariant Vector Field Network (EVFN) is built on a novel equivariant basis and the associated scalarization and vectorization layers.
We evaluate our method on predicting trajectories of simulated Newton mechanics systems with both full and partially observed data.
arXiv Detail & Related papers (2021-10-26T14:26:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.