A Universal Framework for Featurization of Atomistic Systems
- URL: http://arxiv.org/abs/2102.02390v2
- Date: Fri, 5 Feb 2021 01:38:16 GMT
- Title: A Universal Framework for Featurization of Atomistic Systems
- Authors: Xiangyun Lei, Andrew J. Medford
- Abstract summary: Reactive force fields based on physics or machine learning can be used to bridge the gap in time and length scales.
We introduce the Gaussian multi-pole (GMP) featurization scheme that utilizes physically-relevant multi-pole expansions of the electron density around atoms.
We demonstrate that GMP-based models can achieve chemical accuracy for the QM9 dataset, and their accuracy remains reasonable even when extrapolating to new elements.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Molecular dynamics simulations are an invaluable tool in numerous scientific
fields. However, the ubiquitous classical force fields cannot describe reactive
systems, and quantum molecular dynamics are too computationally demanding to
treat large systems or long timescales. Reactive force fields based on physics
or machine learning can be used to bridge the gap in time and length scales,
but these force fields require substantial effort to construct and are highly
specific to given chemical composition and application. The extreme flexibility
of machine learning models promises to yield reactive force fields that provide
a more general description of chemical bonding. However, a significant
limitation of machine learning models is the use of element-specific features,
leading to models that scale poorly with the number of elements. This work
introduces the Gaussian multi-pole (GMP) featurization scheme that utilizes
physically-relevant multi-pole expansions of the electron density around atoms
to yield feature vectors that interpolate between element types and have a
fixed dimension regardless of the number of elements present. We combine GMP
with neural networks to directly compare it to the widely-used Behler-Parinello
symmetry functions for the MD17 dataset, revealing that it exhibits improved
accuracy and computational efficiency. Further, we demonstrate that GMP-based
models can achieve chemical accuracy for the QM9 dataset, and their accuracy
remains reasonable even when extrapolating to new elements. Finally, we test
GMP-based models for the Open Catalysis Project (OCP) dataset, revealing
comparable performance and improved learning rates when compared to graph
convolutional deep learning models. The results indicate that this
featurization scheme fills a critical gap in the construction of efficient and
transferable reactive force fields.
Related papers
- Neural P$^3$M: A Long-Range Interaction Modeling Enhancer for Geometric
GNNs [66.98487644676906]
We introduce Neural P$3$M, a versatile enhancer of geometric GNNs to expand the scope of their capabilities.
It exhibits flexibility across a wide range of molecular systems and demonstrates remarkable accuracy in predicting energies and forces.
It also achieves an average improvement of 22% on the OE62 dataset while integrating with various architectures.
arXiv Detail & Related papers (2024-09-26T08:16:59Z) - Multi-task learning for molecular electronic structure approaching coupled-cluster accuracy [9.81014501502049]
We develop a unified machine learning method for electronic structures of organic molecules using the gold-standard CCSD(T) calculations as training data.
Tested on hydrocarbon molecules, our model outperforms DFT with the widely-used hybrid and double hybrid functionals in computational costs and prediction accuracy of various quantum chemical properties.
arXiv Detail & Related papers (2024-05-09T19:51:27Z) - Physics-informed active learning for accelerating quantum chemical simulations [10.56535364437456]
We introduce the end-to-end AL for constructing robust data-efficient potentials in quantum chemical simulations.
Our protocol is based on the physics-informed sampling of training points, automatic selection of initial data, uncertainty quantification, and convergence monitoring.
These investigations took us days instead of weeks of pure quantum chemical calculations on a high-performance computing cluster.
arXiv Detail & Related papers (2024-04-18T00:17:01Z) - QH9: A Quantum Hamiltonian Prediction Benchmark for QM9 Molecules [69.25826391912368]
We generate a new Quantum Hamiltonian dataset, named as QH9, to provide precise Hamiltonian matrices for 999 or 2998 molecular dynamics trajectories.
We show that current machine learning models have the capacity to predict Hamiltonian matrices for arbitrary molecules.
arXiv Detail & Related papers (2023-06-15T23:39:07Z) - Molecular Geometry-aware Transformer for accurate 3D Atomic System
modeling [51.83761266429285]
We propose a novel Transformer architecture that takes nodes (atoms) and edges (bonds and nonbonding atom pairs) as inputs and models the interactions among them.
Moleformer achieves state-of-the-art on the initial state to relaxed energy prediction of OC20 and is very competitive in QM9 on predicting quantum chemical properties.
arXiv Detail & Related papers (2023-02-02T03:49:57Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Gaussian Moments as Physically Inspired Molecular Descriptors for
Accurate and Scalable Machine Learning Potentials [0.0]
We propose a machine learning method for constructing high-dimensional potential energy surfaces based on feed-forward neural networks.
The accuracy of the developed approach in representing both chemical and configurational spaces is comparable to the one of several established machine learning models.
arXiv Detail & Related papers (2021-09-15T16:46:46Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z) - End-to-End Differentiable Molecular Mechanics Force Field Construction [0.5269923665485903]
We propose an alternative approach that uses graph neural networks to perceive chemical environments.
The entire process is modular and end-to-end differentiable with respect to model parameters.
We show that this approach is not only sufficiently to reproduce legacy atom types, but that it can learn to accurately reproduce and extend existing molecular mechanics force fields.
arXiv Detail & Related papers (2020-10-02T20:59:46Z) - Graph Neural Network for Hamiltonian-Based Material Property Prediction [56.94118357003096]
We present and compare several different graph convolution networks that are able to predict the band gap for inorganic materials.
The models are developed to incorporate two different features: the information of each orbital itself and the interaction between each other.
The results show that our model can get a promising prediction accuracy with cross-validation.
arXiv Detail & Related papers (2020-05-27T13:32:10Z) - Predicting molecular dipole moments by combining atomic partial charges
and atomic dipoles [3.0980025155565376]
"MuML" models are fitted together to reproduce molecular $boldsymbolmu$ computed using high-level coupled-cluster theory.
We demonstrate that the uncertainty in the predictions can be estimated reliably using a calibrated committee model.
arXiv Detail & Related papers (2020-03-27T14:35:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.