Multi-Fidelity Machine Learning for Excited State Energies of Molecules
- URL: http://arxiv.org/abs/2305.11292v1
- Date: Thu, 18 May 2023 20:21:22 GMT
- Title: Multi-Fidelity Machine Learning for Excited State Energies of Molecules
- Authors: Vivin Vinod, Sayan Maity, Peter Zaspel, Ulrich Kleinekath\"ofer
- Abstract summary: It is shown that the multi-fidelity machine learning model can achieve the same accuracy as a machine learning model built only on high cost training data.
The numerical gain observed in these benchmark test calculations was over a factor of 30 but certainly can be much higher for high accuracy data.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The accurate but fast calculation of molecular excited states is still a very
challenging topic. For many applications, detailed knowledge of the energy
funnel in larger molecular aggregates is of key importance requiring highly
accurate excited state energies. To this end, machine learning techniques can
be an extremely useful tool though the cost of generating highly accurate
training datasets still remains a severe challenge. To overcome this hurdle,
this work proposes the use of multi-fidelity machine learning where very little
training data from high accuracies is combined with cheaper and less accurate
data to achieve the accuracy of the costlier level. In the present study, the
approach is employed to predict the first excited state energies for three
molecules of increasing size, namely, benzene, naphthalene, and anthracene. The
energies are trained and tested for conformations stemming from classical
molecular dynamics simulations and from real-time density functional
tight-binding calculations. It can be shown that the multi-fidelity machine
learning model can achieve the same accuracy as a machine learning model built
only on high cost training data while having a much lower computational effort
to generate the data. The numerical gain observed in these benchmark test
calculations was over a factor of 30 but certainly can be much higher for high
accuracy data.
Related papers
- Predicting ionic conductivity in solids from the machine-learned potential energy landscape [68.25662704255433]
Superionic materials are essential for advancing solid-state batteries, which offer improved energy density and safety.
Conventional computational methods for identifying such materials are resource-intensive and not easily scalable.
We propose an approach for the quick and reliable evaluation of ionic conductivity through the analysis of a universal interatomic potential.
arXiv Detail & Related papers (2024-11-11T09:01:36Z) - MACK: Mismodeling Addressed with Contrastive Knowledge [0.6099917303150076]
As machine learning models become more complex they can become increasingly sensitive to differences between this simulation and the real data collected by experiments.
We present a generic methodology based on contrastive learning which is able to greatly mitigate this negative effect.
While we demonstrate the efficacy of this technique using the task of jet-tagging at the Large Hadron Collider, it is applicable to a wide array of different tasks both in and out of the field of high energy physics.
arXiv Detail & Related papers (2024-10-17T18:18:41Z) - Physical Consistency Bridges Heterogeneous Data in Molecular Multi-Task Learning [79.75718786477638]
We exploit the specialty of molecular tasks that there are physical laws connecting them, and design consistency training approaches.
We demonstrate that the more accurate energy data can improve the accuracy of structure prediction.
We also find that consistency training can directly leverage force and off-equilibrium structure data to improve structure prediction.
arXiv Detail & Related papers (2024-10-14T03:11:33Z) - Physics-Informed Weakly Supervised Learning for Interatomic Potentials [17.165117198519248]
We introduce a physics-informed, weakly supervised approach for training machine-learned interatomic potentials.
We demonstrate reduced energy and force errors -- often lower by a factor of two -- for various baseline models and benchmark data sets.
arXiv Detail & Related papers (2024-07-23T12:49:04Z) - Optimal design of experiments in the context of machine-learning inter-atomic potentials: improving the efficiency and transferability of kernel based methods [0.7234862895932991]
Data-driven, machine learning (ML) models of atomistic interactions can relate nuanced aspects of atomic arrangements into predictions of energies and forces.
The main challenge stems from the fact that descriptors of chemical environments are often sparse high-dimensional objects without a well-defined continuous metric.
We will demonstrate that classical concepts of statistical planning of experiments and optimal design can help to mitigate such problems at a relatively low computational cost.
arXiv Detail & Related papers (2024-05-14T14:14:23Z) - Machine Learning Force Fields with Data Cost Aware Training [94.78998399180519]
Machine learning force fields (MLFF) have been proposed to accelerate molecular dynamics (MD) simulation.
Even for the most data-efficient MLFFs, reaching chemical accuracy can require hundreds of frames of force and energy labels.
We propose a multi-stage computational framework -- ASTEROID, which lowers the data cost of MLFFs by leveraging a combination of cheap inaccurate data and expensive accurate data.
arXiv Detail & Related papers (2023-06-05T04:34:54Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Quantum-tailored machine-learning characterization of a superconducting
qubit [50.591267188664666]
We develop an approach to characterize the dynamics of a quantum device and learn device parameters.
This approach outperforms physics-agnostic recurrent neural networks trained on numerically generated and experimental data.
This demonstration shows how leveraging domain knowledge improves the accuracy and efficiency of this characterization task.
arXiv Detail & Related papers (2021-06-24T15:58:57Z) - Simple and efficient algorithms for training machine learning potentials
to force data [2.924868086534434]
Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy.
One limiting factor is the quantity of available training data, which can be expensive to obtain.
We present a new algorithm for efficient force training, and benchmark its accuracy by training to forces from real-world datasets for organic chemistry and bulk aluminum.
arXiv Detail & Related papers (2020-06-09T19:36:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.