Machine learning to tame divergent density functional approximations: a
new path to consensus materials design principles
- URL: http://arxiv.org/abs/2106.13109v1
- Date: Thu, 24 Jun 2021 15:43:57 GMT
- Title: Machine learning to tame divergent density functional approximations: a
new path to consensus materials design principles
- Authors: Chenru Duan, Shuxin Chen, Michael G. Taylor, Fang Liu, and Heather J.
Kulik
- Abstract summary: We introduce an approach to rapidly obtain property predictions from 23 representative DFAs spanning multiple families and "rungs"
We train independent ML models for each DFA and observe convergent trends in feature importance.
By requiring consensus of the ANN-predicted DFA properties, we improve correspondence of these computational lead compounds with literature-mined, experimental compounds.
- Score: 4.700621178941319
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Computational virtual high-throughput screening (VHTS) with density
functional theory (DFT) and machine-learning (ML)-acceleration is essential in
rapid materials discovery. By necessity, efficient DFT-based workflows are
carried out with a single density functional approximation (DFA). Nevertheless,
properties evaluated with different DFAs can be expected to disagree for the
cases with challenging electronic structure (e.g., open shell transition metal
complexes, TMCs) for which rapid screening is most needed and accurate
benchmarks are often unavailable. To quantify the effect of DFA bias, we
introduce an approach to rapidly obtain property predictions from 23
representative DFAs spanning multiple families and "rungs" (e.g., semi-local to
double hybrid) and basis sets on over 2,000 TMCs. Although computed properties
(e.g., spin-state ordering and frontier orbital gap) naturally differ by DFA,
high linear correlations persist across all DFAs. We train independent ML
models for each DFA and observe convergent trends in feature importance; these
features thus provide DFA-invariant, universal design rules. We devise a
strategy to train ML models informed by all 23 DFAs and use them to predict
properties (e.g., spin-splitting energy) of over 182k TMCs. By requiring
consensus of the ANN-predicted DFA properties, we improve correspondence of
these computational lead compounds with literature-mined, experimental
compounds over the single-DFA approach typically employed. Both feature
analysis and consensus-based ML provide efficient, alternative paths to
overcome accuracy limitations of practical DFT.
Related papers
- Multi-task learning for molecular electronic structure approaching coupled-cluster accuracy [9.81014501502049]
We develop a unified machine learning method for electronic structures of organic molecules using the gold-standard CCSD(T) calculations as training data.
Tested on hydrocarbon molecules, our model outperforms DFT with the widely-used hybrid and double hybrid functionals in computational costs and prediction accuracy of various quantum chemical properties.
arXiv Detail & Related papers (2024-05-09T19:51:27Z) - Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning [50.73666458313015]
Large Language Models (LLMs) have demonstrated significant potential in performing multiple tasks in multimedia applications.
MoE has been emerged as a promising solution with its sparse architecture for effective task decoupling.
Intuition-MoR1E achieves superior efficiency and 2.15% overall accuracy improvement across 14 public datasets.
arXiv Detail & Related papers (2024-04-13T12:14:58Z) - Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-Line [65.14099135546594]
Recent test-time adaptation (TTA) methods drastically strengthen the ACL and AGL trends in models, even in shifts where models showed very weak correlations before.
Our results show that by combining TTA with AGL-based estimation methods, we can estimate the OOD performance of models with high precision for a broader set of distribution shifts.
arXiv Detail & Related papers (2023-10-07T23:21:25Z) - Prototype-based Aleatoric Uncertainty Quantification for Cross-modal
Retrieval [139.21955930418815]
Cross-modal Retrieval methods build similarity relations between vision and language modalities by jointly learning a common representation space.
However, the predictions are often unreliable due to the Aleatoric uncertainty, which is induced by low-quality data, e.g., corrupt images, fast-paced videos, and non-detailed texts.
We propose a novel Prototype-based Aleatoric Uncertainty Quantification (PAU) framework to provide trustworthy predictions by quantifying the uncertainty arisen from the inherent data ambiguity.
arXiv Detail & Related papers (2023-09-29T09:41:19Z) - Physics Inspired Hybrid Attention for SAR Target Recognition [61.01086031364307]
We propose a physics inspired hybrid attention (PIHA) mechanism and the once-for-all (OFA) evaluation protocol to address the issues.
PIHA leverages the high-level semantics of physical information to activate and guide the feature group aware of local semantics of target.
Our method outperforms other state-of-the-art approaches in 12 test scenarios with same ASC parameters.
arXiv Detail & Related papers (2023-09-27T14:39:41Z) - Grad DFT: a software library for machine learning enhanced density
functional theory [0.0]
Density functional theory (DFT) stands as a cornerstone in computational quantum chemistry and materials science.
Recent work has begun to explore how machine learning can expand the capabilities of DFT.
We present Grad DFT: a fully differentiable JAX-based DFT library, enabling quick prototyping and experimentation with machine learning-enhanced exchange-correlation energy functionals.
arXiv Detail & Related papers (2023-09-23T00:25:06Z) - Mitigating the Alignment Tax of RLHF [76.4300447532456]
aligning LLMs under Reinforcement Learning with Human Feedback can lead to forgetting pretrained abilities, also known as the alignment tax.
We propose model averaging to maximize alignment performance while incurring minimal alignment tax.
We validate HMA's performance across a range of RLHF algorithms over OpenLLaMA-3B and further extend our findings to Mistral-7B.
arXiv Detail & Related papers (2023-09-12T14:16:54Z) - Accurate machine learning force fields via experimental and simulation
data fusion [0.0]
Machine Learning (ML)-based force fields are attracting ever-increasing interest due to their capacity to span scales of classical interatomic potentials at quantum-level accuracy.
Here we leverage both Density Functional Theory (DFT) calculations and experimentally measured mechanical properties and lattice parameters to train an ML potential of titanium.
We demonstrate that the fused data learning strategy can concurrently satisfy all target objectives, thus resulting in a molecular model of higher accuracy compared to the models trained with a single source data.
arXiv Detail & Related papers (2023-08-17T18:22:19Z) - A Transferable Recommender Approach for Selecting the Best Density
Functional Approximations in Chemical Discovery [0.4063872661554894]
No single density functional approximation with universal accuracy has been identified, leading to uncertainty in the quality of data generated from DFT.
We build a DFA recommender that selects the DFA with the lowest expected error with respect to gold standard but cost-prohibitive coupled cluster theory.
Our recommender predicts top-performing DFAs and yields excellent accuracy (ca. 2 kcal/mol) for chemical discovery, outperforming both individual transfer learning models and the single best functional in a set of 48 DFAs.
arXiv Detail & Related papers (2022-07-21T20:45:57Z) - Putting Density Functional Theory to the Test in
Machine-Learning-Accelerated Materials Discovery [2.7810723668216575]
We describe the advances needed in accuracy, efficiency, and approach beyond what is typical in conventional DFT-based machine learning (ML)
For DFT to be trusted for a given data point in a high- throughput screen, it must pass a series of tests.
For DFT to be trusted for a given data point in a high- throughput screen, it must pass a series of tests.
arXiv Detail & Related papers (2022-05-06T00:34:50Z) - Pseudo-Spherical Contrastive Divergence [119.28384561517292]
We propose pseudo-spherical contrastive divergence (PS-CD) to generalize maximum learning likelihood of energy-based models.
PS-CD avoids the intractable partition function and provides a generalized family of learning objectives.
arXiv Detail & Related papers (2021-11-01T09:17:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.