Uncertainty-aware Mixed-variable Machine Learning for Materials Design
- URL: http://arxiv.org/abs/2207.04994v1
- Date: Mon, 11 Jul 2022 16:37:17 GMT
- Title: Uncertainty-aware Mixed-variable Machine Learning for Materials Design
- Authors: Hengrui Zhang, Wei "Wayne" Chen, Akshay Iyer, Daniel W. Apley, Wei
Chen
- Abstract summary: We survey frequentist and Bayesian approaches to uncertainty quantification of machine learning with mixed variables.
We examine the efficacy of the two models in the optimization of mathematical functions, as well as properties of structural and functional materials.
Our results provide practical guidance on choosing between frequentist and Bayesian uncertainty-aware machine learning models for mixed-variable BO in materials design.
- Score: 9.259285449415676
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data-driven design shows the promise of accelerating materials discovery but
is challenging due to the prohibitive cost of searching the vast design space
of chemistry, structure, and synthesis methods. Bayesian Optimization (BO)
employs uncertainty-aware machine learning models to select promising designs
to evaluate, hence reducing the cost. However, BO with mixed numerical and
categorical variables, which is of particular interest in materials design, has
not been well studied. In this work, we survey frequentist and Bayesian
approaches to uncertainty quantification of machine learning with mixed
variables. We then conduct a systematic comparative study of their performances
in BO using a popular representative model from each group, the random
forest-based Lolo model (frequentist) and the latent variable Gaussian process
model (Bayesian). We examine the efficacy of the two models in the optimization
of mathematical functions, as well as properties of structural and functional
materials, where we observe performance differences as related to problem
dimensionality and complexity. By investigating the machine learning models'
predictive and uncertainty estimation capabilities, we provide interpretations
of the observed performance differences. Our results provide practical guidance
on choosing between frequentist and Bayesian uncertainty-aware machine learning
models for mixed-variable BO in materials design.
Related papers
- Active Learning of Model Discrepancy with Bayesian Experimental Design [0.0]
We propose an efficient approach to learn the model discrepancy based on the data from a sequential experimental design (BED)
We show that the proposed method is efficient and robust to the active learning of high-dimensional model discrepancy, using data suggested by the sequential BED.
We also demonstrate that the proposed method is compatible with both classical numerical solvers and modern auto-differentiable solvers.
arXiv Detail & Related papers (2025-02-07T22:54:20Z) - Revisiting SMoE Language Models by Evaluating Inefficiencies with Task Specific Expert Pruning [78.72226641279863]
Sparse Mixture of Expert (SMoE) models have emerged as a scalable alternative to dense models in language modeling.
Our research explores task-specific model pruning to inform decisions about designing SMoE architectures.
We introduce an adaptive task-aware pruning technique UNCURL to reduce the number of experts per MoE layer in an offline manner post-training.
arXiv Detail & Related papers (2024-09-02T22:35:03Z) - Online simulator-based experimental design for cognitive model selection [74.76661199843284]
We propose BOSMOS: an approach to experimental design that can select between computational models without tractable likelihoods.
In simulated experiments, we demonstrate that the proposed BOSMOS technique can accurately select models in up to 2 orders of magnitude less time than existing LFI alternatives.
arXiv Detail & Related papers (2023-03-03T21:41:01Z) - Statistical Aspects of SHAP: Functional ANOVA for Model Interpretation [0.456877715768796]
We show that challenges in SHAP approximations largely relate to the choice of a feature distribution and the number of $2p$ ANOVA terms estimated.
We argue that the connection between machine learning explainability and sensitivity analysis is illuminating in this case, but the immediate practical consequences are not obvious.
arXiv Detail & Related papers (2022-08-21T21:46:15Z) - On the Influence of Enforcing Model Identifiability on Learning dynamics
of Gaussian Mixture Models [14.759688428864159]
We propose a technique for extracting submodels from singular models.
Our method enforces model identifiability during training.
We show how the method can be applied to more complex models like deep neural networks.
arXiv Detail & Related papers (2022-06-17T07:50:22Z) - Equivariant vector field network for many-body system modeling [65.22203086172019]
Equivariant Vector Field Network (EVFN) is built on a novel equivariant basis and the associated scalarization and vectorization layers.
We evaluate our method on predicting trajectories of simulated Newton mechanics systems with both full and partially observed data.
arXiv Detail & Related papers (2021-10-26T14:26:25Z) - Modeling Systems with Machine Learning based Differential Equations [0.0]
We propose the design of time-continuous models of dynamical systems as solutions of differential equations.
Our results suggest that this approach can be an useful technique in the case of synthetic or experimental data.
arXiv Detail & Related papers (2021-09-09T19:10:46Z) - Model Uncertainty and Correctability for Directed Graphical Models [3.326320568999945]
We develop information-theoretic, robust uncertainty quantification methods and non-parametric stress tests for directed graphical models.
We provide a mathematically rigorous approach to correctability that guarantees a systematic selection for improvement of components of a graphical model.
We demonstrate our methods in two physico-chemical examples, namely quantum scale-informed chemical kinetics and materials screening to improve the efficiency of fuel cells.
arXiv Detail & Related papers (2021-07-17T04:30:37Z) - Model-agnostic multi-objective approach for the evolutionary discovery
of mathematical models [55.41644538483948]
In modern data science, it is more interesting to understand the properties of the model, which parts could be replaced to obtain better results.
We use multi-objective evolutionary optimization for composite data-driven model learning to obtain the algorithm's desired properties.
arXiv Detail & Related papers (2021-07-07T11:17:09Z) - Scalable Gaussian Processes for Data-Driven Design using Big Data with
Categorical Factors [14.337297795182181]
Gaussian processes (GP) have difficulties in accommodating big datasets, categorical inputs, and multiple responses.
We propose a GP model that utilizes latent variables and functions obtained through variational inference to address the aforementioned challenges simultaneously.
Our approach is demonstrated for machine learning of ternary oxide materials and topology optimization of a multiscale compliant mechanism.
arXiv Detail & Related papers (2021-06-26T02:17:23Z) - Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual
Model-Based Reinforcement Learning [109.74041512359476]
We study a number of design decisions for the predictive model in visual MBRL algorithms.
We find that a range of design decisions that are often considered crucial, such as the use of latent spaces, have little effect on task performance.
We show how this phenomenon is related to exploration and how some of the lower-scoring models on standard benchmarks will perform the same as the best-performing models when trained on the same training data.
arXiv Detail & Related papers (2020-12-08T18:03:21Z) - Predictive modeling approaches in laser-based material processing [59.04160452043105]
This study aims to automate and forecast the effect of laser processing on material structures.
The focus is centred on the performance of representative statistical and machine learning algorithms.
Results can set the basis for a systematic methodology towards reducing material design, testing and production cost.
arXiv Detail & Related papers (2020-06-13T17:28:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.