Data-efficient and Interpretable Inverse Materials Design using a Disentangled Variational Autoencoder
- URL: http://arxiv.org/abs/2409.06740v2
- Date: Sun, 17 Nov 2024 01:11:44 GMT
- Title: Data-efficient and Interpretable Inverse Materials Design using a Disentangled Variational Autoencoder
- Authors: Cheng Zeng, Zulqarnain Khan, Nathan L. Post,
- Abstract summary: Inverse materials design has proven successful in accelerating novel material discovery.
Many inverse materials design methods use unsupervised learning where a latent space is learned to offer a compact description of materials representations.
Here, we present a semi-supervised learning approach based on a disentangled variational autoencoder to learn a probabilistic relationship between features, latent variables and target properties.
- Score: 2.563209727695243
- License:
- Abstract: Inverse materials design has proven successful in accelerating novel material discovery. Many inverse materials design methods use unsupervised learning where a latent space is learned to offer a compact description of materials representations. A latent space learned this way is likely to be entangled, in terms of the target property and other properties of the materials. This makes the inverse design process ambiguous. Here, we present a semi-supervised learning approach based on a disentangled variational autoencoder to learn a probabilistic relationship between features, latent variables and target properties. This approach is data efficient because it combines all labelled and unlabelled data in a coherent manner, and it uses expert-informed prior distributions to improve model robustness even with limited labelled data. It is in essence interpretable, as the learnable target property is disentangled out of the other properties of the materials, and an extra layer of interpretability can be provided by a post-hoc analysis of the classification head of the model. We demonstrate this new approach on an experimental high-entropy alloy dataset with chemical compositions as input and single-phase formation as the single target property. High-entropy alloys were chosen as example materials because of the vast chemical space of their possible combinations of compositions and atomic configurations. While single property is used in this work, the disentangled model can be extended to customize for inverse design of materials with multiple target properties.
Related papers
- Targeting the partition function of chemically disordered materials with a generative approach based on inverse variational autoencoders [0.0]
We propose a novel approach where generative machine learning is used to yield a representative set of configurations for accurate property evaluation.
Our method employs a specific type of variational autoencoder with inverse roles for the encoder and decoder.
We illustrate our approach by computing point-defect formation energies and concentrations in (U, Pu)O2 mixed-oxide fuels.
arXiv Detail & Related papers (2024-08-27T10:05:37Z) - VAE for Modified 1-Hot Generative Materials Modeling, A Step Towards
Inverse Material Design [0.0]
In inverse material design, where one seeks to design a material with a prescribed set of properties, a significant challenge is ensuring synthetic viability of a proposed new material.
We encode an implicit dataset relationships, namely that certain materials can be decomposed into other ones in the dataset.
We present a VAE model capable of preserving this property in the latent space and generating new samples with the same.
arXiv Detail & Related papers (2023-12-25T04:04:47Z) - Hodge-Aware Contrastive Learning [101.56637264703058]
Simplicial complexes prove effective in modeling data with multiway dependencies.
We develop a contrastive self-supervised learning approach for processing simplicial data.
arXiv Detail & Related papers (2023-09-14T00:40:07Z) - Disentanglement via Latent Quantization [60.37109712033694]
In this work, we construct an inductive bias towards encoding to and decoding from an organized latent space.
We demonstrate the broad applicability of this approach by adding it to both basic data-re (vanilla autoencoder) and latent-reconstructing (InfoGAN) generative models.
arXiv Detail & Related papers (2023-05-28T06:30:29Z) - Differentiable graph-structured models for inverse design of lattice
materials [0.0]
Architected materials possessing physico-chemical properties adaptable to disparate environmental conditions embody a disruptive new domain of materials science.
We propose a new computational approach using graph-based representation for regular and irregular lattice materials.
arXiv Detail & Related papers (2023-04-11T18:00:21Z) - Dynamic Latent Separation for Deep Learning [67.62190501599176]
A core problem in machine learning is to learn expressive latent variables for model prediction on complex data.
Here, we develop an approach that improves expressiveness, provides partial interpretation, and is not restricted to specific applications.
arXiv Detail & Related papers (2022-10-07T17:56:53Z) - Learning Conditional Invariance through Cycle Consistency [60.85059977904014]
We propose a novel approach to identify meaningful and independent factors of variation in a dataset.
Our method involves two separate latent subspaces for the target property and the remaining input information.
We demonstrate on synthetic and molecular data that our approach identifies more meaningful factors which lead to sparser and more interpretable models.
arXiv Detail & Related papers (2021-11-25T17:33:12Z) - How to See Hidden Patterns in Metamaterials with Interpretable Machine
Learning [82.67551367327634]
We develop a new interpretable, multi-resolution machine learning framework for finding patterns in the unit-cells of materials.
Specifically, we propose two new interpretable representations of metamaterials, called shape-frequency features and unit-cell templates.
arXiv Detail & Related papers (2021-11-10T21:19:02Z) - Model-agnostic multi-objective approach for the evolutionary discovery
of mathematical models [55.41644538483948]
In modern data science, it is more interesting to understand the properties of the model, which parts could be replaced to obtain better results.
We use multi-objective evolutionary optimization for composite data-driven model learning to obtain the algorithm's desired properties.
arXiv Detail & Related papers (2021-07-07T11:17:09Z) - Machine learning with persistent homology and chemical word embeddings
improves prediction accuracy and interpretability in metal-organic frameworks [0.07874708385247352]
We introduce an end-to-end machine learning model that automatically generates descriptors that capture a complex representation of a material's structure and chemistry.
It automatically encapsulates geometric and chemical information directly from the material system.
Our results show considerable improvement in both accuracy and transferability across targets compared to models constructed from the commonly-used, manually-curated features.
arXiv Detail & Related papers (2020-10-01T16:31:46Z) - Ensemble learning reveals dissimilarity between rare-earth transition
metal binary alloys with respect to the Curie temperature [0.0]
We propose a data-driven method to extract dissimilarity between materials, with respect to a given target physical property.
The proposed method can be considered as a potential tool for obtaining a deeper understanding of the structure of data.
arXiv Detail & Related papers (2020-08-20T07:46:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.