Materials Expert-Artificial Intelligence for Materials Discovery
- URL: http://arxiv.org/abs/2312.02796v1
- Date: Tue, 5 Dec 2023 14:29:18 GMT
- Title: Materials Expert-Artificial Intelligence for Materials Discovery
- Authors: Yanjun Liu, Milena Jovanovic, Krishnanand Mallayya, Wesley J. Maddox,
Andrew Gordon Wilson, Sebastian Klemenz, Leslie M. Schoop, Eun-Ah Kim
- Abstract summary: We introduce "Materials Expert-Artificial Intelligence" (ME-AI) to encapsulate and articulate this human intuition.
The ME-AI learned descriptors independently reproduce expert intuition and expand upon it.
Our success points to the "machine bottling human insight" approach as promising for machine learning-aided material discovery.
- Score: 39.67752644916519
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The advent of material databases provides an unprecedented opportunity to
uncover predictive descriptors for emergent material properties from vast data
space. However, common reliance on high-throughput ab initio data necessarily
inherits limitations of such data: mismatch with experiments. On the other
hand, experimental decisions are often guided by an expert's intuition honed
from experiences that are rarely articulated. We propose using machine learning
to "bottle" such operational intuition into quantifiable descriptors using
expertly curated measurement-based data. We introduce "Materials
Expert-Artificial Intelligence" (ME-AI) to encapsulate and articulate this
human intuition. As a first step towards such a program, we focus on the
topological semimetal (TSM) among square-net materials as the property inspired
by the expert-identified descriptor based on structural information: the
tolerance factor. We start by curating a dataset encompassing 12 primary
features of 879 square-net materials, using experimental data whenever
possible. We then use Dirichlet-based Gaussian process regression using a
specialized kernel to reveal composite descriptors for square-net topological
semimetals. The ME-AI learned descriptors independently reproduce expert
intuition and expand upon it. Specifically, new descriptors point to
hypervalency as a critical chemical feature predicting TSM within square-net
compounds. Our success with a carefully defined problem points to the "machine
bottling human insight" approach as promising for machine learning-aided
material discovery.
Related papers
- SciQu: Accelerating Materials Properties Prediction with Automated Literature Mining for Self-Driving Laboratories [0.7673339435080445]
Assessing different material properties to predict specific attributes is a fundamental requirement for materials science-based applications.
Our study addresses these challenges by leveraging machine learning to analyze material properties with greater precision and efficiency.
By automating the data extraction process and using the extracted information to train machine learning models, our developed model, SciQu, optimize material properties.
arXiv Detail & Related papers (2024-07-11T08:12:46Z) - Dynamic In-context Learning with Conversational Models for Data Extraction and Materials Property Prediction [0.0]
PropertyExtractor is an open-source tool that blends zero-shot with few-shot in-context learning.
Our tests on material data demonstrate precision and recall that exceed 95% with an error rate of approximately 9%.
arXiv Detail & Related papers (2024-05-16T21:15:51Z) - Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study [65.11303133775857]
Mixture-of-Experts (MoE) computation amalgamates predictions from several specialized sub-models (referred to as experts)
Sparse MoE selectively engages only a limited number, or even just one expert, significantly reducing overhead while empirically preserving, and sometimes even enhancing, performance.
arXiv Detail & Related papers (2024-03-26T05:48:02Z) - Advancing Extrapolative Predictions of Material Properties through Learning to Learn [1.3274508420845539]
We use attention-based architecture of neural networks and meta-learning algorithms to acquire extrapolative generalization capability.
We highlight the potential of such extrapolatively trained models, particularly with their ability to rapidly adapt to unseen material domains.
arXiv Detail & Related papers (2024-03-25T09:30:19Z) - Instruct and Extract: Instruction Tuning for On-Demand Information
Extraction [86.29491354355356]
On-Demand Information Extraction aims to fulfill the personalized demands of real-world users.
We present a benchmark named InstructIE, inclusive of both automatically generated training data, as well as the human-annotated test set.
Building on InstructIE, we further develop an On-Demand Information Extractor, ODIE.
arXiv Detail & Related papers (2023-10-24T17:54:25Z) - Deep networks for system identification: a Survey [56.34005280792013]
System identification learns mathematical descriptions of dynamic systems from input-output data.
Main aim of the identified model is to predict new data from previous observations.
We discuss architectures commonly adopted in the literature, like feedforward, convolutional, and recurrent networks.
arXiv Detail & Related papers (2023-01-30T12:38:31Z) - Symbolic Knowledge Extraction from Opaque Predictors Applied to
Cosmic-Ray Data Gathered with LISA Pathfinder [0.0]
Several techniques exist to combine the impressive predictive performance of opaque machine learning models with human-intelligible prediction explanations.
In this paper are reported the results of different knowledge extractors applied to an ensemble predictor capable of reproducing cosmic-ray data gathered on board the LISA Pathfinder space mission.
arXiv Detail & Related papers (2022-09-10T15:35:40Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - Audacity of huge: overcoming challenges of data scarcity and data
quality for machine learning in computational materials discovery [1.0036312061637764]
Machine learning (ML)-accelerated discovery requires large amounts of high-fidelity data to reveal predictive structure-property relationships.
For many properties of interest in materials discovery, the challenging nature and high cost of data generation has resulted in a data landscape that is scarcely populated and of dubious quality.
In the absence of manual curation, increasingly sophisticated natural language processing and automated image analysis are making it possible to learn structure-property relationships from the literature.
arXiv Detail & Related papers (2021-11-02T21:43:58Z) - Representation Learning for Sequence Data with Deep Autoencoding
Predictive Components [96.42805872177067]
We propose a self-supervised representation learning method for sequence data, based on the intuition that useful representations of sequence data should exhibit a simple structure in the latent space.
We encourage this latent structure by maximizing an estimate of predictive information of latent feature sequences, which is the mutual information between past and future windows at each time step.
We demonstrate that our method recovers the latent space of noisy dynamical systems, extracts predictive features for forecasting tasks, and improves automatic speech recognition when used to pretrain the encoder on large amounts of unlabeled data.
arXiv Detail & Related papers (2020-10-07T03:34:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.