MOFSimplify: Machine Learning Models with Extracted Stability Data of
Three Thousand Metal-Organic Frameworks
- URL: http://arxiv.org/abs/2109.08098v1
- Date: Thu, 16 Sep 2021 16:37:37 GMT
- Title: MOFSimplify: Machine Learning Models with Extracted Stability Data of
Three Thousand Metal-Organic Frameworks
- Authors: A. Nandy, G. Terrones, N. Arunachalam, C. Duan, D. W. Kastner, and H.
J. Kulik
- Abstract summary: We use natural language processing to mine literature on metal-organic framework (MOF) stability measures.
We train machine learning models to predict stability on new MOFs with quantified uncertainty.
Our web interface, MOFSimplify, provides users access to our curated data and enables them to harness that data for predictions on new MOFs.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We report a workflow and the output of a natural language processing
(NLP)-based procedure to mine the extant metal-organic framework (MOF)
literature describing structurally characterized MOFs and their solvent removal
and thermal stabilities. We obtain over 2,000 solvent removal stability
measures from text mining and 3,000 thermal decomposition temperatures from
thermogravimetric analysis data. We assess the validity of our NLP methods and
the accuracy of our extracted data by comparing to a hand-labeled subset.
Machine learning (ML, i.e. artificial neural network) models trained on this
data using graph- and pore-geometry-based representations enable prediction of
stability on new MOFs with quantified uncertainty. Our web interface,
MOFSimplify, provides users access to our curated data and enables them to
harness that data for predictions on new MOFs. MOFSimplify also encourages
community feedback on existing data and on ML model predictions for
community-based active learning for improved MOF stability models.
Related papers
- Improving the Stability of GNN Force Field Models by Reducing Feature Correlation [9.546348080237747]
We propose a feature correlation based method for GNNFF models to enhance the stability of MD simulation.
We show our method can significantly improve stability for GNNFF models especially in out-of-distribution data with less than 3% computational overhead.
arXiv Detail & Related papers (2025-02-18T05:18:22Z) - Hybrid machine learning based scale bridging framework for permeability prediction of fibrous structures [0.0]
This study introduces a hybrid machine learning-based scale-bridging framework for predicting the permeability of fibrous textile structures.
Four methodologies were evaluated: Single Scale Method (SSM), Simple Upscaling Method (SUM), Scale-Bridging Method (SBM), and Fully Resolved Model (FRM)
arXiv Detail & Related papers (2025-02-07T16:09:25Z) - On conditional diffusion models for PDE simulations [53.01911265639582]
We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
arXiv Detail & Related papers (2024-10-21T18:31:04Z) - Fine-Tuned Language Models Generate Stable Inorganic Materials as Text [57.01994216693825]
Fine-tuning large language models on text-encoded atomistic data is simple to implement yet reliable.
We show that our strongest model can generate materials predicted to be metastable at about twice the rate of CDVAE.
Because of text prompting's inherent flexibility, our models can simultaneously be used for unconditional generation of stable material.
arXiv Detail & Related papers (2024-02-06T20:35:28Z) - Towards Long-Term predictions of Turbulence using Neural Operators [68.8204255655161]
It aims to develop reduced-order/surrogate models for turbulent flow simulations using Machine Learning.
Different model structures are analyzed, with U-NET structures performing better than the standard FNO in accuracy and stability.
arXiv Detail & Related papers (2023-07-25T14:09:53Z) - A physics-constrained machine learning method for mapping gapless land
surface temperature [6.735896406986559]
In this paper, a physics- ML model is proposed to generate gapless LST with physical meanings and high accuracy.
The light-boosting machine (LGBM) model, which uses only remote sensing data as gradient input serves as the pure ML model.
Compared with a pure physical method and pure ML methods, the PC-LGBM model improves the prediction accuracy and physical interpretability of LST.
arXiv Detail & Related papers (2023-07-03T01:44:48Z) - A Database of Ultrastable MOFs Reassembled from Stable Fragments with
Machine Learning Models [0.3710026260502075]
We leverage community knowledge and machine learning models to identify metal-organic frameworks (MOFs) that are thermally stable and stable upon activation.
We make a new hypothetical MOF database of over 50,000 structures that samples orders of magnitude more connectivity nets and inorganic building blocks than prior databases.
This database shows an order of magnitude enrichment of ultrastable MOF structures that are stable upon activation and more than one standard deviation more thermally stable than the average experimentally characterized MOF.
arXiv Detail & Related papers (2022-10-25T17:38:42Z) - Prediction of liquid fuel properties using machine learning models with
Gaussian processes and probabilistic conditional generative learning [56.67751936864119]
The present work aims to construct cheap-to-compute machine learning (ML) models to act as closure equations for predicting the physical properties of alternative fuels.
Those models can be trained using the database from MD simulations and/or experimental measurements in a data-fusion-fidelity approach.
The results show that ML models can predict accurately the fuel properties of a wide range of pressure and temperature conditions.
arXiv Detail & Related papers (2021-10-18T14:43:50Z) - Using Machine Learning and Data Mining to Leverage Community Knowledge
for the Engineering of Stable Metal-Organic Frameworks [0.9187159782788578]
MOFs hold promise for engineering challenges ranging from gas separations to stability.
To overcome limitation, we extract thousands of published reports of the key aspects of MOF stability necessary for their practical application.
We use natural language processing and automated image analysis to obtain over 2,000 solvent-removal measures and 3,000 thermal temperatures.
arXiv Detail & Related papers (2021-06-24T21:35:26Z) - Learning representations with end-to-end models for improved remaining
useful life prognostics [64.80885001058572]
The remaining Useful Life (RUL) of equipment is defined as the duration between the current time and its failure.
We propose an end-to-end deep learning model based on multi-layer perceptron and long short-term memory layers (LSTM) to predict the RUL.
We will discuss how the proposed end-to-end model is able to achieve such good results and compare it to other deep learning and state-of-the-art methods.
arXiv Detail & Related papers (2021-04-11T16:45:18Z) - VAE-LIME: Deep Generative Model Based Approach for Local Data-Driven
Model Interpretability Applied to the Ironmaking Industry [70.10343492784465]
It is necessary to expose to the process engineer, not solely the model predictions, but also their interpretability.
Model-agnostic local interpretability solutions based on LIME have recently emerged to improve the original method.
We present in this paper a novel approach, VAE-LIME, for local interpretability of data-driven models forecasting the temperature of the hot metal produced by a blast furnace.
arXiv Detail & Related papers (2020-07-15T07:07:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.