Topological Descriptors Help Predict Guest Adsorption in Nanoporous
Materials
- URL: http://arxiv.org/abs/2001.05972v3
- Date: Fri, 6 Mar 2020 23:19:22 GMT
- Title: Topological Descriptors Help Predict Guest Adsorption in Nanoporous
Materials
- Authors: Aditi S. Krishnapriyan, Maciej Haranczyk, Dmitriy Morozov
- Abstract summary: We use persistent homology to describe the geometry of nanoporous materials at various scales.
We combine our topological descriptor with traditional structural features and investigate the relative importance of each to the prediction tasks.
- Score: 0.09668407688201358
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning has emerged as an attractive alternative to experiments and
simulations for predicting material properties. Usually, such an approach
relies on specific domain knowledge for feature design: each learning target
requires careful selection of features that an expert recognizes as important
for the specific task. The major drawback of this approach is that computation
of only a few structural features has been implemented so far, and it is
difficult to tell a priori which features are important for a particular
application. The latter problem has been empirically observed for predictors of
guest uptake in nanoporous materials: local and global porosity features become
dominant descriptors at low and high pressures, respectively. We investigate a
feature representation of materials using tools from topological data analysis.
Specifically, we use persistent homology to describe the geometry of nanoporous
materials at various scales. We combine our topological descriptor with
traditional structural features and investigate the relative importance of each
to the prediction tasks. We demonstrate an application of this feature
representation by predicting methane adsorption in zeolites, for pressures in
the range of 1-200 bar. Our results not only show a considerable improvement
compared to the baseline, but they also highlight that topological features
capture information complementary to the structural features: this is
especially important for the adsorption at low pressure, a task particularly
difficult for the traditional features. Furthermore, by investigation of the
importance of individual topological features in the adsorption model, we are
able to pinpoint the location of the pores that correlate best to adsorption at
different pressure, contributing to our atom-level understanding of
structure-property relationships.
Related papers
- Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy [0.4711628883579317]
We present an alternative lightweight curiosity algorithm which actively samples regions with unexplored structure-property relations.
We show that the algorithm outperforms random sampling for predicting properties from structures, and provides a convenient tool for efficient mapping of structure-property relationships in materials science.
arXiv Detail & Related papers (2025-04-28T17:31:29Z) - A Pipeline for Data-Driven Learning of Topological Features with Applications to Protein Stability Prediction [0.0]
We propose a data-driven method to learn interpretable topological features of biomolecular data.
We compare models that leverage automatically-learned structural features against models trained on a large set of biophysical features determined by subject-matter experts (SME)
Our models, based only on topological features of the protein structures, achieved 92%-99% of the performance of SME-based models in terms of the average precision score.
arXiv Detail & Related papers (2024-08-09T03:52:27Z) - Predicting Many Properties of Crystals by a Single Deep Learning Model [2.7892599615881144]
We introduce CrystalBERT, a transformer-based framework that integrates space group, elemental, and unit cell information.
By incorporating all these features, we achieve a high accuracy of 91% in topological classification, surpassing prior studies and identifying previously misclassified topological materials.
arXiv Detail & Related papers (2024-05-29T09:56:00Z) - Keypoint Description by Symmetry Assessment -- Applications in
Biometrics [49.547569925407814]
We present a model-based feature extractor to describe neighborhoods around keypoints by finite expansion.
The iso-curves of such functions are highly symmetric w.r.t. the origin (a keypoint) and the estimated parameters have well defined geometric interpretations.
arXiv Detail & Related papers (2023-11-03T00:49:25Z) - Graph neural networks for the prediction of molecular structure-property
relationships [59.11160990637615]
Graph neural networks (GNNs) are a novel machine learning method that directly work on the molecular graph.
GNNs allow to learn properties in an end-to-end fashion, thereby avoiding the need for informative descriptors.
We describe the fundamentals of GNNs and demonstrate the application of GNNs via two examples for molecular property prediction.
arXiv Detail & Related papers (2022-07-25T11:30:44Z) - What Image Features Boost Housing Market Predictions? [81.32205133298254]
We propose a set of techniques for the extraction of visual features for efficient numerical inclusion in predictive algorithms.
We discuss techniques such as Shannon's entropy, calculating the center of gravity, employing image segmentation, and using Convolutional Neural Networks.
The set of 40 image features selected here carries a significant amount of predictive power and outperforms some of the strongest metadata predictors.
arXiv Detail & Related papers (2021-07-15T06:32:10Z) - Smart Vectorizations for Single and Multiparameter Persistence [8.504400925390296]
We introduce two new topological summaries for single and multi-persistence persistence, namely, saw functions and multi-persistence grid functions.
These new topological summaries can be regarded as the complexity measures of the evolving subspaces determined by the filtration.
We derive theoretical guarantees on the stability of the new saw and multi-persistence grid functions and illustrate their applicability for graph classification tasks.
arXiv Detail & Related papers (2021-04-10T15:09:31Z) - Machine learning with persistent homology and chemical word embeddings
improves prediction accuracy and interpretability in metal-organic frameworks [0.07874708385247352]
We introduce an end-to-end machine learning model that automatically generates descriptors that capture a complex representation of a material's structure and chemistry.
It automatically encapsulates geometric and chemical information directly from the material system.
Our results show considerable improvement in both accuracy and transferability across targets compared to models constructed from the commonly-used, manually-curated features.
arXiv Detail & Related papers (2020-10-01T16:31:46Z) - The role of feature space in atomistic learning [62.997667081978825]
Physically-inspired descriptors play a key role in the application of machine-learning techniques to atomistic simulations.
We introduce a framework to compare different sets of descriptors, and different ways of transforming them by means of metrics and kernels.
We compare representations built in terms of n-body correlations of the atom density, quantitatively assessing the information loss associated with the use of low-order features.
arXiv Detail & Related papers (2020-09-06T14:12:09Z) - A Trainable Optimal Transport Embedding for Feature Aggregation and its
Relationship to Attention [96.77554122595578]
We introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal transport plan between the set and a trainable reference.
Our approach scales to large datasets and allows end-to-end training of the reference, while also providing a simple unsupervised learning mechanism with small computational cost.
arXiv Detail & Related papers (2020-06-22T08:35:58Z) - Explainable Deep Relational Networks for Predicting Compound-Protein
Affinities and Contacts [80.69440684790925]
DeepRelations is a physics-inspired deep relational network with intrinsically explainable architecture.
It shows superior interpretability to the state-of-the-art.
It boosts the AUPRC of contact prediction 9.5, 16.9, 19.3 and 5.7-fold for the test, compound-unique, protein-unique, and both-unique sets.
arXiv Detail & Related papers (2019-12-29T00:14:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.