Decoding Structure-Spectrum Relationships with Physically Organized
Latent Spaces
- URL: http://arxiv.org/abs/2301.04724v1
- Date: Wed, 11 Jan 2023 21:30:22 GMT
- Title: Decoding Structure-Spectrum Relationships with Physically Organized
Latent Spaces
- Authors: Zhu Liang, Matthew R. Carbone, Wei Chen, Fanchen Meng, Eli Stavitski,
Deyu Lu, Mark S. Hybertsen, and Xiaohui Qu
- Abstract summary: A new semi-supervised machine learning method for the discovery of structure-spectrum relationships is developed and demonstrated.
This method constructs a one-to-one mapping between individual structure descriptors and spectral trends.
The RankAAE methodology produces a continuous and interpretable latent space, where each dimension can track an individual structure descriptor.
- Score: 6.36075035468233
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A new semi-supervised machine learning method for the discovery of
structure-spectrum relationships is developed and demonstrated using the
specific example of interpreting X-ray absorption near-edge structure (XANES)
spectra. This method constructs a one-to-one mapping between individual
structure descriptors and spectral trends. Specifically, an adversarial
autoencoder is augmented with a novel rank constraint (RankAAE). The RankAAE
methodology produces a continuous and interpretable latent space, where each
dimension can track an individual structure descriptor. As a part of this
process, the model provides a robust and quantitative measure of the
structure-spectrum relationship by decoupling intertwined spectral
contributions from multiple structural characteristics. This makes it ideal for
spectral interpretation and the discovery of new descriptors. The capability of
this procedure is showcased by considering five local structure descriptors and
a database of over fifty thousand simulated XANES spectra across eight
first-row transition metal oxide families. The resulting structure-spectrum
relationships not only reproduce known trends in the literature, but also
reveal unintuitive ones that are visually indiscernible in large data sets. The
results suggest that the RankAAE methodology has great potential to assist
researchers to interpret complex scientific data, test physical hypotheses, and
reveal new patterns that extend scientific insight.
Related papers
- Multi-modal Machine Learning Analysis of X-ray Absorption Near-Edge Spectra and Pair Distribution Functions: Performance and Interpretability towards Experimental Design [0.0]
We combine information from X-ray absorption near-edge spectra (XANES) and atomic pair distribution functions (PDFs) to extract information about local structure and chemistry of transition metal oxides.
Specifically, we trained random forest models on XANES, PDF, and both of them combined, to extract charge (oxidation) state, coordination number, and mean nearest-neighbor bond length of transition metal cations in oxides.
We find that XANES-only models tend to outperform the PDF-only models for all the tasks, and information from XANES often dominated when the two inputs were combined
arXiv Detail & Related papers (2024-10-22T22:49:31Z) - Fast and Reliable Probabilistic Reflectometry Inversion with Prior-Amortized Neural Posterior Estimation [73.81105275628751]
Finding all structures compatible with reflectometry data is computationally prohibitive for standard algorithms.
We address this lack of reliability with a probabilistic deep learning method that identifies all realistic structures in seconds.
Our method, Prior-Amortized Neural Posterior Estimation (PANPE), combines simulation-based inference with novel adaptive priors.
arXiv Detail & Related papers (2024-07-26T10:29:16Z) - Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for Chemistry [0.1747623282473278]
This dataset comprises simulated $1$H-NMR, $13$C-NMR, HSQC-NMR, Infrared, and Mass spectra for 790k molecules extracted from chemical reactions in patent data.
We provide benchmarks for evaluating single-modality tasks such as structure elucidation, predicting the spectra for a target molecule, and functional group predictions.
arXiv Detail & Related papers (2024-07-04T12:52:48Z) - Datacube segmentation via Deep Spectral Clustering [76.48544221010424]
Extended Vision techniques often pose a challenge in their interpretation.
The huge dimensionality of data cube spectra poses a complex task in its statistical interpretation.
In this paper, we explore the possibility of applying unsupervised clustering methods in encoded space.
A statistical dimensional reduction is performed by an ad hoc trained (Variational) AutoEncoder, while the clustering process is performed by a (learnable) iterative K-Means clustering algorithm.
arXiv Detail & Related papers (2024-01-31T09:31:28Z) - Spectroscopy-Guided Discovery of Three-Dimensional Structures of
Disordered Materials with Diffusion Models [6.97950396242977]
We introduce a new framework based on the diffusion model to predict 3D structures of disordered materials from a target property.
We show that our model can steer the generative process to tailor atomic arrangements for a specific XANES spectrum.
Our work represents a significant stride in bridging the gap between materials characterization and atomic structure determination.
arXiv Detail & Related papers (2023-12-09T05:40:10Z) - On the Joint Interaction of Models, Data, and Features [82.60073661644435]
We introduce a new tool, the interaction tensor, for empirically analyzing the interaction between data and model through features.
Based on these observations, we propose a conceptual framework for feature learning.
Under this framework, the expected accuracy for a single hypothesis and agreement for a pair of hypotheses can both be derived in closed-form.
arXiv Detail & Related papers (2023-06-07T21:35:26Z) - Spatio-Temporal Structure Consistency for Semi-supervised Medical Image
Classification [8.656046905043876]
We propose a novel Spatio-Temporal Structure Consistent (STSC) learning framework.
Specifically, a gram matrix is derived to combine the spatial structure consistency and temporal structure consistency.
We show that our method outperforms state-of-the-art SSL methods.
arXiv Detail & Related papers (2023-03-03T04:18:09Z) - Discovering Class-Specific GAN Controls for Semantic Image Synthesis [73.91655061467988]
We propose a novel method for finding spatially disentangled class-specific directions in the latent space of pretrained SIS models.
We show that the latent directions found by our method can effectively control the local appearance of semantic classes.
arXiv Detail & Related papers (2022-12-02T21:39:26Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Spectral Analysis Network for Deep Representation Learning and Image
Clustering [53.415803942270685]
This paper proposes a new network structure for unsupervised deep representation learning based on spectral analysis.
It can identify the local similarities among images in patch level and thus more robust against occlusion.
It can learn more clustering-friendly representations and is capable to reveal the deep correlations among data samples.
arXiv Detail & Related papers (2020-09-11T05:07:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.