Dara: Automated multiple-hypothesis phase identification and refinement from powder X-ray diffraction
- URL: http://arxiv.org/abs/2510.19667v1
- Date: Wed, 22 Oct 2025 15:13:47 GMT
- Title: Dara: Automated multiple-hypothesis phase identification and refinement from powder X-ray diffraction
- Authors: Yuxing Fei, Matthew J. McDermott, Christopher L. Rom, Shilong Wang, Gerbrand Ceder,
- Abstract summary: Dara is a framework designed to automate the robust identification and refinement of multiple phases from powder XRD data.<n>Key features include structural database filtering, automatic clustering of isostructural phases during tree expansion, peak-matching-based scoring to identify promising phases for refinement.
- Score: 2.7076607398164705
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Powder X-ray diffraction (XRD) is a foundational technique for characterizing crystalline materials. However, the reliable interpretation of XRD patterns, particularly in multiphase systems, remains a manual and expertise-demanding task. As a characterization method that only provides structural information, multiple reference phases can often be fit to a single pattern, leading to potential misinterpretation when alternative solutions are overlooked. To ease humans' efforts and address the challenge, we introduce Dara (Data-driven Automated Rietveld Analysis), a framework designed to automate the robust identification and refinement of multiple phases from powder XRD data. Dara performs an exhaustive tree search over all plausible phase combinations within a given chemical space and validates each hypothesis using a robust Rietveld refinement routine (BGMN). Key features include structural database filtering, automatic clustering of isostructural phases during tree expansion, peak-matching-based scoring to identify promising phases for refinement. When ambiguity exists, Dara generates multiple hypothesis which can then be decided between by human experts or with further characteriztion tools. By enhancing the reliability and accuracy of phase identification, Dara enables scalable analysis of realistic complex XRD patterns and provides a foundation for integration into multimodal characterization workflows, moving toward fully self-driving materials discovery.
Related papers
- AI-Driven Structure Refinement of X-ray Diffraction [6.970927372252305]
We introduce the whole-pattern expectation--maximization algorithm, which turns Bragg's law into an explicit constraint within a batch expectation--maximization framework.<n>By providing Bragg-consistent, uncertainty-aware intensity as a refinement-ready interface, WPEM closes the gap between AI-generated hypotheses and diffraction-admissible structure refinement.
arXiv Detail & Related papers (2026-02-18T11:14:35Z) - On the Identifiability of Regime-Switching Models with Multi-Lag Dependencies [42.274922469778545]
We develop a general theoretical framework for multi-lag Regime-Switching Models (RSMs)<n>We prove identifiability of both the number of regimes and the multi-lag transitions in a nonlinear-Gaussian setting.<n>Our results hold in a fully unsupervised setting through architectural and noise assumptions.
arXiv Detail & Related papers (2026-01-06T15:12:39Z) - FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis [92.23551599659186]
Time series analysis plays a vital role in fields such as finance, healthcare, industry, and meteorology.<n>FusAD is a unified analysis framework designed for diverse time series tasks.
arXiv Detail & Related papers (2025-12-16T04:34:27Z) - DiSE: A diffusion probabilistic model for automatic structure elucidation of organic compounds [17.43184484460819]
DiSE is an end-to-end diffusion-based generative model that integrates multiple spectroscopic modalities.<n>It achieves superior accuracy, strong generalization across chemically diverse datasets, and robustness to experimental data despite being trained on calculated spectra.
arXiv Detail & Related papers (2025-10-30T08:10:03Z) - Unifying Tree Search Algorithm and Reward Design for LLM Reasoning: A Survey [92.71325249013535]
Deliberative tree search is a cornerstone of Large Language Model (LLM) research.<n>This paper introduces a unified framework that deconstructs search algorithms into three core components.
arXiv Detail & Related papers (2025-10-11T03:29:18Z) - Spectra-to-Structure and Structure-to-Spectra Inference Across the Periodic Table [49.65586812435899]
XAStruct is a learning-based system capable of both predicting XAS spectra from crystal structures and inferring local structural descriptors from XAS input.<n>XAStruct is trained on a large-scale dataset spanning over 70 elements across the periodic table.
arXiv Detail & Related papers (2025-06-13T15:58:05Z) - Anomaly Detection and Generation with Diffusion Models: A Survey [51.61574868316922]
Anomaly detection (AD) plays a pivotal role across diverse domains, including cybersecurity, finance, healthcare, and industrial manufacturing.<n>Recent advancements in deep learning, specifically diffusion models (DMs), have sparked significant interest.<n>This survey aims to guide researchers and practitioners in leveraging DMs for innovative AD solutions across diverse applications.
arXiv Detail & Related papers (2025-06-11T03:29:18Z) - Spatial-Temporal-Spectral Unified Modeling for Remote Sensing Dense Prediction [20.1863553357121]
Current deep learning architectures for remote sensing are fundamentally rigid.<n>We introduce the Spatial-Temporal-Spectral Unified Network (STSUN) for unified modeling.<n> STSUN can adapt to input and output data with arbitrary spatial sizes, temporal lengths, and spectral bands.<n>It unifies various dense prediction tasks and diverse semantic class predictions.
arXiv Detail & Related papers (2025-05-18T07:39:17Z) - End-to-End Crystal Structure Prediction from Powder X-Ray Diffraction [37.563382606039006]
This study introduces XtalNet, the first equivariant deep generative model for end-to-end crystal structure prediction from PXRD.<n>XtalNet leverages PXRD as an additional condition, eliminating ambiguity and enabling the generation of complex organic structures with up to 400 atoms in the unit cell.<n>XtalNet achieves a top-10 Match Rate of 90.2% and 79% for hMOF-100 and hMOF-400 in conditional crystal structure prediction task, respectively.
arXiv Detail & Related papers (2024-01-08T12:50:17Z) - Probabilistic Phase Labeling and Lattice Refinement for Autonomous
Material Research [20.78180998995325]
We present CrystalShift, an efficient algorithm for probabilistic XRD phase labeling.
We demonstrate that CrystalShift provides robust probability, outperforming existing methods on synthetic and experimental datasets.
In addition to efficient phase-mapping, CrystalShift offers quantitative insights into materials' structural parameters, which facilitate expert evaluation and AI-based modeling of the phase space.
arXiv Detail & Related papers (2023-08-15T17:38:38Z) - Latent Processes Identification From Multi-View Time Series [17.33428123777779]
We propose a novel framework that employs the contrastive learning technique to invert the data generative process for enhanced identifiability.
MuLTI integrates a permutation mechanism that merges corresponding overlapped variables by the establishment of an optimal transport formula.
arXiv Detail & Related papers (2023-05-14T14:21:58Z) - Factorization of Multi-Agent Sampling-Based Motion Planning [72.42734061131569]
Modern robotics often involves multiple embodied agents operating within a shared environment.
Standard sampling-based algorithms can be used to search for solutions in the robots' joint space.
We integrate the concept of factorization into sampling-based algorithms, which requires only minimal modifications to existing methods.
We present a general implementation of a factorized SBA, derive an analytical gain in terms of sample complexity for PRM*, and showcase empirical results for RRG.
arXiv Detail & Related papers (2023-04-01T15:50:18Z) - Exploring Supervised Machine Learning for Multi-Phase Identification and
Quantification from Powder X-Ray Diffraction Spectra [1.0660480034605242]
Powder X-ray diffraction analysis is a critical component of materials characterization methodologies.
Deep learning has become a prime focus for predicting crystallographic parameters and features from X-ray spectra.
Here, we are interested in conventional supervised learning algorithms in lieu of deep learning for multi-label crystalline phase identification.
arXiv Detail & Related papers (2022-11-16T00:36:13Z) - An Incremental Phase Mapping Approach for X-ray Diffraction Patterns
using Binary Peak Representations [3.3323560120870988]
We introduce an incremental phase mapping approach based on binary peak representations using a new threshold based fuzzy dissimilarity measure.
We evaluate our method on the composition space of two ternary alloy systems- Co-Ni-Ta and Co-Ti-Ta.
arXiv Detail & Related papers (2022-11-08T05:05:21Z) - Tracking perovskite crystallization via deep learning-based feature
detection on 2D X-ray scattering data [137.47124933818066]
We propose an automated pipeline for the analysis of X-ray diffraction images based on the Faster R-CNN deep learning architecture.
We demonstrate our method on real-time tracking of organic-inorganic perovskite structure crystallization and test it on two applications.
arXiv Detail & Related papers (2022-02-22T15:39:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.