Universal Fine-Grained Symmetry Inference and Enforcement for Rigorous Crystal Structure Prediction
- URL: http://arxiv.org/abs/2602.17176v1
- Date: Thu, 19 Feb 2026 08:43:25 GMT
- Title: Universal Fine-Grained Symmetry Inference and Enforcement for Rigorous Crystal Structure Prediction
- Authors: Shi Yin, Jinming Mu, Xudong Zhu, Lixin He,
- Abstract summary: Crystal structure prediction (CSP) aims to predict the three-dimensional atomic arrangement of a crystal from its composition.<n>Existing deep learning models often treat crystallographic symmetry only as a soft or rely on space group and Wyckoff templates retrieved from known structures.<n>In contrast, our approach leverages large language models to encode chemical semantics and directly generate fine-grained Wyckoff patterns from composition.
- Score: 3.5802790319269717
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Crystal structure prediction (CSP), which aims to predict the three-dimensional atomic arrangement of a crystal from its composition, is central to materials discovery and mechanistic understanding. Existing deep learning models often treat crystallographic symmetry only as a soft heuristic or rely on space group and Wyckoff templates retrieved from known structures, which limits both physical fidelity and the ability to discover genuinely new material structures. In contrast to retrieval-based methods, our approach leverages large language models to encode chemical semantics and directly generate fine-grained Wyckoff patterns from composition, effectively circumventing the limitations inherent to database lookups. Crucially, we incorporate domain knowledge into the generative process through an efficient constrained-optimization search that rigorously enforces algebraic consistency between site multiplicities and atomic stoichiometry. By integrating this symmetry-consistent template into a diffusion backbone, our approach constrains the stochastic generative trajectory to a physically valid geometric manifold. This framework achieves state-of-the-art performance across stability, uniqueness, and novelty (SUN) benchmarks, alongside superior matching performance, thereby establishing a new paradigm for the rigorous exploration of targeted crystallographic space. This framework enables efficient expansion into previously uncharted materials space, eliminating reliance on existing databases or a priori structural knowledge.
Related papers
- CrystalFormer-CSP: Thinking Fast and Slow for Crystal Structure Prediction [2.110303171517621]
We present CrystalFormerCSP, an efficient framework that unifies data-driven and physics-driven optimization approaches to predict stable crystal structures for given chemical compositions.<n>The approach combines pretrained generative models for space-group-informed structure generation and a universal machine learning force field for energy minimization.<n>We demonstrate the effectiveness of CrystalFormer-CSP on benchmark problems and showcase its usage via web interface and language model integration.
arXiv Detail & Related papers (2025-12-20T07:22:58Z) - OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction [63.318434943975255]
We introduce OXtal, a large-scale 100M parameter all-atom diffusion model that learns the conditional joint distribution over intramolecular conformations and periodic packing.<n>By leveraging a large dataset of 600K experimentally validated crystal structures, OXtal achieves orders-of-improvement over prior ab initio machine learning CSP methods.<n> OXtal attains over 80% packing similarity rate, demonstrating its ability to model both thermodynamic and kinetic regularities of molecular crystallization.
arXiv Detail & Related papers (2025-12-07T20:46:30Z) - Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning [13.437119411600499]
We introduce a reinforcement learning framework that guides latent denoising diffusion models toward diverse, yet thermodynamically viable crystalline compounds.<n>Our approach integrates group relative policy optimisation with verifiable, multi-objective rewards that jointly balance creativity, stability, and diversity.<n>This approach establishes a modular foundation for controllable AI-driven inverse design that addresses the novelty-validity trade-off across scientific discovery applications of generative models.
arXiv Detail & Related papers (2025-11-10T14:48:49Z) - CLOUD: A Scalable and Physics-Informed Foundation Model for Crystal Representation Learning [0.0]
We introduce CLOUD (Crystal Language mOdel for Unified and Differentiable materials modeling), a transformer-based framework trained on a novel Symmetry-Consistent SCOPE (SCOPE)<n>CLOUD is pre-trained on over six million crystal structures and achieves competitive performance in predicting a wide range of material properties.<n>As proof of concept of differentiable materials modeling, CLOUD is applied to predict the phonon internal energy and heat capacity.
arXiv Detail & Related papers (2025-06-19T15:45:24Z) - High-Fidelity Scientific Simulation Surrogates via Adaptive Implicit Neural Representations [51.90920900332569]
Implicit neural representations (INRs) offer a compact and continuous framework for modeling spatially structured data.<n>Recent approaches address this by introducing additional features along rigid geometric structures.<n>We propose a simple yet effective alternative: Feature-Adaptive INR (FA-INR)
arXiv Detail & Related papers (2025-06-07T16:45:17Z) - Geometry-Editable and Appearance-Preserving Object Compositon [67.98806888489385]
General object composition (GOC) aims to seamlessly integrate a target object into a background scene with desired geometric properties.<n>Recent approaches derive semantic embeddings and integrate them into advanced diffusion models to enable geometry-editable generation.<n>We introduce a Disentangled Geometry-editable and Appearance-preserving Diffusion model that first leverages semantic embeddings to implicitly capture desired geometric transformations.
arXiv Detail & Related papers (2025-05-27T09:05:28Z) - Learning Identifiable Structures Helps Avoid Bias in DNN-based Supervised Causal Learning [56.22841701016295]
Supervised Causal Learning (SCL) is an emerging paradigm in this field.<n>Existing Deep Neural Network (DNN)-based methods commonly adopt the "Node-Edge approach"
arXiv Detail & Related papers (2025-02-15T19:10:35Z) - ReciNet: Reciprocal Space-Aware Long-Range Modeling for Crystalline Property Prediction [28.923772205970497]
ReciNet is a novel architecture that integrates geometric GNNs and reciprocal blocks to model short-range and long-range interactions.<n>We show that ReciNet achieves state-of-the-art predictive accuracy across a range of crystal property prediction tasks.
arXiv Detail & Related papers (2025-02-04T22:31:39Z) - Open Materials Generation with Stochastic Interpolants [14.939468363546384]
Open Materials Generation (OMatG) is a unifying framework for the generative design and discovery of crystalline materials.<n>OMatG employs inorganic interpolants to bridge an arbitrary base distribution to the target distribution of inorganic crystals.<n>We benchmark OMatG's performance on two tasks: Crystal Structure Prediction and 'de novo' generation.
arXiv Detail & Related papers (2025-02-04T18:56:47Z) - MIND: Microstructure INverse Design with Generative Hybrid Neural Representation [25.55691106041371]
inverse design of microstructures plays a pivotal role in optimizing metamaterials with specific, targeted physical properties.<n>We present a novel generative model that integrates latent diffusion with Holoplane, an advanced hybrid neural representation that simultaneously encodes both geometric and physical properties.<n>Our approach generalizes across multiple microstructure classes, enabling the generation of diverse, tileable microstructures with significantly improved property accuracy and enhanced control over geometric validity.
arXiv Detail & Related papers (2025-02-01T20:25:47Z) - Efficient Symmetry-Aware Materials Generation via Hierarchical Generative Flow Networks [52.13486402193811]
New solid-state materials require rapidly exploring the vast space of crystal structures and locating stable regions.
Existing methods struggle to explore large material spaces and generate diverse samples with desired properties and requirements.
We propose a novel generative model employing a hierarchical exploration strategy to efficiently exploit the symmetry of the materials space to generate crystal structures given desired properties.
arXiv Detail & Related papers (2024-11-06T23:53:34Z) - Latent Conservative Objective Models for Data-Driven Crystal Structure
Prediction [62.36797874900395]
In computational chemistry, crystal structure prediction is an optimization problem.
One approach to tackle this problem involves building simulators based on density functional theory (DFT) followed by running search in simulation.
We show that our approach, dubbed LCOMs (latent conservative objective models), performs comparably to the best current approaches in terms of success rate of structure prediction.
arXiv Detail & Related papers (2023-10-16T04:35:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.