CrystalFormer-CSP: Thinking Fast and Slow for Crystal Structure Prediction
- URL: http://arxiv.org/abs/2512.18251v1
- Date: Sat, 20 Dec 2025 07:22:58 GMT
- Title: CrystalFormer-CSP: Thinking Fast and Slow for Crystal Structure Prediction
- Authors: Zhendong Cao, Shigang Ou, Lei Wang,
- Abstract summary: We present CrystalFormerCSP, an efficient framework that unifies data-driven and physics-driven optimization approaches to predict stable crystal structures for given chemical compositions.<n>The approach combines pretrained generative models for space-group-informed structure generation and a universal machine learning force field for energy minimization.<n>We demonstrate the effectiveness of CrystalFormer-CSP on benchmark problems and showcase its usage via web interface and language model integration.
- Score: 2.110303171517621
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Crystal structure prediction is a fundamental problem in materials science. We present CrystalFormer-CSP, an efficient framework that unifies data-driven heuristic and physics-driven optimization approaches to predict stable crystal structures for given chemical compositions. The approach combines pretrained generative models for space-group-informed structure generation and a universal machine learning force field for energy minimization. Reinforcement fine-tuning can be employed to further boost the accuracy of the framework. We demonstrate the effectiveness of CrystalFormer-CSP on benchmark problems and showcase its usage via web interface and language model integration.
Related papers
- Universal Fine-Grained Symmetry Inference and Enforcement for Rigorous Crystal Structure Prediction [3.5802790319269717]
Crystal structure prediction (CSP) aims to predict the three-dimensional atomic arrangement of a crystal from its composition.<n>Existing deep learning models often treat crystallographic symmetry only as a soft or rely on space group and Wyckoff templates retrieved from known structures.<n>In contrast, our approach leverages large language models to encode chemical semantics and directly generate fine-grained Wyckoff patterns from composition.
arXiv Detail & Related papers (2026-02-19T08:43:25Z) - OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction [63.318434943975255]
We introduce OXtal, a large-scale 100M parameter all-atom diffusion model that learns the conditional joint distribution over intramolecular conformations and periodic packing.<n>By leveraging a large dataset of 600K experimentally validated crystal structures, OXtal achieves orders-of-improvement over prior ab initio machine learning CSP methods.<n> OXtal attains over 80% packing similarity rate, demonstrating its ability to model both thermodynamic and kinetic regularities of molecular crystallization.
arXiv Detail & Related papers (2025-12-07T20:46:30Z) - CrystalDiT: A Diffusion Transformer for Crystal Generation [52.45780803467369]
We present CrystalDiT, a diffusion transformer for crystal structure generation that achieves state-of-the-art performance.<n>CrystalDiT employs a unified transformer that imposes a powerful inductive bias: treating lattice and atomic properties as a single, interdependent system.
arXiv Detail & Related papers (2025-08-13T12:53:32Z) - Siamese Foundation Models for Crystal Structure Prediction [70.63218101004398]
Crystal Structure Prediction (CSP) aims to generate stable crystal structures from compositions.<n>CSP remains a relatively underexplored area.<n>We propose Siamese foundation models specifically designed to address CSP.
arXiv Detail & Related papers (2025-03-13T15:44:16Z) - Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction [16.846289696048647]
We propose a Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction (TransVAE-CSP)<n>TransVAE-CSP learns the characteristic distribution space of stable materials, enabling both the reconstruction and generation of crystal structures.<n> Experimental results on the carbon_24, perov_5, and mp_20 datasets demonstrate that TransVAE-CSP outperforms existing methods in structure reconstruction and generation tasks.
arXiv Detail & Related papers (2025-02-13T15:45:36Z) - deCIFer: Crystal Structure Prediction from Powder Diffraction Data using Autoregressive Language Models [1.231476564107544]
We introduce an autoregressive language model that performs crystal structure prediction (CSP) from powder diffraction data.<n>The presented model, deCIFer, generates crystal structures in the widely used Crystallographic Information File (CIF) format.<n>We train deCIFer on nearly 2.3M crystal structures and validate on diverse sets of PXRD patterns for characterizing challenging inorganic crystal systems.
arXiv Detail & Related papers (2025-02-04T10:09:47Z) - Generative Hierarchical Materials Search [91.93125016916463]
We propose Generative Hierarchical Materials Search (GenMS) for controllable generation of crystal structures.
GenMS consists of (1) a language model that takes high-level natural language as input and generates intermediate textual information about a crystal.
GenMS additionally uses a graph neural network to predict properties (e.g., formation energy) from the generated crystal structures.
arXiv Detail & Related papers (2024-09-10T17:51:28Z) - Crystal-LSBO: Automated Design of De Novo Crystals with Latent Space Bayesian Optimization [11.988832749427077]
We introduce Crystal-LSBO, a de novo design framework for crystals specifically tailored to enhance explorability.
Our study pioneers the use of LSBO for de novo crystal design, demonstrating its efficacy through optimization tasks.
arXiv Detail & Related papers (2024-05-28T07:03:49Z) - Scalable Diffusion for Materials Generation [99.71001883652211]
We develop a unified crystal representation that can represent any crystal structure (UniMat)
UniMat can generate high fidelity crystal structures from larger and more complex chemical systems.
We propose additional metrics for evaluating generative models of materials.
arXiv Detail & Related papers (2023-10-18T15:49:39Z) - Latent Conservative Objective Models for Data-Driven Crystal Structure
Prediction [62.36797874900395]
In computational chemistry, crystal structure prediction is an optimization problem.
One approach to tackle this problem involves building simulators based on density functional theory (DFT) followed by running search in simulation.
We show that our approach, dubbed LCOMs (latent conservative objective models), performs comparably to the best current approaches in terms of success rate of structure prediction.
arXiv Detail & Related papers (2023-10-16T04:35:44Z) - Crystal-GFN: sampling crystals with desirable properties and constraints [103.79058968784163]
We introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials.
In this paper, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench.
The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
arXiv Detail & Related papers (2023-10-07T21:36:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.