CrystalDiT: A Diffusion Transformer for Crystal Generation
- URL: http://arxiv.org/abs/2508.16614v2
- Date: Wed, 27 Aug 2025 03:45:12 GMT
- Title: CrystalDiT: A Diffusion Transformer for Crystal Generation
- Authors: Xiaohan Yi, Guikun Xu, Xi Xiao, Zhong Zhang, Liu Liu, Yatao Bian, Peilin Zhao,
- Abstract summary: We present CrystalDiT, a diffusion transformer for crystal structure generation that achieves state-of-the-art performance.<n>CrystalDiT employs a unified transformer that imposes a powerful inductive bias: treating lattice and atomic properties as a single, interdependent system.
- Score: 52.45780803467369
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present CrystalDiT, a diffusion transformer for crystal structure generation that achieves state-of-the-art performance by challenging the trend of architectural complexity. Instead of intricate, multi-stream designs, CrystalDiT employs a unified transformer that imposes a powerful inductive bias: treating lattice and atomic properties as a single, interdependent system. Combined with a periodic table-based atomic representation and a balanced training strategy, our approach achieves 9.62% SUN (Stable, Unique, Novel) rate on MP-20, substantially outperforming recent methods including FlowMM (4.38%) and MatterGen (3.42%). Notably, CrystalDiT generates 63.28% unique and novel structures while maintaining comparable stability rates, demonstrating that architectural simplicity can be more effective than complexity for materials discovery. Our results suggest that in data-limited scientific domains, carefully designed simple architectures outperform sophisticated alternatives that are prone to overfitting.
Related papers
- OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction [63.318434943975255]
We introduce OXtal, a large-scale 100M parameter all-atom diffusion model that learns the conditional joint distribution over intramolecular conformations and periodic packing.<n>By leveraging a large dataset of 600K experimentally validated crystal structures, OXtal achieves orders-of-improvement over prior ab initio machine learning CSP methods.<n> OXtal attains over 80% packing similarity rate, demonstrating its ability to model both thermodynamic and kinetic regularities of molecular crystallization.
arXiv Detail & Related papers (2025-12-07T20:46:30Z) - MatLLMSearch: Crystal Structure Discovery with Evolution-Guided Large Language Models [27.083255538087215]
We show that pre-trained Large Language Models (LLMs) can inherently generate novel and stable crystal structures without additional fine-tuning.<n>Our framework employs LLMs as intelligent proposal agents within an evolutionary pipeline that guides them to perform implicit crossover and mutation operations.<n>We demonstrate that MatLLMSearch achieves a 78.38% metastable rate validated by machine learning interatomic potentials and 31.7% DFT-verified stability.
arXiv Detail & Related papers (2025-02-28T10:41:16Z) - Active Learning for Conditional Inverse Design with Crystal Generation and Foundation Atomic Models [2.8408587358426725]
We present an active learning framework that combines crystal generation models and foundation atomic models to enhance the accuracy and efficiency of inverse design.<n>Our framework is general to accommodate different crystal generation and foundation atomic models, and establishes a scalable approach for AI-driven materials discovery.
arXiv Detail & Related papers (2025-02-24T09:15:38Z) - Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction [16.846289696048647]
We propose a Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction (TransVAE-CSP)<n>TransVAE-CSP learns the characteristic distribution space of stable materials, enabling both the reconstruction and generation of crystal structures.<n> Experimental results on the carbon_24, perov_5, and mp_20 datasets demonstrate that TransVAE-CSP outperforms existing methods in structure reconstruction and generation tasks.
arXiv Detail & Related papers (2025-02-13T15:45:36Z) - Efficient Symmetry-Aware Materials Generation via Hierarchical Generative Flow Networks [52.13486402193811]
New solid-state materials require rapidly exploring the vast space of crystal structures and locating stable regions.
Existing methods struggle to explore large material spaces and generate diverse samples with desired properties and requirements.
We propose a novel generative model employing a hierarchical exploration strategy to efficiently exploit the symmetry of the materials space to generate crystal structures given desired properties.
arXiv Detail & Related papers (2024-11-06T23:53:34Z) - Generative Inverse Design of Crystal Structures via Diffusion Models with Transformers [1.2289361708127877]
New inorganic materials with promising properties pose a critical challenge, both scientifically and for industrial applications.
Discovery of new inorganic materials with promising properties poses a critical challenge, both scientifically and for industrial applications.
In this study, we explore a new type of diffusion model for the generative inverse design of crystal structures, with a backbone based on a Transformer architecture.
arXiv Detail & Related papers (2024-06-13T16:03:15Z) - Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding [10.170537065646323]
Predicting physical properties of materials from their crystal structures is a fundamental problem in materials science.
We show that crystal structures are infinitely repeating, periodic arrangements of atoms, whose fully connected attention results in infinitely connected attention.
We propose a simple yet effective Transformer-based encoder architecture for crystal structures called Crystalformer.
arXiv Detail & Related papers (2024-03-18T11:37:42Z) - Scalable Diffusion for Materials Generation [99.71001883652211]
We develop a unified crystal representation that can represent any crystal structure (UniMat)
UniMat can generate high fidelity crystal structures from larger and more complex chemical systems.
We propose additional metrics for evaluating generative models of materials.
arXiv Detail & Related papers (2023-10-18T15:49:39Z) - Latent Conservative Objective Models for Data-Driven Crystal Structure
Prediction [62.36797874900395]
In computational chemistry, crystal structure prediction is an optimization problem.
One approach to tackle this problem involves building simulators based on density functional theory (DFT) followed by running search in simulation.
We show that our approach, dubbed LCOMs (latent conservative objective models), performs comparably to the best current approaches in terms of success rate of structure prediction.
arXiv Detail & Related papers (2023-10-16T04:35:44Z) - Crystal-GFN: sampling crystals with desirable properties and constraints [103.79058968784163]
We introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials.
In this paper, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench.
The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
arXiv Detail & Related papers (2023-10-07T21:36:55Z) - An invertible crystallographic representation for general inverse design
of inorganic crystals with targeted properties [10.853822721106205]
We present a framework capable of general inverse design (not limited to a given set of elements or crystal structures)
The framework generates new crystals with user-defined formation energies, bandgap, thermoelectric (TE) power factor, and combinations thereof.
Results represent a significant step toward property-driven general inverse design using generative models.
arXiv Detail & Related papers (2020-05-15T15:58:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.