OrgFlow: Generative Modeling of Organic Crystal Structures from Molecular Graphs
- URL: http://arxiv.org/abs/2602.20195v1
- Date: Sun, 22 Feb 2026 04:01:06 GMT
- Title: OrgFlow: Generative Modeling of Organic Crystal Structures from Molecular Graphs
- Authors: Mohammadmahdi Vahediahmar, Matthew A. McDonald, Feng Liu,
- Abstract summary: We introduce a flow-matching model for predicting organic crystal structures directly from molecular graphs.<n>A bond-aware loss guides the model toward realistic local chemistry by enforcing distributions of bond lengths and connectivity.<n>Experiments show that our method achieves a Match Rate more than 10 times higher than existing baselines.
- Score: 4.5375644408112565
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Crystal structure prediction is a long-standing challenge in materials science, with most data-driven methods developed for inorganic systems. This leaves an important gap for organic crystals, which are central to pharmaceuticals, polymers, and functional materials, but present unique challenges, such as larger unit cells and strict chemical connectivity. We introduce a flow-matching model for predicting organic crystal structures directly from molecular graphs. The architecture integrates molecular connectivity with periodic boundary conditions while preserving the symmetries of crystalline systems. A bond-aware loss guides the model toward realistic local chemistry by enforcing distributions of bond lengths and connectivity. To support reliable and efficient training, we built a curated dataset of organic crystals, along with a preprocessing pipeline that precomputes bonds and edges, substantially reducing computational overhead during both training and inference. Experiments show that our method achieves a Match Rate more than 10 times higher than existing baselines while requiring fewer sampling steps for inference. These results establish generative modeling as a practical and scalable framework for organic crystal structure prediction.
Related papers
- MolCrystalFlow: Molecular Crystal Structure Prediction via Flow Matching [6.02313590078714]
We present MolCrystalFlow, a flow-based generative model for molecular crystal structure prediction.<n>The framework disentangles intramolecular complexity from intermolecular packing by embedding molecules as rigid bodies.<n>We benchmark our model against state-of-the-art generative models for large-size periodic crystals and rule-based structure generation methods.
arXiv Detail & Related papers (2026-02-17T21:22:08Z) - OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction [63.318434943975255]
We introduce OXtal, a large-scale 100M parameter all-atom diffusion model that learns the conditional joint distribution over intramolecular conformations and periodic packing.<n>By leveraging a large dataset of 600K experimentally validated crystal structures, OXtal achieves orders-of-improvement over prior ab initio machine learning CSP methods.<n> OXtal attains over 80% packing similarity rate, demonstrating its ability to model both thermodynamic and kinetic regularities of molecular crystallization.
arXiv Detail & Related papers (2025-12-07T20:46:30Z) - A Generation Framework with Strict Constraints for Crystal Materials Design [8.736399863675524]
We present a new constrained generation framework that takes multiple constraints as input and enables the generation of crystal structures with specific chemical and properties.<n>Our method generates crystal structures with a probability of meeting the target properties that is more than twice that of existing approaches.
arXiv Detail & Related papers (2024-11-13T09:36:50Z) - Generative Hierarchical Materials Search [91.93125016916463]
We propose Generative Hierarchical Materials Search (GenMS) for controllable generation of crystal structures.
GenMS consists of (1) a language model that takes high-level natural language as input and generates intermediate textual information about a crystal.
GenMS additionally uses a graph neural network to predict properties (e.g., formation energy) from the generated crystal structures.
arXiv Detail & Related papers (2024-09-10T17:51:28Z) - UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment [51.49238426241974]
This paper introduces UAlign, a template-free graph-to-sequence pipeline for retrosynthesis prediction.
By combining graph neural networks and Transformers, our method can more effectively leverage the inherent graph structure of molecules.
arXiv Detail & Related papers (2024-03-25T03:23:03Z) - Scalable Diffusion for Materials Generation [99.71001883652211]
We develop a unified crystal representation that can represent any crystal structure (UniMat)
UniMat can generate high fidelity crystal structures from larger and more complex chemical systems.
We propose additional metrics for evaluating generative models of materials.
arXiv Detail & Related papers (2023-10-18T15:49:39Z) - Latent Conservative Objective Models for Data-Driven Crystal Structure
Prediction [62.36797874900395]
In computational chemistry, crystal structure prediction is an optimization problem.
One approach to tackle this problem involves building simulators based on density functional theory (DFT) followed by running search in simulation.
We show that our approach, dubbed LCOMs (latent conservative objective models), performs comparably to the best current approaches in terms of success rate of structure prediction.
arXiv Detail & Related papers (2023-10-16T04:35:44Z) - Data-Driven Score-Based Models for Generating Stable Structures with
Adaptive Crystal Cells [1.515687944002438]
This work aims at the generation of new crystal structures with desired properties, such as chemical stability and specified chemical composition.
The novelty of the presented approach resides in the fact that the lattice of the crystal cell is not fixed.
A multigraph crystal representation is introduced that respects symmetry constraints, yielding computational advantages.
arXiv Detail & Related papers (2023-10-16T02:53:24Z) - Crystal-GFN: sampling crystals with desirable properties and constraints [103.79058968784163]
We introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials.
In this paper, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench.
The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
arXiv Detail & Related papers (2023-10-07T21:36:55Z) - A data-driven interpretation of the stability of molecular crystals [0.0]
Predicting the stability of crystal structures formed from molecular building blocks is a non-trivial scientific problem.
We introduce a structural descriptor tailored to the prediction of the binding energy for a curated dataset of organic crystals.
We then interpret this library using a low-dimensional representation of the structure-energy landscape.
arXiv Detail & Related papers (2022-09-21T23:32:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.