Crystal-LSBO: Automated Design of De Novo Crystals with Latent Space Bayesian Optimization
- URL: http://arxiv.org/abs/2405.17881v2
- Date: Sat, 28 Sep 2024 01:25:05 GMT
- Title: Crystal-LSBO: Automated Design of De Novo Crystals with Latent Space Bayesian Optimization
- Authors: Onur Boyar, Yanheng Gu, Yuji Tanaka, Shunsuke Tonogai, Tomoya Itakura, Ichiro Takeuchi,
- Abstract summary: We introduce Crystal-LSBO, a de novo design framework for crystals specifically tailored to enhance explorability.
Our study pioneers the use of LSBO for de novo crystal design, demonstrating its efficacy through optimization tasks.
- Score: 11.988832749427077
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Generative modeling of crystal structures is significantly challenged by the complexity of input data, which constrains the ability of these models to explore and discover novel crystals. This complexity often confines de novo design methodologies to merely small perturbations of known crystals and hampers the effective application of advanced optimization techniques. One such optimization technique, Latent Space Bayesian Optimization (LSBO) has demonstrated promising results in uncovering novel objects across various domains, especially when combined with Variational Autoencoders (VAEs). Recognizing LSBO's potential and the critical need for innovative crystal discovery, we introduce Crystal-LSBO, a de novo design framework for crystals specifically tailored to enhance explorability within LSBO frameworks. Crystal-LSBO employs multiple VAEs, each dedicated to a distinct aspect of crystal structure: lattice, coordinates, and chemical elements, orchestrated by an integrative model that synthesizes these components into a cohesive output. This setup not only streamlines the learning process but also produces explorable latent spaces thanks to the decreased complexity of the learning task for each model, enabling LSBO approaches to operate. Our study pioneers the use of LSBO for de novo crystal design, demonstrating its efficacy through optimization tasks focused mainly on formation energy values. Our results highlight the effectiveness of our methodology, offering a new perspective for de novo crystal discovery.
Related papers
- Universal Fine-Grained Symmetry Inference and Enforcement for Rigorous Crystal Structure Prediction [3.5802790319269717]
Crystal structure prediction (CSP) aims to predict the three-dimensional atomic arrangement of a crystal from its composition.<n>Existing deep learning models often treat crystallographic symmetry only as a soft or rely on space group and Wyckoff templates retrieved from known structures.<n>In contrast, our approach leverages large language models to encode chemical semantics and directly generate fine-grained Wyckoff patterns from composition.
arXiv Detail & Related papers (2026-02-19T08:43:25Z) - PeroMAS: A Multi-agent System of Perovskite Material Discovery [51.859972927223936]
Perovskite Solar Cells (PSCs) are renowned for their superior optoelectronic performance and cost potential.<n>Existing AI approaches focus predominantly on discrete models, including material design, process optimization, and property prediction.<n>We propose a multi-agent system for perovskite material discovery, named PeroMAS.
arXiv Detail & Related papers (2026-02-10T09:33:06Z) - CrystalFormer-CSP: Thinking Fast and Slow for Crystal Structure Prediction [2.110303171517621]
We present CrystalFormerCSP, an efficient framework that unifies data-driven and physics-driven optimization approaches to predict stable crystal structures for given chemical compositions.<n>The approach combines pretrained generative models for space-group-informed structure generation and a universal machine learning force field for energy minimization.<n>We demonstrate the effectiveness of CrystalFormer-CSP on benchmark problems and showcase its usage via web interface and language model integration.
arXiv Detail & Related papers (2025-12-20T07:22:58Z) - OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction [63.318434943975255]
We introduce OXtal, a large-scale 100M parameter all-atom diffusion model that learns the conditional joint distribution over intramolecular conformations and periodic packing.<n>By leveraging a large dataset of 600K experimentally validated crystal structures, OXtal achieves orders-of-improvement over prior ab initio machine learning CSP methods.<n> OXtal attains over 80% packing similarity rate, demonstrating its ability to model both thermodynamic and kinetic regularities of molecular crystallization.
arXiv Detail & Related papers (2025-12-07T20:46:30Z) - CrystalICL: Enabling In-Context Learning for Crystal Generation [12.641999605656409]
Large language models (LLMs) have demonstrated strong in-context learning (ICL) capabilities.<n>Existing LLM-based crystal generation approaches are limited to zero-shot scenarios and are unable to benefit from few-shot scenarios.<n>We propose CrystalICL, a novel model designed for few-shot crystal generation.
arXiv Detail & Related papers (2025-08-27T07:49:27Z) - CrystalDiT: A Diffusion Transformer for Crystal Generation [52.45780803467369]
We present CrystalDiT, a diffusion transformer for crystal structure generation that achieves state-of-the-art performance.<n>CrystalDiT employs a unified transformer that imposes a powerful inductive bias: treating lattice and atomic properties as a single, interdependent system.
arXiv Detail & Related papers (2025-08-13T12:53:32Z) - Large Language Models Are Innate Crystal Structure Generators [30.44669215588058]
We show that pre-trained Large Language Models can inherently generate stable crystal structures without additional training.
Our framework MatLLMSearch integrates pre-trained LLMs with evolutionary search algorithms, achieving a 78.38% metastable rate.
arXiv Detail & Related papers (2025-02-28T10:41:16Z) - Active Learning for Conditional Inverse Design with Crystal Generation and Foundation Atomic Models [2.8408587358426725]
We present an active learning framework that combines crystal generation models and foundation atomic models to enhance the accuracy and efficiency of inverse design.
Our framework is general to accommodate different crystal generation and foundation atomic models, and establishes a scalable approach for AI-driven materials discovery.
arXiv Detail & Related papers (2025-02-24T09:15:38Z) - Efficient Symmetry-Aware Materials Generation via Hierarchical Generative Flow Networks [52.13486402193811]
New solid-state materials require rapidly exploring the vast space of crystal structures and locating stable regions.
Existing methods struggle to explore large material spaces and generate diverse samples with desired properties and requirements.
We propose a novel generative model employing a hierarchical exploration strategy to efficiently exploit the symmetry of the materials space to generate crystal structures given desired properties.
arXiv Detail & Related papers (2024-11-06T23:53:34Z) - Cliqueformer: Model-Based Optimization with Structured Transformers [102.55764949282906]
We develop a model that learns the structure of an MBO task and empirically leads to improved designs.
We evaluate Cliqueformer on various tasks, ranging from high-dimensional black-box functions to real-world tasks of chemical and genetic design.
arXiv Detail & Related papers (2024-10-17T00:35:47Z) - Adaptive Constraint Integration for Simultaneously Optimizing Crystal Structures with Multiple Targeted Properties [7.559885439354167]
Simultaneous Multi-property optimization using Adaptive Crystal Synthesizer (SMOACS)
SMOACS enables the integration of adaptive constraints into the optimization process without necessitating model retraining.
We have demonstrated the band gap optimization while meeting a challenging constraint, that is, maintaining electrical neutrality in large atomic configurations up to 135 atom sites.
arXiv Detail & Related papers (2024-10-11T06:35:48Z) - Space Group Informed Transformer for Crystalline Materials Generation [2.405914457225118]
We introduce CrystalFormer, a transformer-based autoregressive model specifically designed for space group-controlled generation of crystalline materials.
The incorporation of space group symmetry significantly simplifies the crystal space, which is crucial for data and compute efficient generative modeling of crystalline materials.
arXiv Detail & Related papers (2024-03-23T06:01:45Z) - Diffusion Model for Data-Driven Black-Box Optimization [54.25693582870226]
We focus on diffusion models, a powerful generative AI technology, and investigate their potential for black-box optimization.
We study two practical types of labels: 1) noisy measurements of a real-valued reward function and 2) human preference based on pairwise comparisons.
Our proposed method reformulates the design optimization problem into a conditional sampling problem, which allows us to leverage the power of diffusion models.
arXiv Detail & Related papers (2024-03-20T00:41:12Z) - Latent Conservative Objective Models for Data-Driven Crystal Structure
Prediction [62.36797874900395]
In computational chemistry, crystal structure prediction is an optimization problem.
One approach to tackle this problem involves building simulators based on density functional theory (DFT) followed by running search in simulation.
We show that our approach, dubbed LCOMs (latent conservative objective models), performs comparably to the best current approaches in terms of success rate of structure prediction.
arXiv Detail & Related papers (2023-10-16T04:35:44Z) - Data-Driven Score-Based Models for Generating Stable Structures with
Adaptive Crystal Cells [1.515687944002438]
This work aims at the generation of new crystal structures with desired properties, such as chemical stability and specified chemical composition.
The novelty of the presented approach resides in the fact that the lattice of the crystal cell is not fixed.
A multigraph crystal representation is introduced that respects symmetry constraints, yielding computational advantages.
arXiv Detail & Related papers (2023-10-16T02:53:24Z) - Crystal-GFN: sampling crystals with desirable properties and constraints [103.79058968784163]
We introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials.
In this paper, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench.
The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
arXiv Detail & Related papers (2023-10-07T21:36:55Z) - Neural Structure Fields with Application to Crystal Structure
Autoencoders [10.680545976155173]
We propose neural structure fields (NeSF) as an accurate and practical approach for representing crystal structures using neural networks.
NeSF overcomes the tradeoff between spatial resolution and computational complexity and can represent any crystal structure.
We propose an autoencoder of crystal structures that can recover various crystal structures, such as those of perovskite structure materials and cuprate superconductors.
arXiv Detail & Related papers (2022-12-08T16:41:41Z) - Disentangling multiple scattering with deep learning: application to
strain mapping from electron diffraction patterns [48.53244254413104]
We implement a deep neural network called FCU-Net to invert highly nonlinear electron diffraction patterns into quantitative structure factor images.
We trained the FCU-Net using over 200,000 unique dynamical diffraction patterns which include many different combinations of crystal structures.
Our simulated diffraction pattern library, implementation of FCU-Net, and trained model weights are freely available in open source repositories.
arXiv Detail & Related papers (2022-02-01T03:53:39Z) - Machine Learning Framework for Quantum Sampling of Highly-Constrained,
Continuous Optimization Problems [101.18253437732933]
We develop a generic, machine learning-based framework for mapping continuous-space inverse design problems into surrogate unconstrained binary optimization problems.
We showcase the framework's performance on two inverse design problems by optimizing thermal emitter topologies for thermophotovoltaic applications and (ii) diffractive meta-gratings for highly efficient beam steering.
arXiv Detail & Related papers (2021-05-06T02:22:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.