Establishing baselines for generative discovery of inorganic crystals
- URL: http://arxiv.org/abs/2501.02144v1
- Date: Sat, 04 Jan 2025 00:14:59 GMT
- Title: Establishing baselines for generative discovery of inorganic crystals
- Authors: Nathan J. Szymanski, Christopher J. Bartel,
- Abstract summary: Generative artificial intelligence offers a promising avenue for materials discovery, yet its advantages over traditional methods remain unclear.<n>In this work, we benchmark two baseline approaches - random enumeration of charge-balanced prototypes and data-driven ion exchange of known compounds.<n>Our results show that established methods such as ion exchange perform comparably well in generating stable materials, although many of these materials tend to closely resemble known compounds.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Generative artificial intelligence offers a promising avenue for materials discovery, yet its advantages over traditional methods remain unclear. In this work, we introduce and benchmark two baseline approaches - random enumeration of charge-balanced prototypes and data-driven ion exchange of known compounds - against three generative models: a variational autoencoder, a large language model, and a diffusion model. Our results show that established methods such as ion exchange perform comparably well in generating stable materials, although many of these materials tend to closely resemble known compounds. In contrast, generative models excel at proposing novel structural frameworks and, when sufficient training data is available, can more effectively target properties such as electronic band gap and bulk modulus while maintaining a high stability rate. To enhance the performance of both the baseline and generative approaches, we implement a post-generation screening step in which all proposed structures are passed through stability and property filters from pre-trained machine learning models including universal interatomic potentials. This low-cost filtering step leads to substantial improvement in the success rates of all methods, remains computationally efficient, and ultimately provides a practical pathway toward more effective generative strategies for materials discovery.
Related papers
- Synergistic Development of Perovskite Memristors and Algorithms for Robust Analog Computing [53.77822620185878]
We propose a synergistic methodology to concurrently optimize perovskite memristor fabrication and develop robust analog DNNs.<n>We develop "BayesMulti", a training strategy utilizing BO-guided noise injection to improve the resistance of analog DNNs to memristor imperfections.<n>Our integrated approach enables use of analog computing in much deeper and wider networks, achieving up to 100-fold improvements.
arXiv Detail & Related papers (2024-12-03T19:20:08Z) - Heuristically Adaptive Diffusion-Model Evolutionary Strategy [1.8299322342860518]
Diffusion Models represent a significant advancement in generative modeling.
Our research reveals a fundamental connection between diffusion models and evolutionary algorithms.
Our framework marks a major algorithmic transition, offering increased flexibility, precision, and control in evolutionary optimization processes.
arXiv Detail & Related papers (2024-11-20T16:06:28Z) - Predicting ionic conductivity in solids from the machine-learned potential energy landscape [68.25662704255433]
We propose an approach for the quick and reliable screening of ionic conductors through the analysis of a universal interatomic potential.
Eight out of the ten highest-ranked materials are confirmed to be superionic at room temperature in first-principles calculations.
Our method achieves a speed-up factor of approximately 50 compared to molecular dynamics driven by a machine-learning potential, and is at least 3,000 times faster compared to first-principles molecular dynamics.
arXiv Detail & Related papers (2024-11-11T09:01:36Z) - Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization [147.7899503829411]
AliDiff is a novel framework to align pretrained target diffusion models with preferred functional properties.
It can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score.
arXiv Detail & Related papers (2024-07-01T06:10:29Z) - PILOT: Equivariant diffusion for pocket conditioned de novo ligand generation with multi-objective guidance via importance sampling [8.619610909783441]
We propose an in-silico approach for the $textitde novo$ generation of 3D ligand structures using the equivariant diffusion model PILOT.
Its multi-objective-based importance sampling strategy is designed to direct the model towards molecules that exhibit desired characteristics.
We employ PILOT to generate novel metrics for unseen protein pockets from the Kinodata-3D dataset.
arXiv Detail & Related papers (2024-05-23T17:58:28Z) - DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization [49.85944390503957]
DecompOpt is a structure-based molecular optimization method based on a controllable and diffusion model.
We show that DecompOpt can efficiently generate molecules with improved properties than strong de novo baselines.
arXiv Detail & Related papers (2024-03-07T02:53:40Z) - Physics-informed generative model for drug-like molecule conformers [0.0]
We present a diffusion-based, generative model for conformer generation.
Our model is focused on the reproduction of bonded structure and is constructed from the associated terms traditionally found in classical force fields.
Deep learning is used to infer atom typing and geometric parameters from a training set.
arXiv Detail & Related papers (2024-02-29T17:11:08Z) - Improved prediction of ligand-protein binding affinities by meta-modeling [1.3859669037499769]
We develop a framework to integrate published force-field-based empirical docking and sequence-based deep learning models.
We show that many of our meta-models significantly improve affinity predictions over base models.
Our best meta-models achieve comparable performance to state-of-the-art deep learning tools exclusively based on 3D structures.
arXiv Detail & Related papers (2023-10-05T23:46:45Z) - HyperImpute: Generalized Iterative Imputation with Automatic Model
Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z) - 3D pride without 2D prejudice: Bias-controlled multi-level generative
models for structure-based ligand design [1.978587235008588]
Data sparsity and bias are two main roadblocks to the development of 3D-aware models.
We propose a first-in-kind training protocol based on multi-level contrastive learning for improved bias control and data efficiency.
arXiv Detail & Related papers (2022-04-22T12:23:59Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.