Geometric Deep Learning for Molecular Crystal Structure Prediction
- URL: http://arxiv.org/abs/2303.10140v1
- Date: Fri, 17 Mar 2023 17:27:47 GMT
- Title: Geometric Deep Learning for Molecular Crystal Structure Prediction
- Authors: Michael Kilgour, Jutta Rogal, Mark Tuckerman
- Abstract summary: We develop and test new machine learning strategies for accelerating molecular crystal structure ranking and crystal property prediction.
Our density prediction model, MolXtalNet-D, achieves state of the art performance, with lower than 2% mean absolute error on a large and diverse test dataset.
Our crystal ranking tool, MolXtalNet-S, correctly discriminates experimental samples from synthetically generated fakes.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We develop and test new machine learning strategies for accelerating
molecular crystal structure ranking and crystal property prediction using tools
from geometric deep learning on molecular graphs. Leveraging developments in
graph-based learning and the availability of large molecular crystal datasets,
we train models for density prediction and stability ranking which are
accurate, fast to evaluate, and applicable to molecules of widely varying size
and composition. Our density prediction model, MolXtalNet-D, achieves state of
the art performance, with lower than 2% mean absolute error on a large and
diverse test dataset. Our crystal ranking tool, MolXtalNet-S, correctly
discriminates experimental samples from synthetically generated fakes and is
further validated through analysis of the submissions to the Cambridge
Structural Database Blind Tests 5 and 6. Our new tools are computationally
cheap and flexible enough to be deployed within an existing crystal structure
prediction pipeline both to reduce the search space and score/filter crystal
candidates.
Related papers
- Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - CrystalX: Ultra-Precision Crystal Structure Resolution and Error Correction Using Deep Learning [44.02107704258412]
We develop CrystalX, a model for ultra-precise structural analysis at the full-atom level.
To validate the model, we employed over 50,000 X-ray diffraction measurements.
This deep learning model revolutionizes the time frame for crystal structure analysis, slashing it down to seconds.
arXiv Detail & Related papers (2024-10-17T16:12:55Z) - Scalable Diffusion for Materials Generation [99.71001883652211]
We develop a unified crystal representation that can represent any crystal structure (UniMat)
UniMat can generate high fidelity crystal structures from larger and more complex chemical systems.
We propose additional metrics for evaluating generative models of materials.
arXiv Detail & Related papers (2023-10-18T15:49:39Z) - Latent Conservative Objective Models for Data-Driven Crystal Structure
Prediction [62.36797874900395]
In computational chemistry, crystal structure prediction is an optimization problem.
One approach to tackle this problem involves building simulators based on density functional theory (DFT) followed by running search in simulation.
We show that our approach, dubbed LCOMs (latent conservative objective models), performs comparably to the best current approaches in terms of success rate of structure prediction.
arXiv Detail & Related papers (2023-10-16T04:35:44Z) - Data-Driven Score-Based Models for Generating Stable Structures with
Adaptive Crystal Cells [1.515687944002438]
This work aims at the generation of new crystal structures with desired properties, such as chemical stability and specified chemical composition.
The novelty of the presented approach resides in the fact that the lattice of the crystal cell is not fixed.
A multigraph crystal representation is introduced that respects symmetry constraints, yielding computational advantages.
arXiv Detail & Related papers (2023-10-16T02:53:24Z) - Crystal-GFN: sampling crystals with desirable properties and constraints [103.79058968784163]
We introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials.
In this paper, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench.
The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
arXiv Detail & Related papers (2023-10-07T21:36:55Z) - CrysGNN : Distilling pre-trained knowledge to enhance property
prediction for crystalline materials [25.622724168215097]
This paper presents CrysGNN, a new pre-trained GNN framework for crystalline materials.
It captures both node and graph level structural information of crystal graphs using unlabelled material data.
We conduct extensive experiments to show that with distilled knowledge from the pre-trained model, all the SOTA algorithms are able to outperform their own vanilla version with good margins.
arXiv Detail & Related papers (2023-01-14T08:12:01Z) - Calibration and generalizability of probabilistic models on low-data
chemical datasets with DIONYSUS [0.0]
We perform an extensive study of the calibration and generalizability of probabilistic machine learning models on small chemical datasets.
We analyse the quality of their predictions and uncertainties in a variety of tasks (binary, regression) and datasets.
We offer practical insights into model and feature choice for modelling small chemical datasets, a common scenario in new chemical experiments.
arXiv Detail & Related papers (2022-12-03T08:19:06Z) - Tracking perovskite crystallization via deep learning-based feature
detection on 2D X-ray scattering data [137.47124933818066]
We propose an automated pipeline for the analysis of X-ray diffraction images based on the Faster R-CNN deep learning architecture.
We demonstrate our method on real-time tracking of organic-inorganic perovskite structure crystallization and test it on two applications.
arXiv Detail & Related papers (2022-02-22T15:39:00Z) - Disentangling multiple scattering with deep learning: application to
strain mapping from electron diffraction patterns [48.53244254413104]
We implement a deep neural network called FCU-Net to invert highly nonlinear electron diffraction patterns into quantitative structure factor images.
We trained the FCU-Net using over 200,000 unique dynamical diffraction patterns which include many different combinations of crystal structures.
Our simulated diffraction pattern library, implementation of FCU-Net, and trained model weights are freely available in open source repositories.
arXiv Detail & Related papers (2022-02-01T03:53:39Z) - On Open and Strong-Scaling Tools for Atom Probe Crystallography:
High-Throughput Methods for Indexing Crystal Structure and Orientation [0.0]
Volumetric crystal structure indexing and orientation mapping are key data processing steps for quantitative studies of spatial correlations.
For atom probe tomography (APT) experiments, the strategy of making comparisons between measured and analytically computed patterns is less robust because many APT datasets may contain substantial noise.
We report how this enables the development of an open-source software tool for strong-scaling and automated identifying of crystal structure and mapping crystal orientation in nanocrystalline APT datasets with multiple phases.
arXiv Detail & Related papers (2020-09-01T22:50:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.