Accelerating the screening of amorphous polymer electrolytes by learning
to reduce random and systematic errors in molecular dynamics simulations
- URL: http://arxiv.org/abs/2101.05339v1
- Date: Wed, 13 Jan 2021 20:46:24 GMT
- Title: Accelerating the screening of amorphous polymer electrolytes by learning
to reduce random and systematic errors in molecular dynamics simulations
- Authors: Tian Xie, Arthur France-Lanord, Yanming Wang, Jeffrey Lopez, Michael
Austin Stolberg, Megan Hill, Graham Michael Leverick, Rafael
Gomez-Bombarelli, Jeremiah A. Johnson, Yang Shao-Horn, Jeffrey C. Grossman
- Abstract summary: In this work, we aim to screen amorphous polymer electrolytes which are promising candidates for the next generation lithium-ion battery technology.
We demonstrate that a multi-task graph neural network can learn from a large amount of noisy, biased data and a small number of unbiased data.
We screen a space of 6247 polymer electrolytes, orders of magnitude larger than previous computational studies.
- Score: 0.8061216572215162
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning has been widely adopted to accelerate the screening of
materials. Most existing studies implicitly assume that the training data are
generated through a deterministic, unbiased process, but this assumption might
not hold for the simulation of some complex materials. In this work, we aim to
screen amorphous polymer electrolytes which are promising candidates for the
next generation lithium-ion battery technology but extremely expensive to
simulate due to their structural complexity. We demonstrate that a multi-task
graph neural network can learn from a large amount of noisy, biased data and a
small number of unbiased data and reduce both random and systematic errors in
predicting the transport properties of polymer electrolytes. This observation
allows us to achieve accurate predictions on the properties of complex
materials by learning to reduce errors in the training data, instead of running
repetitive, expensive simulations which is conventionally used to reduce
simulation errors. With this approach, we screen a space of 6247 polymer
electrolytes, orders of magnitude larger than previous computational studies.
We also find a good extrapolation performance to the top polymers from a larger
space of 53362 polymers and 31 experimentally-realized polymers. The strategy
employed in this work may be applicable to a broad class of material discovery
problems that involve the simulation of complex, amorphous materials.
Related papers
- Transferring a molecular foundation model for polymer property
predictions [3.067983186439152]
Self-supervised pretraining of transformer models requires large-scale datasets.
We show that using transformers pretrained on small molecules and fine-tuned on polymer properties achieve comparable accuracy to those trained on augmented polymer datasets.
arXiv Detail & Related papers (2023-10-25T19:55:00Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Improving Molecular Representation Learning with Metric
Learning-enhanced Optimal Transport [49.237577649802034]
We develop a novel optimal transport-based algorithm termed MROT to enhance their generalization capability for molecular regression problems.
MROT significantly outperforms state-of-the-art models, showing promising potential in accelerating the discovery of new substances.
arXiv Detail & Related papers (2022-02-13T04:56:18Z) - Designing Machine Learning Surrogates using Outputs of Molecular
Dynamics Simulations as Soft Labels [0.0]
We show that statistical uncertainties associated with the outputs of molecular dynamics simulations can be utilized to train artificial neural networks.
We design soft labels for the simulation outputs by incorporating the uncertainties in the estimated average output quantities.
The approach is illustrated with the design of a surrogate for molecular dynamics simulations of confined electrolytes.
arXiv Detail & Related papers (2021-10-27T19:00:40Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z) - Data-Efficient Learning for Complex and Real-Time Physical Problem
Solving using Augmented Simulation [49.631034790080406]
We present a task for navigating a marble to the center of a circular maze.
We present a model that learns to move a marble in the complex environment within minutes of interacting with the real system.
arXiv Detail & Related papers (2020-11-14T02:03:08Z) - Polymers for Extreme Conditions Designed Using Syntax-Directed
Variational Autoencoders [53.34780987686359]
Machine learning tools are now commonly employed to virtually screen material candidates with desired properties.
This approach is inefficient, and severely constrained by the candidates that human imagination can conceive.
We utilize syntax-directed variational autoencoders (VAE) in tandem with Gaussian process regression (GPR) models to discover polymers expected to be robust under three extreme conditions.
arXiv Detail & Related papers (2020-11-04T21:36:59Z) - Polymer Informatics: Current Status and Critical Next Steps [1.3238373064156097]
Surrogate models are trained on available polymer data for instant property prediction.
Data-driven strategies to tackle unique challenges resulting from the extraordinary chemical and physical diversity of polymers at small and large scales are being explored.
Methods to solve inverse problems, wherein polymer recommendations are made using advanced AI algorithms that meet application targets, are being investigated.
arXiv Detail & Related papers (2020-11-01T14:17:22Z) - Exploring the potential of transfer learning for metamodels of
heterogeneous material deformation [0.0]
We show that transfer learning can be used to leverage both low-fidelity simulation data and simulation data.
We extend Mechanical MNIST, our open source benchmark dataset of heterogeneous material undergoing large deformation.
We show that transferring the knowledge stored in metamodels trained on these low-fidelity simulation results can vastly improve the performance of metamodels used to predict the results of high-fidelity simulations.
arXiv Detail & Related papers (2020-10-28T12:43:46Z) - Numerical simulation, clustering and prediction of multi-component
polymer precipitation [0.7349727826230861]
Multi-component polymer systems are of interest in organic photovoltaic and drug delivery applications.
We use a modified Cahn-Hilliard model to simulate polymer precipitation.
To reduce the required computational costs, we apply machine learning techniques for clustering and consequent prediction of the simulated polymer blend images.
arXiv Detail & Related papers (2020-07-10T09:10:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.