Machine Learning in Thermodynamics: Prediction of Activity Coefficients
by Matrix Completion
- URL: http://arxiv.org/abs/2001.10675v1
- Date: Wed, 29 Jan 2020 03:16:23 GMT
- Title: Machine Learning in Thermodynamics: Prediction of Activity Coefficients
by Matrix Completion
- Authors: Fabian Jirasek, Rodrigo A. S. Alves, Julie Damay, Robert A.
Vandermeulen, Robert Bamler, Michael Bortz, Stephan Mandt, Marius Kloft, Hans
Hasse
- Abstract summary: We propose a probabilistic matrix factorization model for predicting the activity coefficients in arbitrary binary mixtures.
Our method outperforms the state-of-the-art method that has been refined over three decades.
This opens perspectives to novel methods for predicting physico-chemical properties of binary mixtures.
- Score: 34.7384528263504
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Activity coefficients, which are a measure of the non-ideality of liquid
mixtures, are a key property in chemical engineering with relevance to modeling
chemical and phase equilibria as well as transport processes. Although
experimental data on thousands of binary mixtures are available, prediction
methods are needed to calculate the activity coefficients in many relevant
mixtures that have not been explored to-date. In this report, we propose a
probabilistic matrix factorization model for predicting the activity
coefficients in arbitrary binary mixtures. Although no physical descriptors for
the considered components were used, our method outperforms the
state-of-the-art method that has been refined over three decades while
requiring much less training effort. This opens perspectives to novel methods
for predicting physico-chemical properties of binary mixtures with the
potential to revolutionize modeling and simulation in chemical engineering.
Related papers
- Hierarchical Matrix Completion for the Prediction of Properties of Binary Mixtures [3.0478550046333965]
We introduce a novel generic approach for improving data-driven models.
We lump components that behave similarly into chemical classes and model them jointly.
Using clustering leads to significantly improved predictions compared to an MCM without clustering.
arXiv Detail & Related papers (2024-10-08T14:04:30Z) - Balancing Molecular Information and Empirical Data in the Prediction of Physico-Chemical Properties [8.649679686652648]
We propose a general method for combining molecular descriptors with representation learning.
The proposed hybrid model exploits chemical structure information using graph neural networks.
It automatically detects cases where structure-based predictions are unreliable, in which case it corrects them by representation-learning based predictions.
arXiv Detail & Related papers (2024-06-12T10:51:00Z) - Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance [55.872926690722714]
We study the predictability of model performance regarding the mixture proportions in function forms.
We propose nested use of the scaling laws of training steps, model sizes, and our data mixing law.
Our method effectively optimize the training mixture of a 1B model trained for 100B tokens in RedPajama.
arXiv Detail & Related papers (2024-03-25T17:14:00Z) - Accurate melting point prediction through autonomous physics-informed
learning [52.217497897835344]
We present an algorithm for computing melting points by autonomously learning from coexistence simulations in the NPT ensemble.
We demonstrate how incorporating physical models of the solid-liquid coexistence evolution enhances the algorithm's accuracy and enables optimal decision-making.
arXiv Detail & Related papers (2023-06-23T07:53:09Z) - A smile is all you need: Predicting limiting activity coefficients from
SMILES with natural language processing [0.1349420109127767]
We introduce the SMILES-to-Properties-Transformer (SPT), a natural language processing network to predict binary limiting activity coefficients from SMILES codes.
We train our network on a large dataset of synthetic data sampled from COSMO-RS and fine-tune the model on experimental data.
This training strategy enables SPT to accurately predict limiting activity coefficients even for unknown molecules, cutting the mean prediction error in half compared to state-of-the-art models.
arXiv Detail & Related papers (2022-06-15T07:11:37Z) - Hybridizing Physical and Data-driven Prediction Methods for
Physicochemical Properties [19.50116420011026]
We present a generic way to hybridize physical and data-driven methods for predicting physicochemical properties.
The approach distills' the physical method's predictions into a prior model and combines it with sparse experimental data using Bayesian inference.
arXiv Detail & Related papers (2022-02-17T18:15:03Z) - A deep learning driven pseudospectral PCE based FFT homogenization
algorithm for complex microstructures [68.8204255655161]
It is shown that the proposed method is able to predict central moments of interest while being magnitudes faster to evaluate than traditional approaches.
It is shown, that the proposed method is able to predict central moments of interest while being magnitudes faster to evaluate than traditional approaches.
arXiv Detail & Related papers (2021-10-26T07:02:14Z) - Prediction of liquid fuel properties using machine learning models with
Gaussian processes and probabilistic conditional generative learning [56.67751936864119]
The present work aims to construct cheap-to-compute machine learning (ML) models to act as closure equations for predicting the physical properties of alternative fuels.
Those models can be trained using the database from MD simulations and/or experimental measurements in a data-fusion-fidelity approach.
The results show that ML models can predict accurately the fuel properties of a wide range of pressure and temperature conditions.
arXiv Detail & Related papers (2021-10-18T14:43:50Z) - An active learning approach for improving the performance of equilibrium
based chemical simulations [0.0]
In this paper, we propose a novel sequential data-driven method for dealing with equilibrium based chemical simulations.
The proposed method sequentially chooses the most relevant input data at which the function to estimate has to be evaluated to build a surrogate model.
Our active learning method is validated through numerical experiments and applied to a complex chemical system commonly used in geoscience.
arXiv Detail & Related papers (2021-10-15T14:17:28Z) - Quantum-Classical Hybrid Algorithm for the Simulation of All-Electron
Correlation [58.720142291102135]
We present a novel hybrid-classical algorithm that computes a molecule's all-electron energy and properties on the classical computer.
We demonstrate the ability of the quantum-classical hybrid algorithms to achieve chemically relevant results and accuracy on currently available quantum computers.
arXiv Detail & Related papers (2021-06-22T18:00:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.