Identification of high order closure terms from fully kinetic
simulations using machine learning
- URL: http://arxiv.org/abs/2110.09916v1
- Date: Tue, 19 Oct 2021 12:27:02 GMT
- Title: Identification of high order closure terms from fully kinetic
simulations using machine learning
- Authors: Brecht Laperre, Jorge Amaya and Giovanni Lapenta
- Abstract summary: We show how two different machine learning models can synthesize higher-order moments extracted from a kinetic simulation.
The accuracy of the models and their ability to generalize are evaluated and compared to a baseline model.
We learn that both models can capture heat flux and pressure tensor very well, with the gradient boosting regressor being the most stable of the two models.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Simulations of large-scale plasma systems are typically based on fluid
approximations. However, these methods do not capture the small-scale physical
processes available to fully kinetic models. Traditionally, empirical closure
terms are used to express high order moments of the Boltzmann equation, e.g.
the pressure tensor and heat flux. In this paper, we propose different closure
terms extracted using machine learning techniques as an alternative. We show in
this work how two different machine learning models, a multi-layer perceptron
and a gradient boosting regressor, can synthesize higher-order moments
extracted from a fully kinetic simulation. The accuracy of the models and their
ability to generalize are evaluated and compared to a baseline model. When
trained from more extreme simulations, the models showed better extrapolation
in comparison to traditional simulations, indicating the importance of
outliers. We learn that both models can capture heat flux and pressure tensor
very well, with the gradient boosting regressor being the most stable of the
two models in terms of the accuracy. The performance of the tested models in
the regression task opens the way for new experiments in multi-scale modelling.
Related papers
- von Mises Quasi-Processes for Bayesian Circular Regression [57.88921637944379]
We explore a family of expressive and interpretable distributions over circle-valued random functions.
The resulting probability model has connections with continuous spin models in statistical physics.
For posterior inference, we introduce a new Stratonovich-like augmentation that lends itself to fast Markov Chain Monte Carlo sampling.
arXiv Detail & Related papers (2024-06-19T01:57:21Z) - Deep generative modelling of canonical ensemble with differentiable thermal properties [0.9421843976231371]
We propose a variational modelling method with differentiable temperature for canonical ensembles.
Using a deep generative model, the free energy is estimated and minimized simultaneously in a continuous temperature range.
The training process requires no dataset, and works with arbitrary explicit density generative models.
arXiv Detail & Related papers (2024-04-29T03:41:49Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - On Least Square Estimation in Softmax Gating Mixture of Experts [78.3687645289918]
We investigate the performance of the least squares estimators (LSE) under a deterministic MoE model.
We establish a condition called strong identifiability to characterize the convergence behavior of various types of expert functions.
Our findings have important practical implications for expert selection.
arXiv Detail & Related papers (2024-02-05T12:31:18Z) - Analysis of Interpolating Regression Models and the Double Descent
Phenomenon [3.883460584034765]
It is commonly assumed that models which interpolate noisy training data are poor to generalize.
The best models obtained are overparametrized and the testing error exhibits the double descent behavior as the model order increases.
We derive a result based on the behavior of the smallest singular value of the regression matrix that explains the peak location and the double descent shape of the testing error as a function of model order.
arXiv Detail & Related papers (2023-04-17T09:44:33Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Discovery of sparse hysteresis models for piezoelectric materials [1.3669389861593737]
This article presents an approach for modelling in piezoelectric materials using sparse-regression techniques.
The presented approach is compared to traditional regression-based and neural network methods, demonstrating its efficiency and robustness.
arXiv Detail & Related papers (2023-02-10T15:21:36Z) - Automated Dissipation Control for Turbulence Simulation with Shell
Models [1.675857332621569]
The application of machine learning (ML) techniques, especially neural networks, has seen tremendous success at processing images and language.
In this work we construct a strongly simplified representation of turbulence by using the Gledzer-Ohkitani-Yamada shell model.
We propose an approach that aims to reconstruct statistical properties of turbulence such as the self-similar inertial-range scaling.
arXiv Detail & Related papers (2022-01-07T15:03:52Z) - Closed-form Continuous-Depth Models [99.40335716948101]
Continuous-depth neural models rely on advanced numerical differential equation solvers.
We present a new family of models, termed Closed-form Continuous-depth (CfC) networks, that are simple to describe and at least one order of magnitude faster.
arXiv Detail & Related papers (2021-06-25T22:08:51Z) - Hybrid modeling: Applications in real-time diagnosis [64.5040763067757]
We outline a novel hybrid modeling approach that combines machine learning inspired models and physics-based models.
We are using such models for real-time diagnosis applications.
arXiv Detail & Related papers (2020-03-04T00:44:57Z) - Data-Driven Discovery of Coarse-Grained Equations [0.0]
Multiscale modeling and simulations are two areas where learning on simulated data can lead to such discovery.
We replace the human discovery of such models with a machine-learning strategy based on sparse regression that can be executed in two modes.
A series of examples demonstrates the accuracy, robustness, and limitations of our approach to equation discovery.
arXiv Detail & Related papers (2020-01-30T23:41:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.