Theory-guided Auto-Encoder for Surrogate Construction and Inverse
Modeling
- URL: http://arxiv.org/abs/2011.08618v1
- Date: Tue, 17 Nov 2020 13:23:03 GMT
- Title: Theory-guided Auto-Encoder for Surrogate Construction and Inverse
Modeling
- Authors: Nanzhe Wang, Haibin Chang, Dongxiao Zhang
- Abstract summary: The framework is built based on the Auto-Encoder architecture of convolutional neural network (CNN)
The governing equations of the studied problems can be discretized and the finite difference scheme of the equations can be embedded into the training of CNN.
The trained TgAE can be used to construct a surrogate that approximates the relationship between the model parameters and responses with limited labeled data.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: A Theory-guided Auto-Encoder (TgAE) framework is proposed for surrogate
construction and is further used for uncertainty quantification and inverse
modeling tasks. The framework is built based on the Auto-Encoder (or
Encoder-Decoder) architecture of convolutional neural network (CNN) via a
theory-guided training process. In order to achieve the theory-guided training,
the governing equations of the studied problems can be discretized and the
finite difference scheme of the equations can be embedded into the training of
CNN. The residual of the discretized governing equations as well as the data
mismatch constitute the loss function of the TgAE. The trained TgAE can be used
to construct a surrogate that approximates the relationship between the model
parameters and responses with limited labeled data. In order to test the
performance of the TgAE, several subsurface flow cases are introduced. The
results show the satisfactory accuracy of the TgAE surrogate and efficiency of
uncertainty quantification tasks can be improved with the TgAE surrogate. The
TgAE also shows good extrapolation ability for cases with different correlation
lengths and variances. Furthermore, the parameter inversion task has been
implemented with the TgAE surrogate and satisfactory results can be obtained.
Related papers
- Enhanced uncertainty quantification variational autoencoders for the solution of Bayesian inverse problems [0.0]
We build upon existing research by proposing a novel loss function to train variational autoencoders for inverse problems.
We provide a theoretical proof of the convergence of the latent states of variational autoencoders to the posterior distribution of the model parameters.
arXiv Detail & Related papers (2025-02-18T18:17:49Z) - Designing DNNs for a trade-off between robustness and processing performance in embedded devices [1.474723404975345]
Machine learning-based embedded systems need to be robust against soft errors.
This paper investigates the suitability of using bounded AFs to improve model robustness against perturbations.
We analyze encoder-decoder fully convolutional models aimed at performing semantic segmentation tasks on hyperspectral images for scene understanding in autonomous driving.
arXiv Detail & Related papers (2024-12-04T19:34:33Z) - Canonical Correlation Guided Deep Neural Network [14.188285111418516]
We present a canonical correlation guided learning framework, which allows to be realized by deep neural networks (CCDNN)
In the proposed method, the optimization formulation is not restricted to maximize correlation, instead we make canonical correlation as a constraint.
To reduce the redundancy induced by correlation, a redundancy filter is designed.
arXiv Detail & Related papers (2024-09-28T16:08:44Z) - DeltaPhi: Learning Physical Trajectory Residual for PDE Solving [54.13671100638092]
We propose and formulate the Physical Trajectory Residual Learning (DeltaPhi)
We learn the surrogate model for the residual operator mapping based on existing neural operator networks.
We conclude that, compared to direct learning, physical residual learning is preferred for PDE solving.
arXiv Detail & Related papers (2024-06-14T07:45:07Z) - Recurrent neural networks and transfer learning for elasto-plasticity in
woven composites [0.0]
This article presents Recurrent Neural Network (RNN) models as a surrogate for computationally intensive meso-scale simulation of woven composites.
A mean-field model generates a comprehensive data set representing elasto-plastic behavior.
In simulations, arbitrary six-dimensional strain histories are used to predict stresses under random walking as the source task and cyclic loading conditions as the target task.
arXiv Detail & Related papers (2023-11-22T14:47:54Z) - End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes [52.818579746354665]
This paper proposes the first end-to-end differentiable meta-BO framework that generalises neural processes to learn acquisition functions via transformer architectures.
We enable this end-to-end framework with reinforcement learning (RL) to tackle the lack of labelled acquisition data.
arXiv Detail & Related papers (2023-05-25T10:58:46Z) - Analyzing the Performance of Deep Encoder-Decoder Networks as Surrogates
for a Diffusion Equation [0.0]
We study the use of encoder-decoder convolutional neural network (CNN) as surrogates for steady-state diffusion solvers.
Our results indicate that increasing the size of the training set has a substantial effect on reducing performance fluctuations and overall error.
arXiv Detail & Related papers (2023-02-07T22:53:19Z) - Uncertainty quantification of two-phase flow in porous media via
coupled-TgNN surrogate model [6.705438773768439]
Uncertainty quantification (UQ) of subsurface two-phase flow usually requires numerous executions of forward simulations under varying conditions.
In this work, a novel coupled theory-guided neural network (TgNN) based surrogate model is built to facilitate efficiency under the premise of satisfactory accuracy.
arXiv Detail & Related papers (2022-05-28T02:33:46Z) - Adaptive Anomaly Detection for Internet of Things in Hierarchical Edge
Computing: A Contextual-Bandit Approach [81.5261621619557]
We propose an adaptive anomaly detection scheme with hierarchical edge computing (HEC)
We first construct multiple anomaly detection DNN models with increasing complexity, and associate each of them to a corresponding HEC layer.
Then, we design an adaptive model selection scheme that is formulated as a contextual-bandit problem and solved by using a reinforcement learning policy network.
arXiv Detail & Related papers (2021-08-09T08:45:47Z) - Rate Distortion Characteristic Modeling for Neural Image Compression [59.25700168404325]
End-to-end optimization capability offers neural image compression (NIC) superior lossy compression performance.
distinct models are required to be trained to reach different points in the rate-distortion (R-D) space.
We make efforts to formulate the essential mathematical functions to describe the R-D behavior of NIC using deep network and statistical modeling.
arXiv Detail & Related papers (2021-06-24T12:23:05Z) - Autoencoding Variational Autoencoder [56.05008520271406]
We study the implications of this behaviour on the learned representations and also the consequences of fixing it by introducing a notion of self consistency.
We show that encoders trained with our self-consistency approach lead to representations that are robust (insensitive) to perturbations in the input introduced by adversarial attacks.
arXiv Detail & Related papers (2020-12-07T14:16:14Z) - Deep Learning of Dynamic Subsurface Flow via Theory-guided Generative
Adversarial Network [0.0]
Theory-guided generative adversarial network (TgGAN) is proposed to solve dynamic partial differential equations (PDEs)
TgGAN is proposed for dynamic subsurface flow with heterogeneous model parameters.
Numerical results demonstrate that the TgGAN model is robust and reliable for deep learning of dynamic PDEs.
arXiv Detail & Related papers (2020-06-02T02:53:26Z) - Supervised Learning for Non-Sequential Data: A Canonical Polyadic
Decomposition Approach [85.12934750565971]
Efficient modelling of feature interactions underpins supervised learning for non-sequential tasks.
To alleviate this issue, it has been proposed to implicitly represent the model parameters as a tensor.
For enhanced expressiveness, we generalize the framework to allow feature mapping to arbitrarily high-dimensional feature vectors.
arXiv Detail & Related papers (2020-01-27T22:38:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.