AI enhanced data assimilation and uncertainty quantification applied to
Geological Carbon Storage
- URL: http://arxiv.org/abs/2402.06110v1
- Date: Fri, 9 Feb 2024 00:24:46 GMT
- Title: AI enhanced data assimilation and uncertainty quantification applied to
Geological Carbon Storage
- Authors: G. S. Seabra (1, 2), N. T. M\"ucke (3, 4), V. L. S. Silva (2, 5), D.
Voskov (1, 6), F. Vossepoel (1) ((1) TU Delft, Netherlands, (2) Petrobras,
Brazil, (3) Centrum Wiskunde & Informatica, Netherlands, (4) Utrecht
University, Netherlands, (5) Imperial College London, United Kingdom, (6)
Stanford University, USA)
- Abstract summary: We introduce the Surrogate-based hybrid ESMDA (SH-ESMDA), an adaptation of the traditional Ensemble Smoother with Multiple Data Assimilation (ESMDA)
We also introduce Surrogate-based Hybrid RML (SH-RML), a variational data assimilation approach that relies on the randomized maximum likelihood (RML)
Our comparative analyses show that SH-RML offers better uncertainty compared to conventional ESMDA for the case study.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This study investigates the integration of machine learning (ML) and data
assimilation (DA) techniques, focusing on implementing surrogate models for
Geological Carbon Storage (GCS) projects while maintaining high fidelity
physical results in posterior states. Initially, we evaluate the surrogate
modeling capability of two distinct machine learning models, Fourier Neural
Operators (FNOs) and Transformer UNet (T-UNet), in the context of CO$_2$
injection simulations within channelized reservoirs. We introduce the
Surrogate-based hybrid ESMDA (SH-ESMDA), an adaptation of the traditional
Ensemble Smoother with Multiple Data Assimilation (ESMDA). This method uses
FNOs and T-UNet as surrogate models and has the potential to make the standard
ESMDA process at least 50% faster or more, depending on the number of
assimilation steps. Additionally, we introduce Surrogate-based Hybrid RML
(SH-RML), a variational data assimilation approach that relies on the
randomized maximum likelihood (RML) where both the FNO and the T-UNet enable
the computation of gradients for the optimization of the objective function,
and a high-fidelity model is employed for the computation of the posterior
states. Our comparative analyses show that SH-RML offers better uncertainty
quantification compared to conventional ESMDA for the case study.
Related papers
- Likelihood-Free Inference and Hierarchical Data Assimilation for Geological Carbon Storage [0.0]
We develop a hierarchical data assimilation framework for carbon storage.
Uses Monte Carlo-based approximate Bayesian computation.
Reduces computational costs by using a 3D recurrent R-U-Net deep-learning surrogate model.
arXiv Detail & Related papers (2024-10-20T06:15:56Z) - Geodesic Optimization for Predictive Shift Adaptation on EEG data [53.58711912565724]
Domain adaptation methods struggle when distribution shifts occur simultaneously in $X$ and $y$.
This paper proposes a novel method termed Geodesic Optimization for Predictive Shift Adaptation (GOPSA) to address test-time multi-source DA.
GOPSA has the potential to combine the advantages of mixed-effects modeling with machine learning for biomedical applications of EEG.
arXiv Detail & Related papers (2024-07-04T12:15:42Z) - A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics [73.35846234413611]
In drug discovery, molecular dynamics (MD) simulation provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites.
We propose NeuralMD, the first machine learning (ML) surrogate that can facilitate numerical MD and provide accurate simulations in protein-ligand binding dynamics.
We demonstrate the efficiency and effectiveness of NeuralMD, achieving over 1K$times$ speedup compared to standard numerical MD simulations.
arXiv Detail & Related papers (2024-01-26T09:35:17Z) - Online Variational Sequential Monte Carlo [49.97673761305336]
We build upon the variational sequential Monte Carlo (VSMC) method, which provides computationally efficient and accurate model parameter estimation and Bayesian latent-state inference.
Online VSMC is capable of performing efficiently, entirely on-the-fly, both parameter estimation and particle proposal adaptation.
arXiv Detail & Related papers (2023-12-19T21:45:38Z) - Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood [64.95663299945171]
Training energy-based models (EBMs) on high-dimensional data can be both challenging and time-consuming.
There exists a noticeable gap in sample quality between EBMs and other generative frameworks like GANs and diffusion models.
We propose cooperative diffusion recovery likelihood (CDRL), an effective approach to tractably learn and sample from a series of EBMs.
arXiv Detail & Related papers (2023-09-10T22:05:24Z) - Heterogeneous Multi-Task Gaussian Cox Processes [61.67344039414193]
We present a novel extension of multi-task Gaussian Cox processes for modeling heterogeneous correlated tasks jointly.
A MOGP prior over the parameters of the dedicated likelihoods for classification, regression and point process tasks can facilitate sharing of information between heterogeneous tasks.
We derive a mean-field approximation to realize closed-form iterative updates for estimating model parameters.
arXiv Detail & Related papers (2023-08-29T15:01:01Z) - Bayesian tomography using polynomial chaos expansion and deep generative
networks [0.0]
We present a strategy combining the excellent reconstruction performances of a variational autoencoder (VAE) with the accuracy of PCA-PCE surrogate modeling.
Within the MCMC process, the parametrization of the VAE is leveraged for prior exploration and sample proposals.
arXiv Detail & Related papers (2023-07-09T16:44:37Z) - Adaptive Latent Factor Analysis via Generalized Momentum-Incorporated
Particle Swarm Optimization [6.2303427193075755]
A gradient descent (SGD) algorithm is an effective learning strategy to build a latent factor analysis (LFA) model on a high-dimensional and incomplete (HDI) matrix.
A particle swarm optimization (PSO) algorithm is commonly adopted to make an SGD-based LFA model's hyper- parameters, i.e., learning rate and regularization coefficient, self-adaptation.
This paper incorporates more historical information into each particle's evolutionary process for avoiding premature convergence.
arXiv Detail & Related papers (2022-08-04T03:15:07Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - Generalised Latent Assimilation in Heterogeneous Reduced Spaces with
Machine Learning Surrogate Models [10.410970649045943]
We develop a system which combines reduced-order surrogate models with a novel data assimilation technique.
Generalised Latent Assimilation can benefit both the efficiency provided by the reduced-order modelling and the accuracy of data assimilation.
arXiv Detail & Related papers (2022-04-07T15:13:12Z) - Active Learning Methods for Efficient Hybrid Biophysical Variable
Retrieval [6.093845877765489]
Kernel-based machine learning regression algorithms (MLRAs) are potentially powerful methods for operational biophysical variable retrieval schemes.
They face difficulties in coping with large training datasets.
Active learning (AL) methods enable to select the most informative samples in a dataset.
This letter introduces six AL methods for achieving optimized biophysical variable estimation with a manageable training dataset.
arXiv Detail & Related papers (2020-12-07T08:56:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.