Making Invisible Visible: Data-Driven Seismic Inversion with
Physics-Informed Data Augmentation
- URL: http://arxiv.org/abs/2106.11892v2
- Date: Wed, 23 Jun 2021 14:06:02 GMT
- Title: Making Invisible Visible: Data-Driven Seismic Inversion with
Physics-Informed Data Augmentation
- Authors: Yuxin Yang, Xitong Zhang, Qiang Guan, Youzuo Lin
- Abstract summary: We develop new physics-informed data augmentation techniques based on convolutional neural networks.
Specifically, our generative models leverage different physics knowledge (such as governing equations, observable perception, and physics phenomena) to improve the quality of the synthetic data.
We show that data-driven seismic imaging can be significantly enhanced by using our physics-informed data augmentation techniques.
- Score: 6.079137591620588
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning and data-driven approaches have shown great potential in
scientific domains. The promise of data-driven techniques relies on the
availability of a large volume of high-quality training datasets. Due to the
high cost of obtaining data through expensive physical experiments,
instruments, and simulations, data augmentation techniques for scientific
applications have emerged as a new direction for obtaining scientific data
recently. However, existing data augmentation techniques originating from
computer vision, yield physically unacceptable data samples that are not
helpful for the domain problems that we are interested in. In this paper, we
develop new physics-informed data augmentation techniques based on
convolutional neural networks. Specifically, our generative models leverage
different physics knowledge (such as governing equations, observable
perception, and physics phenomena) to improve the quality of the synthetic
data. To validate the effectiveness of our data augmentation techniques, we
apply them to solve a subsurface seismic full-waveform inversion using
simulated CO$_2$ leakage data. Our interest is to invert for subsurface
velocity models associated with very small CO$_2$ leakage. We validate the
performance of our methods using comprehensive numerical tests. Via comparison
and analysis, we show that data-driven seismic imaging can be significantly
enhanced by using our physics-informed data augmentation techniques.
Particularly, the imaging quality has been improved by 15% in test scenarios of
general-sized leakage and 17% in small-sized leakage when using an augmented
training set obtained with our techniques.
Related papers
- Research and Implementation of Data Enhancement Techniques for Graph Neural Networks [10.575426305555538]
In practical engineering applications, some data are affected by the conditions under which more data cannot be obtained or the cost of obtaining data is too high.
This paper firstly analyses the key points of the data enhancement technology of graph neural network, and at the same time introduces the composition of graph neural network in depth.
arXiv Detail & Related papers (2024-06-18T14:07:38Z) - Post-training Model Quantization Using GANs for Synthetic Data
Generation [57.40733249681334]
We investigate the use of synthetic data as a substitute for the calibration with real data for the quantization method.
We compare the performance of models quantized using data generated by StyleGAN2-ADA and our pre-trained DiStyleGAN, with quantization using real data and an alternative data generation method based on fractal images.
arXiv Detail & Related papers (2023-05-10T11:10:09Z) - Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions [57.30984060215482]
We provide a background of data augmentation, a novel and comprehensive taxonomy of reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique.
We also provide comprehensive results of the data augmentation effect on three popular computer vision tasks, such as image classification, object detection and semantic segmentation.
arXiv Detail & Related papers (2023-01-07T11:37:32Z) - Minimizing the Accumulated Trajectory Error to Improve Dataset
Distillation [151.70234052015948]
We propose a novel approach that encourages the optimization algorithm to seek a flat trajectory.
We show that the weights trained on synthetic data are robust against the accumulated errors perturbations with the regularization towards the flat trajectory.
Our method, called Flat Trajectory Distillation (FTD), is shown to boost the performance of gradient-matching methods by up to 4.7%.
arXiv Detail & Related papers (2022-11-20T15:49:11Z) - An Adversarial Active Sampling-based Data Augmentation Framework for
Manufacturable Chip Design [55.62660894625669]
Lithography modeling is a crucial problem in chip design to ensure a chip design mask is manufacturable.
Recent developments in machine learning have provided alternative solutions in replacing the time-consuming lithography simulations with deep neural networks.
We propose a litho-aware data augmentation framework to resolve the dilemma of limited data and improve the machine learning model performance.
arXiv Detail & Related papers (2022-10-27T20:53:39Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - The Bearable Lightness of Big Data: Towards Massive Public Datasets in
Scientific Machine Learning [0.0]
We show that lossy compression algorithms offer a realistic pathway for exposing high-fidelity scientific data to open-source data repositories.
In this paper, we outline, construct, and evaluate the requirements for establishing a big data framework.
arXiv Detail & Related papers (2022-07-25T21:44:53Z) - The Imaginative Generative Adversarial Network: Automatic Data
Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action
Recognition [27.795763107984286]
We present a novel automatic data augmentation model, which approximates the distribution of the input data and samples new data from this distribution.
Our results show that the augmentation strategy is fast to train and can improve classification accuracy for both neural networks and state-of-the-art methods.
arXiv Detail & Related papers (2021-05-27T11:07:09Z) - Physics-Consistent Data-driven Waveform Inversion with Adaptive Data
Augmentation [12.564534712461331]
We develop a new hybrid computational approach to solve full-waveform inversion (FWI)
We develop a data augmentation strategy that can improve the representativity of the training set.
We apply our method to synthetic elastic seismic waveform data generated from a subsurface geologic model built on a carbon sequestration site at Kimberlina, California.
arXiv Detail & Related papers (2020-09-03T17:12:55Z) - Complex Wavelet SSIM based Image Data Augmentation [0.0]
We look at the MNIST handwritten dataset an image dataset used for digit recognition.
We take a detailed look into one of the most popular augmentation techniques used for this data set elastic deformation.
We propose to use a similarity measure called Complex Wavelet Structural Similarity Index Measure (CWSSIM) to selectively filter out the irrelevant data.
arXiv Detail & Related papers (2020-07-11T21:11:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.