PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation with Devito
- URL: http://arxiv.org/abs/2411.12636v1
- Date: Tue, 19 Nov 2024 16:49:58 GMT
- Title: PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation with Devito
- Authors: Pascal Tribel, Gianluca Bontempi,
- Abstract summary: PyAWD is a Python library designed to generate high-dimensional synthetic-acoustic wave propagation datasets.
By allowing fine control over parameters such as wave speed, external forces, spatial and temporal discretization, PyAWD enables the creation of ML-scale datasets.
- Score: 0.1657441317977376
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Seismic data is often sparse and unevenly distributed due to the high costs and logistical challenges associated with deploying physical seismometers, limiting the application of Machine Learning (ML) in earthquake analysis. To address this gap, we introduce PyAWD, a Python library designed to generate high-resolution synthetic datasets simulating spatio-temporal acoustic wave propagation in both two-dimensional and three-dimensional heterogeneous media. By allowing fine control over parameters such as wave speed, external forces, spatial and temporal discretization, and media composition, PyAWD enables the creation of ML-scale datasets that capture the complexity of seismic wave behavior. We illustrate the library's potential with an epicenter retrieval task, showcasing its suitability for designing complex, accurate seismic problems that support advanced ML approaches in the absence or lack of dense real-world data.
Related papers
- A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset [8.453075713579631]
We present UB-Diff'', a novel diffusion model for multi-modal paired scientific data generation.
One major innovation is a one-in-two-out encoder-decoder network structure, which can ensure pairwise data is obtained from a co-latent representation.
Experimental results on the OpenFWI dataset show that UB-Diff significantly outperforms existing techniques in terms of Fr'echet Inception Distance (FID) score and pairwise evaluation.
arXiv Detail & Related papers (2025-01-01T19:49:38Z) - The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning [4.812580392361432]
Well is a large-scale collection of numerical simulations of a wide variety of physical systems.
These datasets can be used individually or as part of a broader benchmark suite.
We provide a unified PyTorch interface for training and evaluating models.
arXiv Detail & Related papers (2024-11-30T19:42:14Z) - High Resolution Seismic Waveform Generation using Denoising Diffusion [3.5046784866523932]
This study introduces a novel, efficient, and scalable generative model for high-frequency seismic waveform generation.
A spectrogram representation of seismic waveform data is reduced to a lower-dimensional submanifold via an autoencoder.
A state-of-the-art diffusion model is trained to generate this latent representation, conditioned on key input parameters.
arXiv Detail & Related papers (2024-10-25T07:01:48Z) - LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free
Environment [59.320414108383055]
We present LiveHPS, a novel single-LiDAR-based approach for scene-level human pose and shape estimation.
We propose a huge human motion dataset, named FreeMotion, which is collected in various scenarios with diverse human poses.
arXiv Detail & Related papers (2024-02-27T03:08:44Z) - IntraSeismic: a coordinate-based learning approach to seismic inversion [14.625250755761662]
IntraSeismic is a novel hybrid seismic inversion method that seamlessly combines coordinate-based learning with the physics of the post-stack modeling operator.
Key features of IntraSeismic are unparalleled performance in 2D and 3D post-stack seismic inversion, rapid convergence rates, and ability to seamlessly include hard constraints.
Synthetic and field data applications of IntraSeismic are presented to validate the effectiveness of the proposed method.
arXiv Detail & Related papers (2023-12-17T00:29:25Z) - Closing the loop: Autonomous experiments enabled by
machine-learning-based online data analysis in synchrotron beamline
environments [80.49514665620008]
Machine learning can be used to enhance research involving large or rapidly generated datasets.
In this study, we describe the incorporation of ML into a closed-loop workflow for X-ray reflectometry (XRR)
We present solutions that provide an elementary data analysis in real time during the experiment without introducing the additional software dependencies in the beamline control software environment.
arXiv Detail & Related papers (2023-06-20T21:21:19Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Wavelet-based Loss for High-frequency Interface Dynamics [30.246577036044332]
We present a new method based on a wavelet loss formulation, which remains transparent in terms of what is optimized.
We show that our method can successfully reconstruct high-frequency details in an illustrative synthetic test case.
We also evaluate the performance when applied to more complex surfaces based on physical simulations.
arXiv Detail & Related papers (2022-09-06T09:09:14Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - Physics-informed deep-learning applications to experimental fluid
mechanics [2.992602379681373]
High-resolution reconstruction of flow-field data from low-resolution and noisy measurements is of interest in experimental fluid mechanics.
Deep-learning approaches have been shown suitable for such super-resolution tasks.
In this study, we apply physics-informed neural networks (PINNs) for super-resolution of flow-field data in time and space.
arXiv Detail & Related papers (2022-03-29T09:58:30Z) - Mixed Effects Neural ODE: A Variational Approximation for Analyzing the
Dynamics of Panel Data [50.23363975709122]
We propose a probabilistic model called ME-NODE to incorporate (fixed + random) mixed effects for analyzing panel data.
We show that our model can be derived using smooth approximations of SDEs provided by the Wong-Zakai theorem.
We then derive Evidence Based Lower Bounds for ME-NODE, and develop (efficient) training algorithms.
arXiv Detail & Related papers (2022-02-18T22:41:51Z) - Dynamic Network-Assisted D2D-Aided Coded Distributed Learning [59.29409589861241]
We propose a novel device-to-device (D2D)-aided coded federated learning method (D2D-CFL) for load balancing across devices.
We derive an optimal compression rate for achieving minimum processing time and establish its connection with the convergence time.
Our proposed method is beneficial for real-time collaborative applications, where the users continuously generate training data.
arXiv Detail & Related papers (2021-11-26T18:44:59Z) - Machine learning-based porosity estimation from spectral decomposed
seismic data [0.0]
Estimating porosity models via seismic data is challenging due to the signal noise and insufficient resolution of seismic data.
We propose a machine learning-based workflow to convert seismic data to porosity models.
arXiv Detail & Related papers (2021-11-23T00:03:38Z) - Deep Cellular Recurrent Network for Efficient Analysis of Time-Series
Data with Spatial Information [52.635997570873194]
This work proposes a novel deep cellular recurrent neural network (DCRNN) architecture to process complex multi-dimensional time series data with spatial information.
The proposed architecture achieves state-of-the-art performance while utilizing substantially less trainable parameters when compared to comparable methods in the literature.
arXiv Detail & Related papers (2021-01-12T20:08:18Z) - Scaling through abstractions -- high-performance vectorial wave
simulations for seismic inversion with Devito [0.6745502291821955]
Devito is an open-source Python project based on domain-specific language and compiler technology.
This article presents the generation and simulation of MPI-parallel propagators for the pseudo-acoustic wave-equation in tilted transverse isotropic media.
arXiv Detail & Related papers (2020-04-22T12:20:07Z) - SeismiQB -- a novel framework for deep learning with seismic data [62.997667081978825]
We've developed an open-sourced Python framework with emphasis on working with neural networks.
It provides convenient tools for fast loading seismic cubes in multiple data formats.
It also generates crops of desired shape and augmenting them with various transformations.
arXiv Detail & Related papers (2020-01-10T10:45:56Z) - Temporal-Spatial Neural Filter: Direction Informed End-to-End
Multi-channel Target Speech Separation [66.46123655365113]
Target speech separation refers to extracting the target speaker's speech from mixed signals.
Two main challenges are the complex acoustic environment and the real-time processing requirement.
We propose a temporal-spatial neural filter, which directly estimates the target speech waveform from multi-speaker mixture.
arXiv Detail & Related papers (2020-01-02T11:12:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.