OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic
Full Waveform Inversion
- URL: http://arxiv.org/abs/2111.02926v6
- Date: Sat, 24 Jun 2023 00:02:32 GMT
- Title: OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic
Full Waveform Inversion
- Authors: Chengyuan Deng, Shihang Feng, Hanchen Wang, Xitong Zhang, Peng Jin,
Yinan Feng, Qili Zeng, Yinpeng Chen, Youzuo Lin
- Abstract summary: Full waveform inversion (FWI) is widely used in geophysics to reconstruct high-resolution velocity maps from seismic data.
Recent success of data-driven FWI methods results in a rapidly increasing demand for open datasets to serve the geophysics community.
We present OpenFWI, a collection of large-scale multi-structural benchmark datasets.
- Score: 16.117689670474142
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Full waveform inversion (FWI) is widely used in geophysics to reconstruct
high-resolution velocity maps from seismic data. The recent success of
data-driven FWI methods results in a rapidly increasing demand for open
datasets to serve the geophysics community. We present OpenFWI, a collection of
large-scale multi-structural benchmark datasets, to facilitate diversified,
rigorous, and reproducible research on FWI. In particular, OpenFWI consists of
12 datasets (2.1TB in total) synthesized from multiple sources. It encompasses
diverse domains in geophysics (interface, fault, CO2 reservoir, etc.), covers
different geological subsurface structures (flat, curve, etc.), and contains
various amounts of data samples (2K - 67K). It also includes a dataset for 3D
FWI. Moreover, we use OpenFWI to perform benchmarking over four deep learning
methods, covering both supervised and unsupervised learning regimes. Along with
the benchmarks, we implement additional experiments, including physics-driven
methods, complexity analysis, generalization study, uncertainty quantification,
and so on, to sharpen our understanding of datasets and methods. The studies
either provide valuable insights into the datasets and the performance, or
uncover their current limitations. We hope OpenFWI supports prospective
research on FWI and inspires future open-source efforts on AI for science. All
datasets and related information can be accessed through our website at
https://openfwi-lanl.github.io/
Related papers
- Inversion-DeepONet: A Novel DeepONet-Based Network with Encoder-Decoder for Full Waveform Inversion [28.406887976413845]
We propose a novel deep operator network (DeepONet) architecture Inversion-DeepONet for full waveform inversion (FWI)
We utilize convolutional neural network (CNN) to extract the features from seismic data in branch net.
We confirm the superior performance on accuracy and generalization ability of our network, compared with existing data-driven FWI methods.
arXiv Detail & Related papers (2024-08-15T08:15:06Z) - Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV &
CribsTV [50.616892315086574]
This paper proposes two novel datasets: SlowTV and CribsTV.
These are large-scale datasets curated from publicly available YouTube videos, containing a total of 2M training frames.
We leverage these datasets to tackle the challenging task of zero-shot generalization.
arXiv Detail & Related papers (2024-03-03T17:29:03Z) - $\mathbf{\mathbb{E}^{FWI}}$: Multi-parameter Benchmark Datasets for
Elastic Full Waveform Inversion of Geophysical Properties [27.94139165494327]
$mathbfmathbbEFWI$ is a comprehensive benchmark dataset for elastic full waveform inversion.
$mathbfmathbbEFWI$ encompasses 8 distinct datasets that cover diverse subsurface geologic structures.
The relation between P- and S-wave velocities provides crucial insights into the subsurface properties.
arXiv Detail & Related papers (2023-06-21T17:11:35Z) - infoVerse: A Universal Framework for Dataset Characterization with
Multidimensional Meta-information [68.76707843019886]
infoVerse is a universal framework for dataset characterization.
infoVerse captures multidimensional characteristics of datasets by incorporating various model-driven meta-information.
In three real-world applications (data pruning, active learning, and data annotation), the samples chosen on infoVerse space consistently outperform strong baselines.
arXiv Detail & Related papers (2023-05-30T18:12:48Z) - Argoverse 2: Next Generation Datasets for Self-Driving Perception and
Forecasting [64.7364925689825]
Argoverse 2 (AV2) is a collection of three datasets for perception and forecasting research in the self-driving domain.
The Lidar dataset contains 20,000 sequences of unlabeled lidar point clouds and map-aligned pose.
The Motion Forecasting dataset contains 250,000 scenarios mined for interesting and challenging interactions between the autonomous vehicle and other actors in each local scene.
arXiv Detail & Related papers (2023-01-02T00:36:22Z) - FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in
Realistic Healthcare Settings [51.09574369310246]
Federated Learning (FL) is a novel approach enabling several clients holding sensitive data to collaboratively train machine learning models.
We propose a novel cross-silo dataset suite focused on healthcare, FLamby, to bridge the gap between theory and practice of cross-silo FL.
Our flexible and modular suite allows researchers to easily download datasets, reproduce results and re-use the different components for their research.
arXiv Detail & Related papers (2022-10-10T12:17:30Z) - Does it work outside this benchmark? Introducing the Rigid Depth
Constructor tool, depth validation dataset construction in rigid scenes for
the masses [1.294486861344922]
We present a protocol to construct your own depth validation dataset for navigation.
RDC for Rigid Depth Constructor aims at being more accessible and cheaper than already existing techniques.
We also develop a test suite to get insightful information from the evaluated algorithm.
arXiv Detail & Related papers (2021-03-29T22:01:24Z) - OpenGF: An Ultra-Large-Scale Ground Filtering Dataset Built Upon Open
ALS Point Clouds Around the World [22.577705573226257]
We present OpenGF, first Ultra-Large-Scale Ground Filtering dataset.
OpenGF contains more than half a billion finely labeled ground and non-ground points.
We evaluate the performance of state-of-the-art rule-based algorithms and 3D semantic segmentation networks on our dataset.
arXiv Detail & Related papers (2021-01-24T04:07:35Z) - Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset,
Benchmarks and Challenges [52.624157840253204]
We present an urban-scale photogrammetric point cloud dataset with nearly three billion richly annotated points.
Our dataset consists of large areas from three UK cities, covering about 7.6 km2 of the city landscape.
We evaluate the performance of state-of-the-art algorithms on our dataset and provide a comprehensive analysis of the results.
arXiv Detail & Related papers (2020-09-07T14:47:07Z) - Robust Image Retrieval-based Visual Localization using Kapture [10.249293519246478]
We present a versatile pipeline for visual localization that facilitates the use of different local and global features.
We evaluate our methods on eight public datasets where they rank top on all and first on many of them.
To foster future research, we release code, models, and all datasets used in this paper in the kapture format open source under a permissive BSD license.
arXiv Detail & Related papers (2020-07-27T21:10:35Z) - Open Graph Benchmark: Datasets for Machine Learning on Graphs [86.96887552203479]
We present the Open Graph Benchmark (OGB) to facilitate scalable, robust, and reproducible graph machine learning (ML) research.
OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains.
For each dataset, we provide a unified evaluation protocol using meaningful application-specific data splits and evaluation metrics.
arXiv Detail & Related papers (2020-05-02T03:09:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.