Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring
- URL: http://arxiv.org/abs/2505.17782v1
- Date: Fri, 23 May 2025 11:55:41 GMT
- Title: Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring
- Authors: Nikolas Papadopoulos, Nikolaos Ioannis Bountos, Maria Sdraka, Andreas Karavias, Ioannis Papoutsis,
- Abstract summary: Hephaestus Minicubes is a global collection of 38temporal datacubes covering 44 of the world's most active volcanoes over a 7-year period.<n>Eachtemporal datacube integrates InSAR products, topographic data, as well as atmospheric variables which are known to introduce signal delays that mimic ground deformation in InSAR imagery.<n>We provide expert annotations detailing the type, intensity and spatial extent of deformation events, along with rich text descriptions of the observed scenes.
- Score: 2.6214349237099173
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Ground deformation is regarded in volcanology as a key precursor signal preceding volcanic eruptions. Satellite-based Interferometric Synthetic Aperture Radar (InSAR) enables consistent, global-scale deformation tracking; however, deep learning methods remain largely unexplored in this domain, mainly due to the lack of a curated machine learning dataset. In this work, we build on the existing Hephaestus dataset, and introduce Hephaestus Minicubes, a global collection of 38 spatiotemporal datacubes offering high resolution, multi-source and multi-temporal information, covering 44 of the world's most active volcanoes over a 7-year period. Each spatiotemporal datacube integrates InSAR products, topographic data, as well as atmospheric variables which are known to introduce signal delays that can mimic ground deformation in InSAR imagery. Furthermore, we provide expert annotations detailing the type, intensity and spatial extent of deformation events, along with rich text descriptions of the observed scenes. Finally, we present a comprehensive benchmark, demonstrating Hephaestus Minicubes' ability to support volcanic unrest monitoring as a multi-modal, multi-temporal classification and semantic segmentation task, establishing strong baselines with state-of-the-art architectures. This work aims to advance machine learning research in volcanic monitoring, contributing to the growing integration of data-driven methods within Earth science applications.
Related papers
- ReconMOST: Multi-Layer Sea Temperature Reconstruction with Observations-Guided Diffusion [48.540756751934836]
ReconMOST is a data-driven guided diffusion model framework for multi-layer sea temperature reconstruction.<n>Our method extends ML-based SST reconstruction to a global, multi-layer setting, handling over 92.5% missing data.
arXiv Detail & Related papers (2025-06-12T06:27:22Z) - TerraMesh: A Planetary Mosaic of Multimodal Earth Observation Data [3.674991996196602]
TerraMesh is a new globally diverse, multimodal dataset combining optical, radar, elevation, aperture and land-ready modalities in a Data-Ready format.<n>We provide detailed data processing steps, comprehensive statistics, and empirical evidence demonstrating improved model performance when pre-trained on TerraMesh.
arXiv Detail & Related papers (2025-04-15T13:20:35Z) - Foundation Models for Spatio-Temporal Data Science: A Tutorial and Survey [69.0648659029394]
Spatio-Temporal (ST) data science is fundamental to understanding complex systems in domains such as urban computing, climate science, and intelligent transportation.<n>Researchers have begun exploring the concept of Spatio-Temporal Foundation Models (STFMs) to enhance adaptability and generalization across diverse ST tasks.<n>STFMs empower the entire workflow of ST data science, ranging from data sensing, management, to mining, thereby offering a more holistic and scalable approach.
arXiv Detail & Related papers (2025-03-12T09:42:18Z) - EarthView: A Large Scale Remote Sensing Dataset for Self-Supervision [72.84868704100595]
This paper presents a dataset specifically designed for self-supervision on remote sensing data, intended to enhance deep learning applications on Earth monitoring tasks.<n>The dataset spans 15 tera pixels of global remote-sensing data, combining imagery from a diverse range of sources, including NEON, Sentinel, and a novel release of 1m spatial resolution data from Satellogic.<n>Accompanying the dataset is EarthMAE, a tailored Masked Autoencoder developed to tackle the distinct challenges of remote sensing data.
arXiv Detail & Related papers (2025-01-14T13:42:22Z) - A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models [0.0]
In volcano monitoring, effective recognition of seismic events is essential for understanding volcanic activity and raising timely warning alerts.
This study introduces a novel approach that utilizes Semantic models to automate seismic event recognition by applying a straight forward transformation of multi-channel 1D signals into 2D representations.
Our framework integrates multi-station seismic data with minimal preprocessing, performing both detection and classification simultaneously for five seismic event classes.
arXiv Detail & Related papers (2024-10-27T21:02:37Z) - Anomaly detection for the identification of volcanic unrest in satellite imagery [2.642212767247493]
This paper explores the use of unsupervised deep learning on satellite data for the purpose of identifying volcanic deformation as anomalies.
Our detector is based on Patch Distribution Modeling (PaDiM), and the detection performance is enhanced with a weighted distance.
arXiv Detail & Related papers (2024-05-28T18:00:10Z) - Observation-Guided Meteorological Field Downscaling at Station Scale: A
Benchmark and a New Method [66.80344502790231]
We extend meteorological downscaling to arbitrary scattered station scales and establish a new benchmark and dataset.
Inspired by data assimilation techniques, we integrate observational data into the downscaling process, providing multi-scale observational priors.
Our proposed method outperforms other specially designed baseline models on multiple surface variables.
arXiv Detail & Related papers (2024-01-22T14:02:56Z) - Multi-Modal Learning-based Reconstruction of High-Resolution Spatial
Wind Speed Fields [46.72819846541652]
We propose a framework based on Vari Data Assimilation and Deep Learning concepts.
This framework is applied to recover rich-in-time, high-resolution information on sea surface wind speed.
arXiv Detail & Related papers (2023-12-14T13:40:39Z) - Large Models for Time Series and Spatio-Temporal Data: A Survey and
Outlook [95.32949323258251]
Temporal data, notably time series andtemporal-temporal data, are prevalent in real-world applications.
Recent advances in large language and other foundational models have spurred increased use in time series andtemporal data mining.
arXiv Detail & Related papers (2023-10-16T09:06:00Z) - Federated Prompt Learning for Weather Foundation Models on Devices [37.88417074427373]
On-device intelligence for weather forecasting uses local deep learning models to analyze weather patterns without centralized cloud computing.
This paper propose Federated Prompt Learning for Weather Foundation Models on Devices (FedPoD)
FedPoD enables devices to obtain highly customized models while maintaining communication efficiency.
arXiv Detail & Related papers (2023-05-23T16:59:20Z) - Hephaestus: A large scale multitask dataset towards InSAR understanding [1.8350044465969417]
In this work, we put the effort to create and make available the first of its kind, manually annotated InSAR dataset.
The dataset consists of 19,919 individual Sentinel-1 interferograms acquired over 44 different volcanoes globally, which are split into 216,106 InSAR patches.
It is designed to address different computer vision problems, including volcano state classification, semantic segmentation of ground deformation, detection and classification of atmospheric signals.
arXiv Detail & Related papers (2022-04-20T12:58:18Z) - Improving the Thermal Infrared Monitoring of Volcanoes: A Deep Learning
Approach for Intermittent Image Series [0.0]
We show that a proposed architecture can forecast volcanic temperature imagery with the lowest RMSE ($4.164circ$C, other methods: $4.217-5.291circ$C)
We also examined performance on multiple time series derived from the thermal imagery and the effect of training with data from singular volcanoes.
This work highlights the potential of data-driven deep learning models for volcanic unrest forecasting.
arXiv Detail & Related papers (2021-09-27T02:31:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.