Digital Typhoon: Long-term Satellite Image Dataset for the
Spatio-Temporal Modeling of Tropical Cyclones
- URL: http://arxiv.org/abs/2311.02665v1
- Date: Sun, 5 Nov 2023 14:22:13 GMT
- Title: Digital Typhoon: Long-term Satellite Image Dataset for the
Spatio-Temporal Modeling of Tropical Cyclones
- Authors: Asanobu Kitamoto and Jared Hwang and Bastien Vuillod and Lucas Gautier
and Yingtao Tian and Tarin Clanuwat
- Abstract summary: This paper presents the official release of the longest typhoon satellite image dataset for 40+ years.
It is aimed at benchmarking machine learning models for long-term-temporal data.
The dataset is publicly available at http://agora.nii.ac.jp/digital-typhoon/.
- Score: 0.907599024697789
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents the official release of the Digital Typhoon dataset, the
longest typhoon satellite image dataset for 40+ years aimed at benchmarking
machine learning models for long-term spatio-temporal data. To build the
dataset, we developed a workflow to create an infrared typhoon-centered image
for cropping using Lambert azimuthal equal-area projection referring to the
best track data. We also address data quality issues such as inter-satellite
calibration to create a homogeneous dataset. To take advantage of the dataset,
we organized machine learning tasks by the types and targets of inference, with
other tasks for meteorological analysis, societal impact, and climate change.
The benchmarking results on the analysis, forecasting, and reanalysis for the
intensity suggest that the dataset is challenging for recent deep learning
models, due to many choices that affect the performance of various models. This
dataset reduces the barrier for machine learning researchers to meet
large-scale real-world events called tropical cyclones and develop machine
learning models that may contribute to advancing scientific knowledge on
tropical cyclones as well as solving societal and sustainability issues such as
disaster reduction and climate change. The dataset is publicly available at
http://agora.ex.nii.ac.jp/digital-typhoon/dataset/ and
https://github.com/kitamoto-lab/digital-typhoon/.
Related papers
- EarthView: A Large Scale Remote Sensing Dataset for Self-Supervision [72.84868704100595]
This paper presents a dataset specifically designed for self-supervision on remote sensing data, intended to enhance deep learning applications on Earth monitoring tasks.
The dataset spans 15 tera pixels of global remote-sensing data, combining imagery from a diverse range of sources, including NEON, Sentinel, and a novel release of 1m spatial resolution data from Satellogic.
Accompanying the dataset is EarthMAE, a tailored Masked Autoencoder developed to tackle the distinct challenges of remote sensing data.
arXiv Detail & Related papers (2025-01-14T13:42:22Z) - WxC-Bench: A Novel Dataset for Weather and Climate Downstream Tasks [1.0369983700531806]
High-quality machine learning (ML)-ready datasets play a foundational role in developing new artificial intelligence (AI) models.
Here we introduce WxC-Bench, a multi-modal dataset designed to support the development of generalizable AI models.
We provide a comprehensive description of the dataset and also present a technical validation for baseline analysis.
arXiv Detail & Related papers (2024-12-03T19:20:27Z) - Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks [0.30723404270319693]
This paper presents a new version of the longest typhoon satellite image dataset for 40+ years aimed at machine learning models for long-term-temporal data.
The new addition in dataset V2 is tropical cyclone data from the southern hemisphere.
Having data from two hemispheres allows us to ask new research questions about regional differences across basins and hemispheres.
arXiv Detail & Related papers (2024-11-25T14:25:39Z) - Deep Learning for Weather Forecasting: A CNN-LSTM Hybrid Model for Predicting Historical Temperature Data [7.559331742876793]
This study introduces a hybrid model combining Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks to predict historical temperature data.
CNNs are utilized for spatial feature extraction, while LSTMs handle temporal dependencies, resulting in significantly improved prediction accuracy and stability.
arXiv Detail & Related papers (2024-10-19T03:38:53Z) - Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling [55.13352174687475]
This paper proposes a physics-AI hybrid model (i.e., WeatherGFT) which generalizes weather forecasts to finer-grained temporal scales beyond training dataset.
Specifically, we employ a carefully designed PDE kernel to simulate physical evolution on a small time scale.
We also introduce a lead time-aware training framework to promote the generalization of the model at different lead times.
arXiv Detail & Related papers (2024-05-22T16:21:02Z) - Learning Robust Precipitation Forecaster by Temporal Frame Interpolation [65.5045412005064]
We develop a robust precipitation forecasting model that demonstrates resilience against spatial-temporal discrepancies.
Our approach has led to significant improvements in forecasting precision, culminating in our model securing textit1st place in the transfer learning leaderboard of the textitWeather4cast'23 competition.
arXiv Detail & Related papers (2023-11-30T08:22:08Z) - A Distributed Approach to Meteorological Predictions: Addressing Data
Imbalance in Precipitation Prediction Models through Federated Learning and
GANs [0.0]
classification of weather data involves categorizing meteorological phenomena into classes, thereby facilitating nuanced analyses and precise predictions.
It's imperative that classification algorithms proficiently navigate challenges such as data imbalances.
Data augmentation techniques can improve the model's accuracy in classifying rare but critical weather events.
arXiv Detail & Related papers (2023-10-19T21:28:20Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z) - Unleashing Realistic Air Quality Forecasting: Introducing the
Ready-to-Use PurpleAirSF Dataset [4.190243190157989]
This paper introduces PurpleAirSF, a comprehensive and easily accessible dataset from the PurpleAir network.
We present a detailed account of the data collection and processing methods employed to build PurpleAirSF.
We conduct preliminary experiments using both classic and modern-temporal forecasting models, thereby establishing a benchmark for future air quality forecasting tasks.
arXiv Detail & Related papers (2023-06-24T12:10:16Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Satellite Image Time Series Analysis for Big Earth Observation Data [50.591267188664666]
This paper describes sits, an open-source R package for satellite image time series analysis using machine learning.
We show that this approach produces high accuracy for land use and land cover maps through a case study in the Cerrado biome.
arXiv Detail & Related papers (2022-04-24T15:23:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.