Weather2K: A Multivariate Spatio-Temporal Benchmark Dataset for
Meteorological Forecasting Based on Real-Time Observation Data from Ground
Weather Stations
- URL: http://arxiv.org/abs/2302.10493v1
- Date: Tue, 21 Feb 2023 07:46:08 GMT
- Title: Weather2K: A Multivariate Spatio-Temporal Benchmark Dataset for
Meteorological Forecasting Based on Real-Time Observation Data from Ground
Weather Stations
- Authors: Xun Zhu and Yutong Xiong and Ming Wu and Gaozhen Nie and Bin Zhang and
Ziheng Yang
- Abstract summary: This paper presents a new benchmark dataset named Weather2K.
It aims to make up for the deficiencies of existing weather forecasting datasets in terms of real-time, reliability, and diversity.
The data is hourly collected from 2,130 ground weather stations covering an area of 6 million square kilometers.
- Score: 9.061222268562249
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weather forecasting is one of the cornerstones of meteorological work. In
this paper, we present a new benchmark dataset named Weather2K, which aims to
make up for the deficiencies of existing weather forecasting datasets in terms
of real-time, reliability, and diversity, as well as the key bottleneck of data
quality. To be specific, our Weather2K is featured from the following aspects:
1) Reliable and real-time data. The data is hourly collected from 2,130 ground
weather stations covering an area of 6 million square kilometers. 2)
Multivariate meteorological variables. 20 meteorological factors and 3
constants for position information are provided with a length of 40,896 time
steps. 3) Applicable to diverse tasks. We conduct a set of baseline tests on
time series forecasting and spatio-temporal forecasting. To the best of our
knowledge, our Weather2K is the first attempt to tackle weather forecasting
task by taking full advantage of the strengths of observation data from ground
weather stations. Based on Weather2K, we further propose Meteorological Factors
based Multi-Graph Convolution Network (MFMGCN), which can effectively construct
the intrinsic correlation among geographic locations based on meteorological
factors. Sufficient experiments show that MFMGCN improves both the forecasting
performance and temporal robustness. We hope our Weather2K can significantly
motivate researchers to develop efficient and accurate algorithms to advance
the task of weather forecasting. The dataset can be available at
https://github.com/bycnfz/weather2k/.
Related papers
- Generating Fine-Grained Causality in Climate Time Series Data for Forecasting and Anomaly Detection [67.40407388422514]
We design a conceptual fine-grained causal model named TBN Granger Causality.
Second, we propose an end-to-end deep generative model called TacSas, which discovers TBN Granger Causality in a generative manner.
We test TacSas on climate benchmark ERA5 for climate forecasting and the extreme weather benchmark of NOAA for extreme weather alerts.
arXiv Detail & Related papers (2024-08-08T06:47:21Z) - WeatherQA: Can Multimodal Language Models Reason about Severe Weather? [45.43764278625153]
Severe convective weather events, such as hail, tornadoes, and thunderstorms, often occur quickly yet cause significant damage, costing billions of dollars every year.
This highlights the importance of forecasting severe weather threats hours in advance to better prepare meteorologists and residents in at-risk areas.
We introduce WeatherQA, the first multimodal dataset designed for machines to reason about complex combinations of weather parameters and predict severe weather in real-world scenarios.
arXiv Detail & Related papers (2024-06-17T05:23:18Z) - Observation-Guided Meteorological Field Downscaling at Station Scale: A
Benchmark and a New Method [66.80344502790231]
We extend meteorological downscaling to arbitrary scattered station scales and establish a new benchmark and dataset.
Inspired by data assimilation techniques, we integrate observational data into the downscaling process, providing multi-scale observational priors.
Our proposed method outperforms other specially designed baseline models on multiple surface variables.
arXiv Detail & Related papers (2024-01-22T14:02:56Z) - GraphCast: Learning skillful medium-range global weather forecasting [107.40054095223779]
We introduce a machine learning-based method called "GraphCast", which can be trained directly from reanalysis data.
It predicts hundreds of weather variables, over 10 days at 0.25 degree resolution globally, in under one minute.
We show that GraphCast significantly outperforms the most accurate operational deterministic systems on 90% of 1380 verification targets.
arXiv Detail & Related papers (2022-12-24T18:15:39Z) - Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global
Weather Forecast [91.9372563527801]
We present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.
For the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy.
Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast and large-member ensemble forecast in real-time.
arXiv Detail & Related papers (2022-11-03T17:19:43Z) - MLRM: A Multiple Linear Regression based Model for Average Temperature
Prediction of A Day [3.6704226968275258]
We aim to predict the weather of an area using past meteorological data and features using the Multiple Linear Regression Model.
The model is successfully able to predict the average temperature of a day with an error of 2.8 degrees Celsius.
arXiv Detail & Related papers (2022-03-11T10:22:57Z) - Efficient spatio-temporal weather forecasting using U-Net [0.0]
Weather forecast plays an essential role in multiple aspects of the daily life of human beings.
Deep learning based models have seen wide success in many weather-prediction related tasks.
arXiv Detail & Related papers (2021-12-13T10:28:33Z) - A generative adversarial network approach to (ensemble) weather
prediction [91.3755431537592]
We use a conditional deep convolutional generative adversarial network to predict the geopotential height of the 500 hPa pressure level, the two-meter temperature and the total precipitation for the next 24 hours over Europe.
The proposed models are trained on 4 years of ERA5 reanalysis data from 2015-2018 with the goal to predict the associated meteorological fields in 2019.
arXiv Detail & Related papers (2020-06-13T20:53:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.