A Novel Transformer Network with Shifted Window Cross-Attention for
Spatiotemporal Weather Forecasting
- URL: http://arxiv.org/abs/2208.01252v1
- Date: Tue, 2 Aug 2022 05:04:53 GMT
- Title: A Novel Transformer Network with Shifted Window Cross-Attention for
Spatiotemporal Weather Forecasting
- Authors: Alabi Bojesomo, Hasan Al Marzouqi, Panos Liatsis
- Abstract summary: We tackle the challenge of weather forecasting using a video transformer network.
Vision transformer architectures have been explored in various applications.
We propose the use of Video Swin-Transformer, coupled with a dedicated augmentation scheme.
- Score: 5.414308305392762
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Earth Observatory is a growing research area that can capitalize on the
powers of AI for short time forecasting, a Now-casting scenario. In this work,
we tackle the challenge of weather forecasting using a video transformer
network. Vision transformer architectures have been explored in various
applications, with major constraints being the computational complexity of
Attention and the data hungry training. To address these issues, we propose the
use of Video Swin-Transformer, coupled with a dedicated augmentation scheme.
Moreover, we employ gradual spatial reduction on the encoder side and
cross-attention on the decoder. The proposed approach is tested on the
Weather4Cast2021 weather forecasting challenge data, which requires the
prediction of 8 hours ahead future frames (4 per hour) from an hourly weather
product sequence. The dataset was normalized to 0-1 to facilitate using the
evaluation metrics across different datasets. The model results in an MSE score
of 0.4750 when provided with training data, and 0.4420 during transfer learning
without using training data, respectively.
Related papers
- Transformer based time series prediction of the maximum power point for solar photovoltaic cells [0.24233709516962787]
This paper proposes an improved deep learning based maximum power point tracking (MPPT) in solar photovoltaic cells.
In this article, the ambient conditions of a location are represented through a comprehensive set of environmental features.
The proposed model performs power point tracking in a robust, dynamic, and nonlatent manner, over a wide range of atmospheric conditions.
arXiv Detail & Related papers (2024-09-24T17:26:55Z) - WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets [0.5735035463793009]
WeatherFormer is a transformer encoder-based model designed to learn robust weather features from minimal observations.
WeatherFormer was pretrained on a large pretraining dataset comprised of 39 years of satellite measurements across the Americas.
arXiv Detail & Related papers (2024-05-22T17:43:46Z) - Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling [55.13352174687475]
This paper proposes a physics-AI hybrid model (i.e., WeatherGFT) which Generalizes weather forecasts to Finer-grained Temporal scales.
Specifically, we employ a carefully designed PDE kernel to simulate physical evolution on a small time scale.
We introduce a lead time-aware training framework to promote the generalization of the model at different lead times.
arXiv Detail & Related papers (2024-05-22T16:21:02Z) - STC-ViT: Spatio Temporal Continuous Vision Transformer for Weather Forecasting [0.0]
We present STC-ViT, a Spatio-Temporal Continuous Transformer Vision for weather forecasting.
STC-ViT incorporates the continuous time Neural ODE layers with multi-head attention mechanism to learn the continuous weather evolution over time.
We evaluate STC-ViT against operational Numerical Weather Prediction (NWP) model and several deep learning based weather forecasting models.
arXiv Detail & Related papers (2024-02-28T01:15:30Z) - Rethinking Urban Mobility Prediction: A Super-Multivariate Time Series
Forecasting Approach [71.67506068703314]
Long-term urban mobility predictions play a crucial role in the effective management of urban facilities and services.
Traditionally, urban mobility data has been structured as videos, treating longitude and latitude as fundamental pixels.
In our research, we introduce a fresh perspective on urban mobility prediction.
Instead of oversimplifying urban mobility data as traditional video data, we regard it as a complex time series.
arXiv Detail & Related papers (2023-12-04T07:39:05Z) - FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond
10 Days Lead [93.67314652898547]
We present FengWu, an advanced data-driven global medium-range weather forecast system based on Artificial Intelligence (AI)
FengWu is able to accurately reproduce the atmospheric dynamics and predict the future land and atmosphere states at 37 vertical levels on a 0.25deg latitude-longitude resolution.
The results suggest that FengWu can significantly improve the forecast skill and extend the skillful global medium-range weather forecast out to 10.75 days lead.
arXiv Detail & Related papers (2023-04-06T09:16:39Z) - Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global
Weather Forecast [91.9372563527801]
We present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.
For the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy.
Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast and large-member ensemble forecast in real-time.
arXiv Detail & Related papers (2022-11-03T17:19:43Z) - ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast [12.812768685050898]
Post-processing ensemble prediction systems can improve weather forecasting, especially for extreme event prediction.
This paper introduces the ENS-10 dataset, consisting of ten ensemble members spread over 20 years (1998-2017).
The ensemble members are generated by perturbing numerical weather simulations to capture the chaotic behavior of the Earth.
arXiv Detail & Related papers (2022-06-29T17:40:56Z) - Vision in adverse weather: Augmentation using CycleGANs with various
object detectors for robust perception in autonomous racing [70.16043883381677]
In autonomous racing, the weather can change abruptly, causing significant degradation in perception, resulting in ineffective manoeuvres.
In order to improve detection in adverse weather, deep-learning-based models typically require extensive datasets captured in such conditions.
We introduce an approach of using synthesised adverse condition datasets in autonomous racing (generated using CycleGAN) to improve the performance of four out of five state-of-the-art detectors.
arXiv Detail & Related papers (2022-01-10T10:02:40Z) - TENT: Tensorized Encoder Transformer for Temperature Forecasting [3.498371632913735]
We introduce a new model based on the Transformer architecture for weather forecasting.
We show that compared to the original transformer and 3D convolutional neural networks, the proposed TENT model can better model the underlying complex pattern of weather data.
Experiments on two real-life weather datasets are performed.
arXiv Detail & Related papers (2021-06-28T14:17:22Z) - Deep Transformer Networks for Time Series Classification: The NPP Safety
Case [59.20947681019466]
An advanced temporal neural network referred to as the Transformer is used within a supervised learning fashion to model the time-dependent NPP simulation data.
The Transformer can learn the characteristics of the sequential data and yield promising performance with approximately 99% classification accuracy on the testing dataset.
arXiv Detail & Related papers (2021-04-09T14:26:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.