CSU-PCAST: A Dual-Branch Transformer Framework for medium-range ensemble Precipitation Forecasting
- URL: http://arxiv.org/abs/2510.20769v1
- Date: Thu, 23 Oct 2025 17:43:38 GMT
- Title: CSU-PCAST: A Dual-Branch Transformer Framework for medium-range ensemble Precipitation Forecasting
- Authors: Tianyi Xiong, Haonan Chen
- Abstract summary: This study develops a deep learning-based ensemble framework for multi-step precipitation prediction. The architecture employs a patch-based Swin Transformer backbone with periodic convolutions to handle longitudinal continuity. Training minimizes a hybrid loss combining the Continuous Ranked Probability Score (CRPS) and weighted log1p mean squared error (log1pMSE).
- Score: 6.540270371082014
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate medium-range precipitation forecasting is crucial for hydrometeorological risk management and disaster mitigation, yet remains challenging for current numerical weather prediction (NWP) systems. Traditional ensemble systems such as the Global Ensemble Forecast System (GEFS) struggle to maintain high skill, especially for moderate and heavy rainfall at extended lead times. This study develops a deep learning-based ensemble framework for multi-step precipitation prediction through joint modeling of a comprehensive set of atmospheric variables. The model is trained on ERA5 reanalysis data at 0.25$^{\circ}$ spatial resolution, with precipitation labels from NASA's Integrated Multi-satellite Retrievals for Global Precipitation Measurement (GPM) constellation (IMERG), incorporating 57 input variables, including upper-air and surface predictors. The architecture employs a patch-based Swin Transformer backbone with periodic convolutions to handle longitudinal continuity and integrates time and noise embeddings through conditional layer normalization. A dual-branch decoder predicts total precipitation and other variables, with targeted freezing of encoder-decoder pathways for specialized training. Training minimizes a hybrid loss combining the Continuous Ranked Probability Score (CRPS) and weighted log1p mean squared error (log1pMSE), balancing probabilistic accuracy and magnitude fidelity. During inference, the model ingests real-time Global Forecast System (GFS) initial conditions to generate 15-day forecasts autoregressively. Evaluation against GEFS using IMERG data demonstrates higher Critical Success Index (CSI) scores at precipitation thresholds of 0.1 mm, 1 mm, 10 mm, and 20 mm, highlighting improved performance for moderate to heavy rainfall.
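The abstract names two quantities that are easy to illustrate: the hybrid training loss (CRPS plus weighted log1p MSE) and the Critical Success Index used for evaluation. The following is a minimal NumPy sketch, not the authors' implementation. The function names, the mixing factor `alpha`, the optional `weights` array, and the choice to apply the log1p MSE to the ensemble mean are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np


def ensemble_crps(members, obs):
    """Empirical ensemble CRPS, E|X - y| - 0.5 * E|X - X'|, averaged over grid points.

    members: array of shape (M, ...) holding M ensemble members.
    obs:     array of shape (...) holding the observed field.
    """
    members = np.asarray(members, dtype=float)
    obs = np.asarray(obs, dtype=float)
    skill = np.abs(members - obs).mean(axis=0)
    # Mean absolute difference over all member pairs (including i == j).
    spread = np.abs(members[:, None] - members[None, :]).mean(axis=(0, 1))
    return float((skill - 0.5 * spread).mean())


def log1p_mse(pred, obs, weights=None):
    """Mean squared error in log(1 + x) space; `weights` is a hypothetical per-point weight."""
    err = (np.log1p(pred) - np.log1p(obs)) ** 2
    if weights is None:
        return float(err.mean())
    w = np.broadcast_to(np.asarray(weights, dtype=float), err.shape)
    return float(np.average(err, weights=w))


def hybrid_loss(members, obs, alpha=0.5, weights=None):
    """CRPS plus alpha times log1p MSE of the ensemble mean (alpha is an assumed factor)."""
    members = np.asarray(members, dtype=float)
    return ensemble_crps(members, obs) + alpha * log1p_mse(members.mean(axis=0), obs, weights)


def csi(pred, obs, threshold):
    """Critical Success Index: hits / (hits + misses + false alarms) at a threshold."""
    p = np.asarray(pred) >= threshold
    o = np.asarray(obs) >= threshold
    hits = np.sum(p & o)
    misses = np.sum(~p & o)
    false_alarms = np.sum(p & ~o)
    denom = hits + misses + false_alarms
    return float(hits / denom) if denom else float("nan")


# Toy example: two ensemble members on a two-point grid (values in mm).
members = np.array([[1.0, 2.0], [3.0, 2.0]])
obs = np.array([2.0, 2.0])
print(hybrid_loss(members, obs))
print(csi(np.array([0.0, 5.0, 12.0, 30.0]), np.array([0.0, 12.0, 11.0, 2.0]), 10.0))
```

The log1p transform compresses large rainfall amounts, so a squared error in that space is less dominated by a few extreme cells, while the CRPS term rewards a calibrated ensemble spread. CSI ignores correct negatives, which is why it is commonly reported per threshold for rare events such as heavy rainfall.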
Related papers
- A Dual-TransUNet Deep Learning Framework for Multi-Source Precipitation Merging and Improving Seasonal and Extreme Estimates [2.9811995156103457]
Multi-source precipitation products (MSPs) from satellite retrievals and reanalysis are widely used for hydroclimatic monitoring. We develop a dual-stage TransUNet-based multi-source precipitation merging framework (DDL-MSPMF) that integrates six MSPs with four ERA5 near-surface physical predictors.
arXiv Detail & Related papers (2026-02-04T16:55:43Z)
- How Effective Are Time-Series Models for Precipitation Nowcasting? A Comprehensive Benchmark for GNSS-based Precipitation Nowcasting [18.312964316878283]
RainfallBench is a benchmark designed for precipitation nowcasting. The dataset is derived from five years of meteorological observations, recorded at hourly intervals across six essential variables. It incorporates precipitable water vapor (PWV), a crucial indicator of rainfall that is absent in other datasets.
arXiv Detail & Related papers (2025-09-28T03:21:24Z)
- Wavelet-SARIMA-Transformer: A Hybrid Model for Rainfall Forecasting [0.0]
This study develops and evaluates a novel hybrid Wavelet-SARIMA-Transformer (WST) framework to forecast monthly rainfall across five meteorological subdivisions of Northeast India over the 1971 to 2023 period.
arXiv Detail & Related papers (2025-09-15T13:27:19Z)
- OneForecast: A Universal Framework for Global and Regional Weather Forecasting [67.61381313555091]
We propose a global-regional nested weather forecasting framework (OneForecast) based on graph neural networks. By combining a dynamic system perspective with multi-grid theory, we construct a multi-scale graph structure and densify the target region. We introduce an adaptive messaging mechanism, using dynamic gating units, to deeply integrate node and edge features for more accurate extreme event forecasting.
arXiv Detail & Related papers (2025-02-01T06:49:16Z)
- Skillful High-Resolution Ensemble Precipitation Forecasting with an Integrated Deep Learning Framework [4.3313006430322165]
High-resolution precipitation forecasts are crucial for providing accurate weather prediction and supporting effective responses to extreme weather events. We propose a physics-inspired deep learning framework for high-resolution ensemble precipitation forecasting.
arXiv Detail & Related papers (2025-01-06T10:29:38Z)
- Generating Fine-Grained Causality in Climate Time Series Data for Forecasting and Anomaly Detection [67.40407388422514]
We design a conceptual fine-grained causal model named TBN Granger Causality.
Second, we propose an end-to-end deep generative model called TacSas, which discovers TBN Granger Causality in a generative manner.
We test TacSas on climate benchmark ERA5 for climate forecasting and the extreme weather benchmark of NOAA for extreme weather alerts.
arXiv Detail & Related papers (2024-08-08T06:47:21Z)
- ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast.
We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples.
Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z)
- Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z)
- Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z)
- Global Precipitation Nowcasting of Integrated Multi-satellitE Retrievals for GPM: A U-Net Convolutional LSTM Architecture [3.5776345196917254]
This paper presents a deep learning architecture for nowcasting of precipitation almost globally every 30 min with a 4-hour lead time.
The architecture fuses a U-Net and a convolutional long short-term memory (LSTM) neural network.
It is trained using data from the Integrated MultisatellitE Retrievals for GPM (IMERG) and a few key precipitation drivers from the Global Forecast System (GFS).
arXiv Detail & Related papers (2023-07-20T13:04:26Z)
- Towards replacing precipitation ensemble predictions systems using machine learning [0.0]
We propose a new approach to generating ensemble weather predictions for high-resolution precipitation.
The method uses generative adversarial networks to learn the complex patterns of precipitation.
We demonstrate the feasibility of generating realistic precipitation ensemble members on unseen higher resolutions.
arXiv Detail & Related papers (2023-04-20T12:20:35Z)
- Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast [91.9372563527801]
We present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.
For the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy.
Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast and large-member ensemble forecast in real-time.
arXiv Detail & Related papers (2022-11-03T17:19:43Z)
- Nowcasting-Nets: Deep Neural Network Structures for Precipitation Nowcasting Using IMERG [1.9860735109145415]
We use Recurrent and Convolutional deep neural network structures to address the challenge of precipitation nowcasting.
A total of five models are trained using Global Precipitation Measurement (GPM) Integrated Multi-satellitE Retrievals for GPM (IMERG) precipitation data over the Eastern Contiguous United States (CONUS).
The models were designed to provide forecasts with a lead time of up to 1.5 hours and, by using a feedback loop approach, the ability of the models to extend the forecast time to 4.5 hours was also investigated.
arXiv Detail & Related papers (2021-08-16T02:55:32Z)
- A generative adversarial network approach to (ensemble) weather prediction [91.3755431537592]
We use a conditional deep convolutional generative adversarial network to predict the geopotential height of the 500 hPa pressure level, the two-meter temperature and the total precipitation for the next 24 hours over Europe.
The proposed models are trained on 4 years of ERA5 reanalysis data from 2015-2018 with the goal to predict the associated meteorological fields in 2019.
arXiv Detail & Related papers (2020-06-13T20:53:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.