Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data
- URL: http://arxiv.org/abs/2404.08613v1
- Date: Fri, 12 Apr 2024 17:22:29 GMT
- Title: Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data
- Authors: Huan Zhang, Justin Finkel, Dorian S. Abbot, Edwin P. Gerber, Jonathan Weare,
- Abstract summary: Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place.
Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events.
We train a convolutional neural network (CNN) and build a sparse predictive model for the persistence of Atlantic blocking.
- Score: 7.72430072816717
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place. The duration of blocking events is, however, underestimated in climate models. Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events and diagnose model deficiencies. We demonstrate this approach on an idealized quasigeostrophic model developed by Marshall and Molteni (1993). We train a convolutional neural network (CNN), and subsequently, build a sparse predictive model for the persistence of Atlantic blocking, conditioned on an initial high-pressure anomaly. Shapley Additive ExPlanation (SHAP) analysis reveals that high-pressure anomalies in the American Southeast and North Atlantic, separated by a trough over Atlantic Canada, contribute significantly to prediction of sustained blocking events in the Atlantic region. This agrees with previous work that identified precursors in the same regions via wave train analysis. When we apply the same CNN to blockings in the ERA5 atmospheric reanalysis, there is insufficient data to accurately predict persistent blocks. We partially overcome this limitation by pre-training the CNN on the plentiful data of the Marshall-Molteni model, and then using Transfer Learning to achieve better predictions than direct training. SHAP analysis before and after transfer learning allows a comparison between the predictive features in the reanalysis and the quasigeostrophic model, quantifying dynamical biases in the idealized model. This work demonstrates the potential for machine learning methods to extract meaningful precursors of extreme weather events and achieve better prediction using limited observational data.
Related papers
- Inferring Thunderstorm Occurrence from Vertical Profiles of Convection-Permitting Simulations: Physical Insights from a Physical Deep Learning Model [0.0]
Thunderstorms have significant social and economic impacts due to heavy precipitation, hail, lightning, and strong winds.
We develop SALAMA 1D, a deep neural network that directly infers the probability of thunderstorm occurrence from vertical profiles of ten atmospheric variables.
SALAMA 1D is trained over Central Europe with lightning observations as the ground truth.
arXiv Detail & Related papers (2024-09-30T08:40:28Z) - Ensemble data assimilation to diagnose AI-based weather prediction model: A case with ClimaX version 0.3.1 [0.0]
This study proposes using ensemble data assimilation for diagnosing AI-based weather prediction models.
Experiments with an AI-based model ClimaX demonstrated that the ensemble data assimilation cycled stably for the AI-based weather prediction model.
arXiv Detail & Related papers (2024-07-25T05:22:08Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling [55.13352174687475]
This paper proposes a physics-AI hybrid model (i.e., WeatherGFT) which Generalizes weather forecasts to Finer-grained Temporal scales.
Specifically, we employ a carefully designed PDE kernel to simulate physical evolution on a small time scale.
We introduce a lead time-aware training framework to promote the generalization of the model at different lead times.
arXiv Detail & Related papers (2024-05-22T16:21:02Z) - ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast.
We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples.
Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z) - Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach
Without Reanalysis Data [18.321586950937647]
We present an approach that harnesses real-time Unified Model (UM) data, sidestepping the limitations of reanalysis data.
Our model provides predictions at 6-hour intervals for up to 72 hours in advance and outperforms both state-of-the-art data-driven methods and numerical weather prediction models.
arXiv Detail & Related papers (2024-01-28T18:28:33Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z) - Spatio-temporal neural structural causal models for bike flow prediction [2.991894112851257]
The fundamental issue of managing bike-sharing systems is bike flow prediction.
Recent methods over-emphasize the contextual conditions on the transportation system.
We propose a Spatiotemporal-temporal Structure Causal Model.
arXiv Detail & Related papers (2023-01-19T01:39:21Z) - Long-term hail risk assessment with deep neural networks [0.0]
Hail risk assessment is necessary to estimate and reduce damage to crops, orchards, and infrastructure.
There are no machine learning models for data-driven forecasting of changes in hail frequency for a given area.
This study compares two approaches and introduces a model suitable for the task of forecasting changes in hail frequency for ongoing decades.
arXiv Detail & Related papers (2022-08-31T18:24:39Z) - An advanced spatio-temporal convolutional recurrent neural network for
storm surge predictions [73.4962254843935]
We study the capability of artificial neural network models to emulate storm surge based on the storm track/size/intensity history.
This study presents a neural network model that can predict storm surge, informed by a database of synthetic storm simulations.
arXiv Detail & Related papers (2022-04-18T23:42:18Z) - Forecasting large-scale circulation regimes using deformable
convolutional neural networks and global spatiotemporal climate data [86.1450118623908]
We investigate a supervised machine learning approach based on deformable convolutional neural networks (deCNNs)
We forecast the North Atlantic-European weather regimes during extended boreal winter for 1 to 15 days into the future.
Due to its wider field of view, we also observe deCNN achieving considerably better performance than regular convolutional neural networks at lead times beyond 5-6 days.
arXiv Detail & Related papers (2022-02-10T11:37:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.