Explainable AI in Deep Learning-Based Prediction of Solar Storms
- URL: http://arxiv.org/abs/2508.16543v1
- Date: Fri, 22 Aug 2025 17:09:00 GMT
- Title: Explainable AI in Deep Learning-Based Prediction of Solar Storms
- Authors: Adam O. Rawashdeh, Jason T. L. Wang, Katherine G. Herbert,
- Abstract summary: We present an approach to making a deep learning-based solar storm prediction model interpretable.<n>This is the first time that interpretability has been added to an LSTM-based solar storm prediction model.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A deep learning model is often considered a black-box model, as its internal workings tend to be opaque to the user. Because of the lack of transparency, it is challenging to understand the reasoning behind the model's predictions. Here, we present an approach to making a deep learning-based solar storm prediction model interpretable, where solar storms include solar flares and coronal mass ejections (CMEs). This deep learning model, built based on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun's surface that produces a flare within 24 hours will also produce a CME associated with the flare. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. To make the model's predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model's behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based solar storm prediction model.
Related papers
- Weather Prediction with Diffusion Guided by Realistic Forecast Processes [49.07556359513563]
We introduce a novel method that applies diffusion models (DM) for weather forecasting.
Our method can achieve both direct and iterative forecasting with the same modeling framework.
The flexibility and controllability of our model empowers a more trustworthy DL system for the general weather community.
arXiv Detail & Related papers (2024-02-06T21:28:42Z) - FengWu-4DVar: Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation [67.20588721130623]
We develop an AI-based cyclic weather forecasting system, FengWu-4DVar.
FengWu-4DVar can incorporate observational data into the data-driven weather forecasting model.
Experiments on the simulated observational dataset demonstrate that FengWu-4DVar is capable of generating reasonable analysis fields.
arXiv Detail & Related papers (2023-12-16T02:07:56Z) - Physically Explainable Deep Learning for Convective Initiation
Nowcasting Using GOES-16 Satellite Observations [0.1874930567916036]
Convection initiation (CI) nowcasting remains a challenging problem for both numerical weather prediction models and existing nowcasting algorithms.
In this study, object-based probabilistic deep learning models are developed to predict CI based on multichannel infrared GOES-R satellite observations.
arXiv Detail & Related papers (2023-10-24T17:18:44Z) - Towards Interpretable Solar Flare Prediction with Attention-based Deep
Neural Networks [1.1624569521079424]
Solar flare prediction is a central problem in space weather forecasting.
We developed an attention-based deep learning model to perform full-disk binary flare predictions.
Our model can learn conspicuous features corresponding to active regions from full-disk magnetogram images.
arXiv Detail & Related papers (2023-09-08T19:21:10Z) - Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods [0.6882042556551611]
We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
arXiv Detail & Related papers (2023-07-29T03:18:56Z) - OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive
Learning [67.07363529640784]
We propose OpenSTL to categorize prevalent approaches into recurrent-based and recurrent-free models.
We conduct standard evaluations on datasets across various domains, including synthetic moving object trajectory, human motion, driving scenes, traffic flow and forecasting weather.
We find that recurrent-free models achieve a good balance between efficiency and performance than recurrent models.
arXiv Detail & Related papers (2023-06-20T03:02:14Z) - TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction [64.63645677568384]
We introduce a novel saliency prediction model that learns to output saliency maps in sequential time intervals.
Our approach locally modulates the saliency predictions by combining the learned temporal maps.
Our code will be publicly available on GitHub.
arXiv Detail & Related papers (2023-01-05T22:10:16Z) - Benchmarking of Deep Learning Irradiance Forecasting Models from Sky
Images -- an in-depth Analysis [0.0]
We train four commonly used Deep Learning architectures to forecast solar irradiance from sequences of hemispherical sky images.
Results show that encodingtemporal aspects greatly improved the predictions with 10 min Forecast Skill reaching 20.4% on the test year.
We conclude that, with a common setup, Deep Learning models tend to behave just as a'very smart persistence model', temporally aligned with the persistence model while mitigating its most penalising errors.
arXiv Detail & Related papers (2021-02-01T09:31:14Z) - Learning Interpretable Deep State Space Model for Probabilistic Time
Series Forecasting [98.57851612518758]
Probabilistic time series forecasting involves estimating the distribution of future based on its history.
We propose a deep state space model for probabilistic time series forecasting whereby the non-linear emission model and transition model are parameterized by networks.
We show in experiments that our model produces accurate and sharp probabilistic forecasts.
arXiv Detail & Related papers (2021-01-31T06:49:33Z) - VAE-LIME: Deep Generative Model Based Approach for Local Data-Driven
Model Interpretability Applied to the Ironmaking Industry [70.10343492784465]
It is necessary to expose to the process engineer, not solely the model predictions, but also their interpretability.
Model-agnostic local interpretability solutions based on LIME have recently emerged to improve the original method.
We present in this paper a novel approach, VAE-LIME, for local interpretability of data-driven models forecasting the temperature of the hot metal produced by a blast furnace.
arXiv Detail & Related papers (2020-07-15T07:07:07Z) - Interpreting LSTM Prediction on Solar Flare Eruption with Time-series
Clustering [0.0]
We conduct a post hoc analysis of solar flare predictions made by a Long Short Term Memory (LSTM) model.
We train the the LSTM model for binary classification to provide a prediction score for the probability of M/X class flares to occur in next hour.
Our work shows that a subset of SHARP parameters contain the key signals that strong solar flare eruptions are imminent.
arXiv Detail & Related papers (2019-12-27T22:56:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.