Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods
- URL: http://arxiv.org/abs/2307.15878v1
- Date: Sat, 29 Jul 2023 03:18:56 GMT
- Title: Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods
- Authors: Chetraj Pandey, Rafal A. Angryk and Berkay Aydin
- Abstract summary: We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
- Score: 0.6882042556551611
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper contributes to the growing body of research on deep learning
methods for solar flare prediction, primarily focusing on highly overlooked
near-limb flares and utilizing the attribution methods to provide a post hoc
qualitative explanation of the model's predictions. We present a solar flare
prediction model, which is trained using hourly full-disk line-of-sight
magnetogram images and employs a binary prediction mode to forecast
$\geq$M-class flares that may occur within the following 24-hour period. To
address the class imbalance, we employ a fusion of data augmentation and class
weighting techniques; and evaluate the overall performance of our model using
the true skill statistic (TSS) and Heidke skill score (HSS). Moreover, we
applied three attribution methods, namely Guided Gradient-weighted Class
Activation Mapping, Integrated Gradients, and Deep Shapley Additive
Explanations, to interpret and cross-validate our model's predictions with the
explanations. Our analysis revealed that full-disk prediction of solar flares
aligns with characteristics related to active regions (ARs). In particular, the
key findings of this study are: (1) our deep learning models achieved an
average TSS=0.51 and HSS=0.35, and the results further demonstrate a competent
capability to predict near-limb solar flares and (2) the qualitative analysis
of the model explanation indicates that our model identifies and uses features
associated with ARs in central and near-limb locations from full-disk
magnetograms to make corresponding predictions. In other words, our models
learn the shape and texture-based characteristics of flaring ARs even at
near-limb areas, which is a novel and critical capability with significant
implications for operational forecasting.
Related papers
- Relational Learning in Pre-Trained Models: A Theory from Hypergraph Recovery Perspective [60.64922606733441]
We introduce a mathematical model that formalizes relational learning as hypergraph recovery to study pre-training of Foundation Models (FMs)
In our framework, the world is represented as a hypergraph, with data abstracted as random samples from hyperedges. We theoretically examine the feasibility of a Pre-Trained Model (PTM) to recover this hypergraph and analyze the data efficiency in a minimax near-optimal style.
arXiv Detail & Related papers (2024-06-17T06:20:39Z) - CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding [62.075029712357]
This work introduces the Cognitive Diffusion Probabilistic Models (CogDPM)
CogDPM features a precision estimation method based on the hierarchical sampling capabilities of diffusion models and weight the guidance with precision weights estimated by the inherent property of diffusion models.
We apply CogDPM to real-world prediction tasks using the United Kindom precipitation and surface wind datasets.
arXiv Detail & Related papers (2024-05-03T15:54:50Z) - Unveiling the Potential of Deep Learning Models for Solar Flare
Prediction in Near-Limb Regions [1.2699007098398802]
This study aims to evaluate the performance of deep learning models in predicting $geq$M-class solar flares with a prediction window of 24 hours.
We trained three well-known deep learning architectures--AlexNet, VGG16, and ResNet34 using transfer learning.
Our research findings demonstrate that our models are capable of discerning complex spatial patterns from full-disk magnetograms.
arXiv Detail & Related papers (2023-09-25T19:30:02Z) - Towards Interpretable Solar Flare Prediction with Attention-based Deep
Neural Networks [1.1624569521079424]
Solar flare prediction is a central problem in space weather forecasting.
We developed an attention-based deep learning model to perform full-disk binary flare predictions.
Our model can learn conspicuous features corresponding to active regions from full-disk magnetogram images.
arXiv Detail & Related papers (2023-09-08T19:21:10Z) - Exploring Deep Learning for Full-disk Solar Flare Prediction with
Empirical Insights from Guided Grad-CAM Explanations [4.085931783551287]
This study progresses solar flare prediction research by presenting a full-disk deep-learning model to forecast $geq$M-class solar flares.
Our analysis unveils that full-disk solar flare predictions correspond with active region characteristics.
arXiv Detail & Related papers (2023-08-30T02:24:09Z) - Explainable Deep Learning-based Solar Flare Prediction with post hoc
Attention for Operational Forecasting [0.6299766708197884]
This paper presents a post hoc analysis of a deep learning-based full-disk solar flare prediction model.
We used hourly full-disk line-of-sight magnetogram images and selected binary prediction mode to predict the occurrence of flares within 24 hours.
Our analysis shows that full-disk predictions of solar flares align with characteristics related to the active regions.
arXiv Detail & Related papers (2023-08-04T19:33:25Z) - TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction [64.63645677568384]
We introduce a novel saliency prediction model that learns to output saliency maps in sequential time intervals.
Our approach locally modulates the saliency predictions by combining the learned temporal maps.
Our code will be publicly available on GitHub.
arXiv Detail & Related papers (2023-01-05T22:10:16Z) - Spatial machine-learning model diagnostics: a model-agnostic
distance-based approach [91.62936410696409]
This contribution proposes spatial prediction error profiles (SPEPs) and spatial variable importance profiles (SVIPs) as novel model-agnostic assessment and interpretation tools.
The SPEPs and SVIPs of geostatistical methods, linear models, random forest, and hybrid algorithms show striking differences and also relevant similarities.
The novel diagnostic tools enrich the toolkit of spatial data science, and may improve ML model interpretation, selection, and design.
arXiv Detail & Related papers (2021-11-13T01:50:36Z) - An Interpretable Probabilistic Model for Short-Term Solar Power
Forecasting Using Natural Gradient Boosting [0.0]
We propose a two stage probabilistic forecasting framework able to generate highly accurate, reliable, and sharp forecasts.
The framework offers full transparency on both the point forecasts and the prediction intervals (PIs)
To highlight the performance and the applicability of the proposed framework, real data from two PV parks located in Southern Germany are employed.
arXiv Detail & Related papers (2021-08-05T12:59:38Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Value-driven Hindsight Modelling [68.658900923595]
Value estimation is a critical component of the reinforcement learning (RL) paradigm.
Model learning can make use of the rich transition structure present in sequences of observations, but this approach is usually not sensitive to the reward function.
We develop an approach for representation learning in RL that sits in between these two extremes.
This provides tractable prediction targets that are directly relevant for a task, and can thus accelerate learning the value function.
arXiv Detail & Related papers (2020-02-19T18:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.