Exploring Deep Learning for Full-disk Solar Flare Prediction with
Empirical Insights from Guided Grad-CAM Explanations
- URL: http://arxiv.org/abs/2308.15712v1
- Date: Wed, 30 Aug 2023 02:24:09 GMT
- Title: Exploring Deep Learning for Full-disk Solar Flare Prediction with
Empirical Insights from Guided Grad-CAM Explanations
- Authors: Chetraj Pandey, Anli Ji, Trisha Nandakumar, Rafal A. Angryk, Berkay
Aydin
- Abstract summary: This study progresses solar flare prediction research by presenting a full-disk deep-learning model to forecast $geq$M-class solar flares.
Our analysis unveils that full-disk solar flare predictions correspond with active region characteristics.
- Score: 4.085931783551287
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This study progresses solar flare prediction research by presenting a
full-disk deep-learning model to forecast $\geq$M-class solar flares and
evaluating its efficacy on both central (within $\pm$70$^\circ$) and near-limb
(beyond $\pm$70$^\circ$) events, showcasing qualitative assessment of post hoc
explanations for the model's predictions, and providing empirical findings from
human-centered quantitative assessments of these explanations. Our model is
trained using hourly full-disk line-of-sight magnetogram images to predict
$\geq$M-class solar flares within the subsequent 24-hour prediction window.
Additionally, we apply the Guided Gradient-weighted Class Activation Mapping
(Guided Grad-CAM) attribution method to interpret our model's predictions and
evaluate the explanations. Our analysis unveils that full-disk solar flare
predictions correspond with active region characteristics. The following points
represent the most important findings of our study: (1) Our deep learning
models achieved an average true skill statistic (TSS) of $\sim$0.51 and a
Heidke skill score (HSS) of $\sim$0.38, exhibiting skill to predict solar
flares where for central locations the average recall is $\sim$0.75 (recall
values for X- and M-class are 0.95 and 0.73 respectively) and for the near-limb
flares the average recall is $\sim$0.52 (recall values for X- and M-class are
0.74 and 0.50 respectively); (2) qualitative examination of the model's
explanations reveals that it discerns and leverages features linked to active
regions in both central and near-limb locations within full-disk magnetograms
to produce respective predictions. In essence, our models grasp the shape and
texture-based properties of flaring active regions, even in proximity to limb
areas -- a novel and essential capability with considerable significance for
operational forecasting systems.
Related papers
- Embedding Ordinality to Binary Loss Function for Improving Solar Flare Forecasting [1.0225653612678713]
We introduce a novel approach to encode ordinality into a binary loss function showing an application to solar flare prediction.
We enhance solar flare forecasting by enabling flare predictions for each AR across the entire solar disk, without any longitudinal restrictions.
arXiv Detail & Related papers (2024-08-21T16:42:58Z) - Unveiling the Potential of Deep Learning Models for Solar Flare
Prediction in Near-Limb Regions [1.2699007098398802]
This study aims to evaluate the performance of deep learning models in predicting $geq$M-class solar flares with a prediction window of 24 hours.
We trained three well-known deep learning architectures--AlexNet, VGG16, and ResNet34 using transfer learning.
Our research findings demonstrate that our models are capable of discerning complex spatial patterns from full-disk magnetograms.
arXiv Detail & Related papers (2023-09-25T19:30:02Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z) - Towards Interpretable Solar Flare Prediction with Attention-based Deep
Neural Networks [1.1624569521079424]
Solar flare prediction is a central problem in space weather forecasting.
We developed an attention-based deep learning model to perform full-disk binary flare predictions.
Our model can learn conspicuous features corresponding to active regions from full-disk magnetogram images.
arXiv Detail & Related papers (2023-09-08T19:21:10Z) - Explainable Deep Learning-based Solar Flare Prediction with post hoc
Attention for Operational Forecasting [0.6299766708197884]
This paper presents a post hoc analysis of a deep learning-based full-disk solar flare prediction model.
We used hourly full-disk line-of-sight magnetogram images and selected binary prediction mode to predict the occurrence of flares within 24 hours.
Our analysis shows that full-disk predictions of solar flares align with characteristics related to the active regions.
arXiv Detail & Related papers (2023-08-04T19:33:25Z) - Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods [0.6882042556551611]
We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
arXiv Detail & Related papers (2023-07-29T03:18:56Z) - How to Estimate Model Transferability of Pre-Trained Speech Models? [84.11085139766108]
"Score-based assessment" framework for estimating transferability of pre-trained speech models.
We leverage upon two representation theories, Bayesian likelihood estimation and optimal transport, to generate rank scores for the PSM candidates.
Our framework efficiently computes transferability scores without actual fine-tuning of candidate models or layers.
arXiv Detail & Related papers (2023-06-01T04:52:26Z) - Confidence and Dispersity Speak: Characterising Prediction Matrix for
Unsupervised Accuracy Estimation [51.809741427975105]
This work aims to assess how well a model performs under distribution shifts without using labels.
We use the nuclear norm that has been shown to be effective in characterizing both properties.
We show that the nuclear norm is more accurate and robust in accuracy than existing methods.
arXiv Detail & Related papers (2023-02-02T13:30:48Z) - Predictive World Models from Real-World Partial Observations [66.80340484148931]
We present a framework for learning a probabilistic predictive world model for real-world road environments.
While prior methods require complete states as ground truth for learning, we present a novel sequential training method to allow HVAEs to learn to predict complete states from partially observed states only.
arXiv Detail & Related papers (2023-01-12T02:07:26Z) - TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction [64.63645677568384]
We introduce a novel saliency prediction model that learns to output saliency maps in sequential time intervals.
Our approach locally modulates the saliency predictions by combining the learned temporal maps.
Our code will be publicly available on GitHub.
arXiv Detail & Related papers (2023-01-05T22:10:16Z) - Value-driven Hindsight Modelling [68.658900923595]
Value estimation is a critical component of the reinforcement learning (RL) paradigm.
Model learning can make use of the rich transition structure present in sequences of observations, but this approach is usually not sensitive to the reward function.
We develop an approach for representation learning in RL that sits in between these two extremes.
This provides tractable prediction targets that are directly relevant for a task, and can thus accelerate learning the value function.
arXiv Detail & Related papers (2020-02-19T18:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.