Explainable Deep Learning-based Solar Flare Prediction with post hoc
Attention for Operational Forecasting
- URL: http://arxiv.org/abs/2308.02682v1
- Date: Fri, 4 Aug 2023 19:33:25 GMT
- Title: Explainable Deep Learning-based Solar Flare Prediction with post hoc
Attention for Operational Forecasting
- Authors: Chetraj Pandey, Rafal A. Angryk, Manolis K. Georgoulis, Berkay Aydin
- Abstract summary: This paper presents a post hoc analysis of a deep learning-based full-disk solar flare prediction model.
We used hourly full-disk line-of-sight magnetogram images and selected binary prediction mode to predict the occurrence of flares within 24 hours.
Our analysis shows that full-disk predictions of solar flares align with characteristics related to the active regions.
- Score: 0.6299766708197884
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a post hoc analysis of a deep learning-based full-disk
solar flare prediction model. We used hourly full-disk line-of-sight
magnetogram images and selected binary prediction mode to predict the
occurrence of $\geq$M1.0-class flares within 24 hours. We leveraged custom data
augmentation and sample weighting to counter the inherent class-imbalance
problem and used true skill statistic and Heidke skill score as evaluation
metrics. Recent advancements in gradient-based attention methods allow us to
interpret models by sending gradient signals to assign the burden of the
decision on the input features. We interpret our model using three post hoc
attention methods: (i) Guided Gradient-weighted Class Activation Mapping, (ii)
Deep Shapley Additive Explanations, and (iii) Integrated Gradients. Our
analysis shows that full-disk predictions of solar flares align with
characteristics related to the active regions. The key findings of this study
are: (1) We demonstrate that our full disk model can tangibly locate and
predict near-limb solar flares, which is a critical feature for operational
flare forecasting, (2) Our candidate model achieves an average
TSS=0.51$\pm$0.05 and HSS=0.38$\pm$0.08, and (3) Our evaluation suggests that
these models can learn conspicuous features corresponding to active regions
from full-disk magnetograms.
Related papers
- OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
We present a framework to simultaneously predict occupied locations and classes using a set of learnable queries.
OPUS incorporates a suite of non-trivial strategies to enhance model performance.
Our lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at near 2x FPS, while our heaviest model surpasses previous best results by 6.1 RayIoU.
arXiv Detail & Related papers (2024-09-14T07:44:22Z) - Unveiling the Potential of Deep Learning Models for Solar Flare
Prediction in Near-Limb Regions [1.2699007098398802]
This study aims to evaluate the performance of deep learning models in predicting $geq$M-class solar flares with a prediction window of 24 hours.
We trained three well-known deep learning architectures--AlexNet, VGG16, and ResNet34 using transfer learning.
Our research findings demonstrate that our models are capable of discerning complex spatial patterns from full-disk magnetograms.
arXiv Detail & Related papers (2023-09-25T19:30:02Z) - Towards Interpretable Solar Flare Prediction with Attention-based Deep
Neural Networks [1.1624569521079424]
Solar flare prediction is a central problem in space weather forecasting.
We developed an attention-based deep learning model to perform full-disk binary flare predictions.
Our model can learn conspicuous features corresponding to active regions from full-disk magnetogram images.
arXiv Detail & Related papers (2023-09-08T19:21:10Z) - Exploring Deep Learning for Full-disk Solar Flare Prediction with
Empirical Insights from Guided Grad-CAM Explanations [4.085931783551287]
This study progresses solar flare prediction research by presenting a full-disk deep-learning model to forecast $geq$M-class solar flares.
Our analysis unveils that full-disk solar flare predictions correspond with active region characteristics.
arXiv Detail & Related papers (2023-08-30T02:24:09Z) - Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods [0.6882042556551611]
We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
arXiv Detail & Related papers (2023-07-29T03:18:56Z) - Generalizing Backpropagation for Gradient-Based Interpretability [103.2998254573497]
We show that the gradient of a model is a special case of a more general formulation using semirings.
This observation allows us to generalize the backpropagation algorithm to efficiently compute other interpretable statistics.
arXiv Detail & Related papers (2023-07-06T15:19:53Z) - TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction [64.63645677568384]
We introduce a novel saliency prediction model that learns to output saliency maps in sequential time intervals.
Our approach locally modulates the saliency predictions by combining the learned temporal maps.
Our code will be publicly available on GitHub.
arXiv Detail & Related papers (2023-01-05T22:10:16Z) - Extracting or Guessing? Improving Faithfulness of Event Temporal
Relation Extraction [87.04153383938969]
We improve the faithfulness of TempRel extraction models from two perspectives.
The first perspective is to extract genuinely based on contextual description.
The second perspective is to provide proper uncertainty estimation.
arXiv Detail & Related papers (2022-10-10T19:53:13Z) - Towards Coupling Full-disk and Active Region-based Flare Prediction for
Operational Space Weather Forecasting [0.5872014229110215]
We present new approaches to train and deploy an operational solar flare prediction system for $geq$M1.0-class flares.
In full-disk mode, predictions are performed on full-disk line-of-sight magnetograms using deep learning models.
In active region-based models, predictions are issued for each active region individually.
arXiv Detail & Related papers (2022-08-11T22:34:44Z) - Towards Open-World Feature Extrapolation: An Inductive Graph Learning
Approach [80.8446673089281]
We propose a new learning paradigm with graph representation and learning.
Our framework contains two modules: 1) a backbone network (e.g., feedforward neural nets) as a lower model takes features as input and outputs predicted labels; 2) a graph neural network as an upper model learns to extrapolate embeddings for new features via message passing over a feature-data graph built from observed data.
arXiv Detail & Related papers (2021-10-09T09:02:45Z) - Value-driven Hindsight Modelling [68.658900923595]
Value estimation is a critical component of the reinforcement learning (RL) paradigm.
Model learning can make use of the rich transition structure present in sequences of observations, but this approach is usually not sensitive to the reward function.
We develop an approach for representation learning in RL that sits in between these two extremes.
This provides tractable prediction targets that are directly relevant for a task, and can thus accelerate learning the value function.
arXiv Detail & Related papers (2020-02-19T18:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.