Interpreting LSTM Prediction on Solar Flare Eruption with Time-series
Clustering
- URL: http://arxiv.org/abs/1912.12360v2
- Date: Mon, 9 Mar 2020 21:36:09 GMT
- Title: Interpreting LSTM Prediction on Solar Flare Eruption with Time-series
Clustering
- Authors: Hu Sun (1), Ward Manchester (2), Zhenbang Jiao (1), Xiantong Wang (2),
Yang Chen (1 and 3) ((1) Department of Statistics, University of Michigan,
Ann Arbor, (2) Climate and Space Sciences and Engineering, University of
Michigan, Ann Arbor, (3) Michigan Institute for Data Science, University of
Michigan, Ann Arbor)
- Abstract summary: We conduct a post hoc analysis of solar flare predictions made by a Long Short Term Memory (LSTM) model.
We train the the LSTM model for binary classification to provide a prediction score for the probability of M/X class flares to occur in next hour.
Our work shows that a subset of SHARP parameters contain the key signals that strong solar flare eruptions are imminent.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We conduct a post hoc analysis of solar flare predictions made by a Long
Short Term Memory (LSTM) model employing data in the form of Space-weather HMI
Active Region Patches (SHARP) parameters calculated from data in proximity to
the magnetic polarity inversion line where the flares originate. We train the
the LSTM model for binary classification to provide a prediction score for the
probability of M/X class flares to occur in next hour. We then develop a
dimension-reduction technique to reduce the dimensions of SHARP parameter (LSTM
inputs) and demonstrate the different patterns of SHARP parameters
corresponding to the transition from low to high prediction score. Our work
shows that a subset of SHARP parameters contain the key signals that strong
solar flare eruptions are imminent. The dynamics of these parameters have a
highly uniform trajectory for many events whose LSTM prediction scores for M/X
class flares transition from very low to very high. The results demonstrate the
existence of a few threshold values of SHARP parameters that when surpassed
indicate a high probability of the eruption of a strong flare. Our method has
distilled the knowledge of solar flare eruption learnt by deep learning model
and provides a more interpretable approximation, which provides physical
insight to processes driving solar flares.
Related papers
- Enhancing Multivariate Time Series-based Solar Flare Prediction with Multifaceted Preprocessing and Contrastive Learning [0.9374652839580181]
Accurate solar flare prediction is crucial due to the significant risks that intense solar flares pose to astronauts, space equipment, and satellite communication systems.
Our research enhances solar flare prediction by utilizing advanced data preprocessing and classification methods.
arXiv Detail & Related papers (2024-09-21T05:00:34Z) - SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models [85.67096251281191]
We present an innovative approach to model fusion called zero-shot Sparse MIxture of Low-rank Experts (SMILE) construction.
SMILE allows for the upscaling of source models into an MoE model without extra data or further training.
We conduct extensive experiments across diverse scenarios, such as image classification and text generation tasks, using full fine-tuning and LoRA fine-tuning.
arXiv Detail & Related papers (2024-08-19T17:32:15Z) - Beam Prediction based on Large Language Models [51.45077318268427]
Millimeter-wave (mmWave) communication is promising for next-generation wireless networks but suffers from significant path loss.
Traditional deep learning models, such as long short-term memory (LSTM), enhance beam tracking accuracy however are limited by poor robustness and generalization.
In this letter, we use large language models (LLMs) to improve the robustness of beam prediction.
arXiv Detail & Related papers (2024-08-16T12:40:01Z) - Solar synthetic imaging: Introducing denoising diffusion probabilistic models on SDO/AIA data [0.0]
This study proposes using generative deep learning models, specifically a Denoising Diffusion Probabilistic Model (DDPM), to create synthetic images of solar phenomena.
By employing a dataset from the AIA instrument aboard the SDO spacecraft, we aim to address the data scarcity issue.
The DDPM's performance is evaluated using cluster metrics, Frechet Inception Distance (FID), and F1-score, showcasing promising results in generating realistic solar imagery.
arXiv Detail & Related papers (2024-04-03T08:18:45Z) - Forecasting SEP Events During Solar Cycles 23 and 24 Using Interpretable
Machine Learning [38.321248253111776]
We employ a suite of machine learning strategies to evaluate the predictive potential of a new data product for a forecast of post-solar flare SEP events.
Despite the augmented volume of data, the prediction accuracy reaches 0.7 +- 0.1, which aligns with but does not exceed these published benchmarks.
arXiv Detail & Related papers (2024-03-04T23:12:17Z) - Observation-Guided Meteorological Field Downscaling at Station Scale: A
Benchmark and a New Method [66.80344502790231]
We extend meteorological downscaling to arbitrary scattered station scales and establish a new benchmark and dataset.
Inspired by data assimilation techniques, we integrate observational data into the downscaling process, providing multi-scale observational priors.
Our proposed method outperforms other specially designed baseline models on multiple surface variables.
arXiv Detail & Related papers (2024-01-22T14:02:56Z) - Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods [0.6882042556551611]
We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
arXiv Detail & Related papers (2023-07-29T03:18:56Z) - To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis [50.31589712761807]
Large language models (LLMs) are notoriously token-hungry during pre-training, and high-quality text data on the web is approaching its scaling limit for LLMs.
We investigate the consequences of repeating pre-training data, revealing that the model is susceptible to overfitting.
Second, we examine the key factors contributing to multi-epoch degradation, finding that significant factors include dataset size, model parameters, and training objectives.
arXiv Detail & Related papers (2023-05-22T17:02:15Z) - A Comparative Study on Generative Models for High Resolution Solar
Observation Imaging [59.372588316558826]
This work investigates capabilities of current state-of-the-art generative models to accurately capture the data distribution behind observed solar activity states.
Using distributed training on supercomputers, we are able to train generative models for up to 1024x1024 resolution that produce high quality samples indistinguishable to human experts.
arXiv Detail & Related papers (2023-04-14T14:40:32Z) - Solar Flare Index Prediction Using SDO/HMI Vector Magnetic Data Products
with Statistical and Machine Learning Methods [6.205102537396887]
Solar flares, especially the M- and X-class ones, are often associated with coronal mass ejections (CMEs)
Here, we introduce several statistical and Machine Learning approaches to the prediction of the AR's Flare Index (FI) that quantifies the flare productivity of an AR.
arXiv Detail & Related papers (2022-09-28T02:13:33Z) - Time-to-Green predictions for fully-actuated signal control systems with
supervised learning [56.66331540599836]
This paper proposes a time series prediction framework using aggregated traffic signal and loop detector data.
We utilize state-of-the-art machine learning models to predict future signal phases' duration.
Results based on an empirical data set from a fully-actuated signal control system in Zurich, Switzerland, show that machine learning models outperform conventional prediction methods.
arXiv Detail & Related papers (2022-08-24T07:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.