Feature Selection on a Flare Forecasting Testbed: A Comparative Study of
24 Methods
- URL: http://arxiv.org/abs/2109.14770v1
- Date: Thu, 30 Sep 2021 00:23:09 GMT
- Title: Feature Selection on a Flare Forecasting Testbed: A Comparative Study of
24 Methods
- Authors: Atharv Yeoleka, Sagar Patel, Shreejaa Talla, Krishna Rukmini
Puthucode, Azim Ahmadzadeh, Viacheslav M. Sadykov, and Rafal A. Angryk
- Abstract summary: SWAN-SF contains 54 unique features, with 24 quantitative features computed from the photospheric magnetic field maps of active regions.
In this study, for the first time, we systematically attacked the problem of quantifying the relevance of these features to the ambitious task of flare forecasting.
- Score: 0.7768952514701895
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The Space-Weather ANalytics for Solar Flares (SWAN-SF) is a multivariate time
series benchmark dataset recently created to serve the heliophysics community
as a testbed for solar flare forecasting models. SWAN-SF contains 54 unique
features, with 24 quantitative features computed from the photospheric magnetic
field maps of active regions, describing their precedent flare activity. In
this study, for the first time, we systematically attacked the problem of
quantifying the relevance of these features to the ambitious task of flare
forecasting. We implemented an end-to-end pipeline for preprocessing, feature
selection, and evaluation phases. We incorporated 24 Feature Subset Selection
(FSS) algorithms, including multivariate and univariate, supervised and
unsupervised, wrappers and filters. We methodologically compared the results of
different FSS algorithms, both on the multivariate time series and vectorized
formats, and tested their correlation and reliability, to the extent possible,
by using the selected features for flare forecasting on unseen data, in
univariate and multivariate fashions. We concluded our investigation with a
report of the best FSS methods in terms of their top-k features, and the
analysis of the findings. We wish the reproducibility of our study and the
availability of the data allow the future attempts be comparable with our
findings and themselves.
Related papers
- Contrastive Representation Learning for Predicting Solar Flares from Extremely Imbalanced Multivariate Time Series Data [1.024113475677323]
Major solar flares are abrupt surges in the Sun's magnetic flux, presenting significant risks to technological infrastructure.
In this paper, we introduce CONTREX, a novel contrastive representation learning approach for multivariate time series data.
Our approach shows promising solar flare prediction results on the Space Weather Analytics for Solar Flares (SWAN-SF) multivariate time series benchmark dataset.
arXiv Detail & Related papers (2024-10-01T01:20:47Z) - Enhancing Multivariate Time Series-based Solar Flare Prediction with Multifaceted Preprocessing and Contrastive Learning [0.9374652839580181]
Accurate solar flare prediction is crucial due to the significant risks that intense solar flares pose to astronauts, space equipment, and satellite communication systems.
Our research enhances solar flare prediction by utilizing advanced data preprocessing and classification methods.
arXiv Detail & Related papers (2024-09-21T05:00:34Z) - Towards Hybrid Embedded Feature Selection and Classification Approach with Slim-TSF [0.0]
This study aims to uncover hidden relationships and the evolutionary characteristics of solar flares and their source regions.
Preliminary findings indicate a notable improvement, with an average increase of 5% in both the True Skill Statistic (TSS) and Heidke Skill Score (HSS)
arXiv Detail & Related papers (2024-09-06T18:12:05Z) - Huge Ensembles Part II: Properties of a Huge Ensemble of Hindcasts Generated with Spherical Fourier Neural Operators [1.8986950987113536]
In Part I, we created an ensemble based on Spherical Neural Operators.
In Part II, we generate a huge ensemble (HENS) with 7,424 members each day of summer 2023.
HENS precisely samples the tails of the forecast distribution and presents a detailed sampling of internal variability.
arXiv Detail & Related papers (2024-08-02T21:31:34Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Observation-Guided Meteorological Field Downscaling at Station Scale: A
Benchmark and a New Method [66.80344502790231]
We extend meteorological downscaling to arbitrary scattered station scales and establish a new benchmark and dataset.
Inspired by data assimilation techniques, we integrate observational data into the downscaling process, providing multi-scale observational priors.
Our proposed method outperforms other specially designed baseline models on multiple surface variables.
arXiv Detail & Related papers (2024-01-22T14:02:56Z) - Embedded feature selection in LSTM networks with multi-objective
evolutionary ensemble learning for time series forecasting [49.1574468325115]
We present a novel feature selection method embedded in Long Short-Term Memory networks.
Our approach optimize the weights and biases of the LSTM in a partitioned manner.
Experimental evaluations on air quality time series data from Italy and southeast Spain demonstrate that our method substantially improves the ability generalization of conventional LSTMs.
arXiv Detail & Related papers (2023-12-29T08:42:10Z) - TACTiS: Transformer-Attentional Copulas for Time Series [76.71406465526454]
estimation of time-varying quantities is a fundamental component of decision making in fields such as healthcare and finance.
We propose a versatile method that estimates joint distributions using an attention-based decoder.
We show that our model produces state-of-the-art predictions on several real-world datasets.
arXiv Detail & Related papers (2022-02-07T21:37:29Z) - How to Train Your Flare Prediction Model: Revisiting Robust Sampling of
Rare Events [0.9851812512860351]
We present a case study of solar flare forecasting by means of metadata feature time series, by treating it as a prominent class-imbalance and temporally coherent problem.
We showcase the general concept of temporal coherence triggered by the demand of continuity in time series forecasting and show that lack of proper understanding of this effect may spuriously enhance models' performance.
We revisit the main remedies for these challenges and present several experiments to illustrate the exact impact that each of these remedies may have on performance.
arXiv Detail & Related papers (2021-03-12T21:37:08Z) - Learning summary features of time series for likelihood free inference [93.08098361687722]
We present a data-driven strategy for automatically learning summary features from time series data.
Our results indicate that learning summary features from data can compete and even outperform LFI methods based on hand-crafted values.
arXiv Detail & Related papers (2020-12-04T19:21:37Z) - Supervised Feature Subset Selection and Feature Ranking for Multivariate
Time Series without Feature Extraction [78.84356269545157]
We introduce supervised feature ranking and feature subset selection algorithms for MTS classification.
Unlike most existing supervised/unsupervised feature selection algorithms for MTS our techniques do not require a feature extraction step to generate a one-dimensional feature vector from the time series.
arXiv Detail & Related papers (2020-05-01T07:46:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.