Towards Coupling Full-disk and Active Region-based Flare Prediction for
Operational Space Weather Forecasting
- URL: http://arxiv.org/abs/2209.07406v1
- Date: Thu, 11 Aug 2022 22:34:44 GMT
- Title: Towards Coupling Full-disk and Active Region-based Flare Prediction for
Operational Space Weather Forecasting
- Authors: Chetraj Pandey, Anli Ji, Rafal A. Angryk, Manolis K. Georgoulis and
Berkay Aydin
- Abstract summary: We present new approaches to train and deploy an operational solar flare prediction system for $geq$M1.0-class flares.
In full-disk mode, predictions are performed on full-disk line-of-sight magnetograms using deep learning models.
In active region-based models, predictions are issued for each active region individually.
- Score: 0.5872014229110215
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Solar flare prediction is a central problem in space weather forecasting and
has captivated the attention of a wide spectrum of researchers due to recent
advances in both remote sensing as well as machine learning and deep learning
approaches. The experimental findings based on both machine and deep learning
models reveal significant performance improvements for task specific datasets.
Along with building models, the practice of deploying such models to production
environments under operational settings is a more complex and often
time-consuming process which is often not addressed directly in research
settings. We present a set of new heuristic approaches to train and deploy an
operational solar flare prediction system for $\geq$M1.0-class flares with two
prediction modes: full-disk and active region-based. In full-disk mode,
predictions are performed on full-disk line-of-sight magnetograms using deep
learning models whereas in active region-based models, predictions are issued
for each active region individually using multivariate time series data
instances. The outputs from individual active region forecasts and full-disk
predictors are combined to a final full-disk prediction result with a
meta-model. We utilized an equal weighted average ensemble of two base
learners' flare probabilities as our baseline meta learner and improved the
capabilities of our two base learners by training a logistic regression model.
The major findings of this study are: (i) We successfully coupled two
heterogeneous flare prediction models trained with different datasets and model
architecture to predict a full-disk flare probability for next 24 hours, (ii)
Our proposed ensembling model, i.e., logistic regression, improves on the
predictive performance of two base learners and the baseline meta learner
measured in terms of two widely used metrics True Skill Statistic (TSS) and
Heidke Skill core (HSS), and (iii) Our result analysis suggests that the
logistic regression-based ensemble (Meta-FP) improves on the full-disk model
(base learner) by $\sim9\%$ in terms TSS and $\sim10\%$ in terms of HSS.
Similarly, it improves on the AR-based model (base learner) by $\sim17\%$ and
$\sim20\%$ in terms of TSS and HSS respectively. Finally, when compared to the
baseline meta model, it improves on TSS by $\sim10\%$ and HSS by $\sim15\%$.
Related papers
- On conditional diffusion models for PDE simulations [53.01911265639582]
We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
arXiv Detail & Related papers (2024-10-21T18:31:04Z) - Uncertainty Aware Learning for Language Model Alignment [97.36361196793929]
We propose uncertainty-aware learning (UAL) to improve the model alignment of different task scenarios.
We implement UAL in a simple fashion -- adaptively setting the label smoothing value of training according to the uncertainty of individual samples.
Experiments on widely used benchmarks demonstrate that our UAL significantly and consistently outperforms standard supervised fine-tuning.
arXiv Detail & Related papers (2024-06-07T11:37:45Z) - Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling [0.0]
Time-series modeling in process industries faces the challenge of dealing with complex, multi-faceted, and evolving data characteristics.
We introduce the Recency-Weighted Temporally-Segmented (ReWTS) ensemble model, a novel chunk-based approach for multi-step forecasting.
We present a comparative analysis, utilizing two years of data from a wastewater treatment plant and a drinking water treatment plant in Norway.
arXiv Detail & Related papers (2024-03-04T16:00:35Z) - Supervised Contrastive Learning based Dual-Mixer Model for Remaining
Useful Life Prediction [3.081898819471624]
The Remaining Useful Life (RUL) prediction aims at providing an accurate estimate of the remaining time from the current predicting moment to the complete failure of the device.
To overcome the shortcomings of rigid combination for temporal and spatial features in most existing RUL prediction approaches, a spatial-temporal homogeneous feature extractor, named Dual-Mixer model, is proposed.
The effectiveness of the proposed method is validated through comparisons with other latest research works on the C-MAPSS dataset.
arXiv Detail & Related papers (2024-01-29T14:38:44Z) - The Languini Kitchen: Enabling Language Modelling Research at Different
Scales of Compute [66.84421705029624]
We introduce an experimental protocol that enables model comparisons based on equivalent compute, measured in accelerator hours.
We pre-process an existing large, diverse, and high-quality dataset of books that surpasses existing academic benchmarks in quality, diversity, and document length.
This work also provides two baseline models: a feed-forward model derived from the GPT-2 architecture and a recurrent model in the form of a novel LSTM with ten-fold throughput.
arXiv Detail & Related papers (2023-09-20T10:31:17Z) - Towards Interpretable Solar Flare Prediction with Attention-based Deep
Neural Networks [1.1624569521079424]
Solar flare prediction is a central problem in space weather forecasting.
We developed an attention-based deep learning model to perform full-disk binary flare predictions.
Our model can learn conspicuous features corresponding to active regions from full-disk magnetogram images.
arXiv Detail & Related papers (2023-09-08T19:21:10Z) - Explaining Full-disk Deep Learning Model for Solar Flare Prediction
using Attribution Methods [0.6882042556551611]
We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
arXiv Detail & Related papers (2023-07-29T03:18:56Z) - Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting [10.083361616081874]
This study explores an application of machine learning (ML) models as post-processing tools for subseasonal forecasting.
Lagged numerical ensemble forecasts and observational data, including relative humidity, pressure at sea level, and geopotential height, are incorporated into various ML methods.
For regression, quantile regression, and tercile classification tasks, we consider using linear models, random forests, convolutional neural networks, and stacked models.
arXiv Detail & Related papers (2022-11-29T01:11:04Z) - An Empirical Study on Distribution Shift Robustness From the Perspective
of Pre-Training and Data Augmentation [91.62129090006745]
This paper studies the distribution shift problem from the perspective of pre-training and data augmentation.
We provide the first comprehensive empirical study focusing on pre-training and data augmentation.
arXiv Detail & Related papers (2022-05-25T13:04:53Z) - Sparse MoEs meet Efficient Ensembles [49.313497379189315]
We study the interplay of two popular classes of such models: ensembles of neural networks and sparse mixture of experts (sparse MoEs)
We present Efficient Ensemble of Experts (E$3$), a scalable and simple ensemble of sparse MoEs that takes the best of both classes of models, while using up to 45% fewer FLOPs than a deep ensemble.
arXiv Detail & Related papers (2021-10-07T11:58:35Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.