Towards Coupling Full-disk and Active Region-based Flare Prediction for
  Operational Space Weather Forecasting
        - URL: http://arxiv.org/abs/2209.07406v1
- Date: Thu, 11 Aug 2022 22:34:44 GMT
- Title: Towards Coupling Full-disk and Active Region-based Flare Prediction for
  Operational Space Weather Forecasting
- Authors: Chetraj Pandey, Anli Ji, Rafal A. Angryk, Manolis K. Georgoulis and
  Berkay Aydin
- Abstract summary: We present new approaches to train and deploy an operational solar flare prediction system for $geq$M1.0-class flares.
In full-disk mode, predictions are performed on full-disk line-of-sight magnetograms using deep learning models.
In active region-based models, predictions are issued for each active region individually.
- Score: 0.5872014229110215
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Solar flare prediction is a central problem in space weather forecasting and
has captivated the attention of a wide spectrum of researchers due to recent
advances in both remote sensing as well as machine learning and deep learning
approaches. The experimental findings based on both machine and deep learning
models reveal significant performance improvements for task specific datasets.
Along with building models, the practice of deploying such models to production
environments under operational settings is a more complex and often
time-consuming process which is often not addressed directly in research
settings. We present a set of new heuristic approaches to train and deploy an
operational solar flare prediction system for $\geq$M1.0-class flares with two
prediction modes: full-disk and active region-based. In full-disk mode,
predictions are performed on full-disk line-of-sight magnetograms using deep
learning models whereas in active region-based models, predictions are issued
for each active region individually using multivariate time series data
instances. The outputs from individual active region forecasts and full-disk
predictors are combined to a final full-disk prediction result with a
meta-model. We utilized an equal weighted average ensemble of two base
learners' flare probabilities as our baseline meta learner and improved the
capabilities of our two base learners by training a logistic regression model.
The major findings of this study are: (i) We successfully coupled two
heterogeneous flare prediction models trained with different datasets and model
architecture to predict a full-disk flare probability for next 24 hours, (ii)
Our proposed ensembling model, i.e., logistic regression, improves on the
predictive performance of two base learners and the baseline meta learner
measured in terms of two widely used metrics True Skill Statistic (TSS) and
Heidke Skill core (HSS), and (iii) Our result analysis suggests that the
logistic regression-based ensemble (Meta-FP) improves on the full-disk model
(base learner) by $\sim9\%$ in terms TSS and $\sim10\%$ in terms of HSS.
Similarly, it improves on the AR-based model (base learner) by $\sim17\%$ and
$\sim20\%$ in terms of TSS and HSS respectively. Finally, when compared to the
baseline meta model, it improves on TSS by $\sim10\%$ and HSS by $\sim15\%$.
 
      
        Related papers
        - Intention-Conditioned Flow Occupancy Models [69.79049994662591]
 Large-scale pre-training has fundamentally changed how machine learning research is done today.<n>Applying this same framework to reinforcement learning is appealing because it offers compelling avenues for addressing core challenges in RL.<n>Recent advances in generative AI have provided new tools for modeling highly complex distributions.
 arXiv  Detail & Related papers  (2025-06-10T15:27:46Z)
- Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge   Retention in Language Model Pre-Training [51.41246396610475]
 This paper aims to predict performance in closed-book question answering (QA) without the help of external tools.<n>We conduct large-scale retrieval and semantic analysis across the pre-training corpora of 21 publicly available and 3 custom-trained large language models.<n>Building on these foundations, we propose Size-dependent Mutual Information (SMI), an information-theoretic metric that linearly correlates pre-training data characteristics.
 arXiv  Detail & Related papers  (2025-02-06T13:23:53Z)
- SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape   Estimation [81.36747103102459]
 Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion capture with numerous applications.
Current state-of-the-art methods focus on training innovative architectural designs on confined datasets.
We investigate the impact of scaling up EHPS towards a family of generalist foundation models.
 arXiv  Detail & Related papers  (2025-01-16T18:59:46Z)
- What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? [83.83230167222852]
 We find that a model's generalization behavior can be effectively characterized by a training metric we call pre-memorization train accuracy.
By connecting a model's learning behavior to its generalization, pre-memorization train accuracy can guide targeted improvements to training strategies.
 arXiv  Detail & Related papers  (2024-11-12T09:52:40Z)
- On conditional diffusion models for PDE simulations [53.01911265639582]
 We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
 arXiv  Detail & Related papers  (2024-10-21T18:31:04Z)
- Recency-Weighted Temporally-Segmented Ensemble for Time-Series Modeling [0.0]
 Time-series modeling in process industries faces the challenge of dealing with complex, multi-faceted, and evolving data characteristics.
We introduce the Recency-Weighted Temporally-Segmented (ReWTS) ensemble model, a novel chunk-based approach for multi-step forecasting.
We present a comparative analysis, utilizing two years of data from a wastewater treatment plant and a drinking water treatment plant in Norway.
 arXiv  Detail & Related papers  (2024-03-04T16:00:35Z)
- Supervised Contrastive Learning based Dual-Mixer Model for Remaining
  Useful Life Prediction [3.081898819471624]
 The Remaining Useful Life (RUL) prediction aims at providing an accurate estimate of the remaining time from the current predicting moment to the complete failure of the device.
To overcome the shortcomings of rigid combination for temporal and spatial features in most existing RUL prediction approaches, a spatial-temporal homogeneous feature extractor, named Dual-Mixer model, is proposed.
The effectiveness of the proposed method is validated through comparisons with other latest research works on the C-MAPSS dataset.
 arXiv  Detail & Related papers  (2024-01-29T14:38:44Z)
- The Languini Kitchen: Enabling Language Modelling Research at Different
  Scales of Compute [66.84421705029624]
 We introduce an experimental protocol that enables model comparisons based on equivalent compute, measured in accelerator hours.
We pre-process an existing large, diverse, and high-quality dataset of books that surpasses existing academic benchmarks in quality, diversity, and document length.
This work also provides two baseline models: a feed-forward model derived from the GPT-2 architecture and a recurrent model in the form of a novel LSTM with ten-fold throughput.
 arXiv  Detail & Related papers  (2023-09-20T10:31:17Z)
- Towards Interpretable Solar Flare Prediction with Attention-based Deep
  Neural Networks [1.1624569521079424]
 Solar flare prediction is a central problem in space weather forecasting.
We developed an attention-based deep learning model to perform full-disk binary flare predictions.
Our model can learn conspicuous features corresponding to active regions from full-disk magnetogram images.
 arXiv  Detail & Related papers  (2023-09-08T19:21:10Z)
- Explaining Full-disk Deep Learning Model for Solar Flare Prediction
  using Attribution Methods [0.6882042556551611]
 We present a solar flare prediction model, which is trained using hourly full-disk line-of-sight magnetogram images.
We evaluate the overall performance of our model using the true skill statistic (TSS) and Heidke skill score (HSS)
Our analysis revealed that full-disk prediction of solar flares aligns with characteristics related to active regions (ARs)
 arXiv  Detail & Related papers  (2023-07-29T03:18:56Z)
- Beyond Ensemble Averages: Leveraging Climate Model Ensembles for   Subseasonal Forecasting [10.083361616081874]
 This study explores an application of machine learning (ML) models as post-processing tools for subseasonal forecasting.
Lagged numerical ensemble forecasts and observational data, including relative humidity, pressure at sea level, and geopotential height, are incorporated into various ML methods.
For regression, quantile regression, and tercile classification tasks, we consider using linear models, random forests, convolutional neural networks, and stacked models.
 arXiv  Detail & Related papers  (2022-11-29T01:11:04Z)
- An Empirical Study on Distribution Shift Robustness From the Perspective
  of Pre-Training and Data Augmentation [91.62129090006745]
 This paper studies the distribution shift problem from the perspective of pre-training and data augmentation.
We provide the first comprehensive empirical study focusing on pre-training and data augmentation.
 arXiv  Detail & Related papers  (2022-05-25T13:04:53Z)
- Sparse MoEs meet Efficient Ensembles [49.313497379189315]
 We study the interplay of two popular classes of such models: ensembles of neural networks and sparse mixture of experts (sparse MoEs)
We present Efficient Ensemble of Experts (E$3$), a scalable and simple ensemble of sparse MoEs that takes the best of both classes of models, while using up to 45% fewer FLOPs than a deep ensemble.
 arXiv  Detail & Related papers  (2021-10-07T11:58:35Z)
- Back2Future: Leveraging Backfill Dynamics for Improving Real-time
  Predictions in Future [73.03458424369657]
 In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
 arXiv  Detail & Related papers  (2021-06-08T14:48:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.