SolARED: Solar Active Region Emergence Dataset for Machine Learning Aided Predictions
- URL: http://arxiv.org/abs/2601.13145v1
- Date: Mon, 19 Jan 2026 15:25:18 GMT
- Title: SolARED: Solar Active Region Emergence Dataset for Machine Learning Aided Predictions
- Authors: Spiridon Kasapis, Eren Dogan, Irina N. Kitiashvili, Alexander G. Kosovichev, John T. Stefan, Jake D. Butler, Jonas Tirona, Sarang Patil, Mengjia Xu,
- Abstract summary: Solar Active Region Emergence dataset (SolARED) is derived from full-disk maps of the Doppler velocity, magnetic field, and continuum intensity, obtained by the Helioseismic and Magnetic Imager (HMI) onboard the Solar Dynamics Observatory (SDO)<n>SolARED includes time series of remapped, tracked, and binned data that characterize the evolution of acoustic power of solar oscillations, unsigned magnetic flux, and intensity for 50 large ARs before, during, and after their emergence on the solar surface, as well as surrounding areas observed on the solar disc between 2010 and 2023.
- Score: 33.14810385896251
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The development of accurate forecasts of solar eruptive activity has become increasingly important for preventing potential impacts on space technologies and exploration. Therefore, it is crucial to detect Active Regions (ARs) before they start forming on the solar surface. This will enable the development of early-warning capabilities for upcoming space weather disturbances. For this reason, we prepared the Solar Active Region Emergence Dataset (SolARED). The dataset is derived from full-disk maps of the Doppler velocity, magnetic field, and continuum intensity, obtained by the Helioseismic and Magnetic Imager (HMI) onboard the Solar Dynamics Observatory (SDO). SolARED includes time series of remapped, tracked, and binned data that characterize the evolution of acoustic power of solar oscillations, unsigned magnetic flux, and continuum intensity for 50 large ARs before, during, and after their emergence on the solar surface, as well as surrounding areas observed on the solar disc between 2010 and 2023. The resulting ML-ready SolARED dataset is designed to support enhancements of predictive capabilities, enabling the development of operational forecasts for the emergence of active regions. The SolARED dataset is available at https://sun.njit.edu/sarportal/, through an interactive visualization web application.
Related papers
- Ultra-short-term solar power forecasting by deep learning and data reconstruction [60.200987006598524]
We propose a deep-learning based ultra-short-term solar power prediction with data reconstruction.<n>We employ deep-learning models to capture long- and short-term dependencies towards the target prediction period.
arXiv Detail & Related papers (2025-09-21T14:22:35Z) - SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction [2.288747975391298]
This paper introduces a high resolution, machine learning-ready heliophysics dataset derived from NASA's Solar Dynamics Observatory (SDO)<n>The dataset includes processed imagery from the Atmospheric Imaging Assembly (AIA) and Helioseismic and Magnetic Imager (HMI)<n>To ensure suitability for ML tasks, the data has been preprocessed, including correction of spacecraft roll angles, orbital adjustments, exposure normalization, and degradation compensation.
arXiv Detail & Related papers (2025-08-18T00:05:01Z) - An Interpretable Machine Learning Approach to Understanding the Relationships between Solar Flares and Source Active Regions [4.318993143281039]
This paper employs the random forest (RF) model to address the binary classification task.<n>It analyzes the links between solar flares and their originating ARs with observational data gathered from 2011 to 2021 by SolarMonitor.org and the XRT flare database.<n>We find that the features of AR_Type_Today, Hale_Class_Yesterday are the most and the least prepotent features, respectively.
arXiv Detail & Related papers (2025-02-20T22:00:39Z) - Solar Active Regions Emergence Prediction Using Long Short-Term Memory
Networks [44.99833362998488]
We develop Long Short-Term Memory (LSTM) models to predict the formation of active regions (ARs) on the solar surface.
Time-series datasets of acoustic power and magnetic flux are used to train LSTM models on predicting continuum intensity, 12 hours in advance.
These novel machine learning (ML) models are able to capture variations of the acoustic power density associated with upcoming magnetic flux emergence and continuum intensity decrease.
arXiv Detail & Related papers (2024-09-25T23:09:46Z) - Extreme Solar Flare Prediction Using Residual Networks with HMI Magnetograms and Intensitygrams [0.0]
We present a novel approach for predicting extreme solar flares using HMI intensitygrams and magnetograms.
By detecting sunspots from intensitygrams and extracting magnetic field patches from magnetograms, we train a Residual Network (ResNet) to classify extreme class flares.
Our model demonstrates high accuracy, offering a robust tool for predicting extreme solar flares and improving space weather forecasting.
arXiv Detail & Related papers (2024-05-23T16:17:16Z) - Forecasting SEP Events During Solar Cycles 23 and 24 Using Interpretable
Machine Learning [38.321248253111776]
We employ a suite of machine learning strategies to evaluate the predictive potential of a new data product for a forecast of post-solar flare SEP events.
Despite the augmented volume of data, the prediction accuracy reaches 0.7 +- 0.1, which aligns with but does not exceed these published benchmarks.
arXiv Detail & Related papers (2024-03-04T23:12:17Z) - Predicting the Emergence of Solar Active Regions Using Machine Learning [41.94295877935867]
To create early warning capabilities for upcoming Space Weather disturbances, we have selected a dataset of 61 emerging active regions.
We have utilized Doppler shift and continuum intensity observations from the Helioseismic and Magnetic Imager (HMI) onboard the Solar Dynamics Observatory (SDO)
We have developed a machine learning model to capture the acoustic power flux density variations associated with upcoming magnetic flux emergence.
arXiv Detail & Related papers (2024-02-14T01:41:50Z) - Improving day-ahead Solar Irradiance Time Series Forecasting by
Leveraging Spatio-Temporal Context [46.72071291175356]
Solar power harbors immense potential in mitigating climate change by substantially reducing CO$_2$ emissions.
However, the inherent variability of solar irradiance poses a significant challenge for seamlessly integrating solar power into the electrical grid.
In this paper, we put forth a deep learning architecture designed to harnesstemporal context using satellite data.
arXiv Detail & Related papers (2023-06-01T19:54:39Z) - Solar Active Region Magnetogram Image Dataset for Studies of Space
Weather [0.0]
The dataset incorporates data from three sources and provides SDO Helioseismic and Magnetic Imager (HMI) magnetograms of solar active regions.
This dataset will be useful for image analysis or solar physics research related to magnetic structure, its evolution over time, and its relation to solar flares.
This dataset is a minimally processed, user dataset of consistently sized images of solar active regions that can serve as a benchmark dataset for solar flare prediction research.
arXiv Detail & Related papers (2023-05-16T14:44:24Z) - A Comparative Study on Generative Models for High Resolution Solar
Observation Imaging [59.372588316558826]
This work investigates capabilities of current state-of-the-art generative models to accurately capture the data distribution behind observed solar activity states.
Using distributed training on supercomputers, we are able to train generative models for up to 1024x1024 resolution that produce high quality samples indistinguishable to human experts.
arXiv Detail & Related papers (2023-04-14T14:40:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.