Scalable Machine Learning Analysis of Parker Solar Probe Solar Wind Data
- URL: http://arxiv.org/abs/2510.21066v1
- Date: Fri, 24 Oct 2025 00:41:39 GMT
- Title: Scalable Machine Learning Analysis of Parker Solar Probe Solar Wind Data
- Authors: Daniela Martin, Connor O'Brien, Valmir P Moraes Filho, Jinsu Hong, Jasmine R. Kobayashi, Evangelia Samara, Joseph Gallego,
- Abstract summary: We present a scalable machine learning framework for analyzing Parker Solar Probe (PSP) solar wind data.<n>The PSP dataset exceeds 150 GB, challenging conventional analysis approaches.<n>We reveal characteristic trends in the inner heliosphere, including increasing solar wind speed with distance from the Sun.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present a scalable machine learning framework for analyzing Parker Solar Probe (PSP) solar wind data using distributed processing and the quantum-inspired Kernel Density Matrices (KDM) method. The PSP dataset (2018--2024) exceeds 150 GB, challenging conventional analysis approaches. Our framework leverages Dask for large-scale statistical computations and KDM to estimate univariate and bivariate distributions of key solar wind parameters, including solar wind speed, proton density, and proton thermal speed, as well as anomaly thresholds for each parameter. We reveal characteristic trends in the inner heliosphere, including increasing solar wind speed with distance from the Sun, decreasing proton density, and the inverse relationship between speed and density. Solar wind structures play a critical role in enhancing and mediating extreme space weather phenomena and can trigger geomagnetic storms; our analyses provide quantitative insights into these processes. This approach offers a tractable, interpretable, and distributed methodology for exploring complex physical datasets and facilitates reproducible analysis of large-scale in situ measurements. Processed data products and analysis tools are made publicly available to advance future studies of solar wind dynamics and space weather forecasting. The code and configuration files used in this study are publicly available to support reproducibility.
Related papers
- Ultra-short-term solar power forecasting by deep learning and data reconstruction [60.200987006598524]
We propose a deep-learning based ultra-short-term solar power prediction with data reconstruction.<n>We employ deep-learning models to capture long- and short-term dependencies towards the target prediction period.
arXiv Detail & Related papers (2025-09-21T14:22:35Z) - SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction [2.288747975391298]
This paper introduces a high resolution, machine learning-ready heliophysics dataset derived from NASA's Solar Dynamics Observatory (SDO)<n>The dataset includes processed imagery from the Atmospheric Imaging Assembly (AIA) and Helioseismic and Magnetic Imager (HMI)<n>To ensure suitability for ML tasks, the data has been preprocessed, including correction of spacecraft roll angles, orbital adjustments, exposure normalization, and degradation compensation.
arXiv Detail & Related papers (2025-08-18T00:05:01Z) - Solar Flare Forecast: A Comparative Analysis of Machine Learning Algorithms for Solar Flare Class Prediction [0.0]
Solar flares are among the most powerful and dynamic events in the solar system, resulting from the sudden release of magnetic energy stored in the Sun's atmosphere.<n>This study evaluates the predictive performance of three machine learning algorithms for classifying solar flares into 4 categories.
arXiv Detail & Related papers (2025-05-06T10:08:41Z) - Solar synthetic imaging: Introducing denoising diffusion probabilistic models on SDO/AIA data [0.0]
This study proposes using generative deep learning models, specifically a Denoising Diffusion Probabilistic Model (DDPM), to create synthetic images of solar phenomena.
By employing a dataset from the AIA instrument aboard the SDO spacecraft, we aim to address the data scarcity issue.
The DDPM's performance is evaluated using cluster metrics, Frechet Inception Distance (FID), and F1-score, showcasing promising results in generating realistic solar imagery.
arXiv Detail & Related papers (2024-04-03T08:18:45Z) - Observation-Guided Meteorological Field Downscaling at Station Scale: A
Benchmark and a New Method [66.80344502790231]
We extend meteorological downscaling to arbitrary scattered station scales and establish a new benchmark and dataset.
Inspired by data assimilation techniques, we integrate observational data into the downscaling process, providing multi-scale observational priors.
Our proposed method outperforms other specially designed baseline models on multiple surface variables.
arXiv Detail & Related papers (2024-01-22T14:02:56Z) - Improving day-ahead Solar Irradiance Time Series Forecasting by
Leveraging Spatio-Temporal Context [46.72071291175356]
Solar power harbors immense potential in mitigating climate change by substantially reducing CO$_2$ emissions.
However, the inherent variability of solar irradiance poses a significant challenge for seamlessly integrating solar power into the electrical grid.
In this paper, we put forth a deep learning architecture designed to harnesstemporal context using satellite data.
arXiv Detail & Related papers (2023-06-01T19:54:39Z) - A Comparative Study on Generative Models for High Resolution Solar
Observation Imaging [59.372588316558826]
This work investigates capabilities of current state-of-the-art generative models to accurately capture the data distribution behind observed solar activity states.
Using distributed training on supercomputers, we are able to train generative models for up to 1024x1024 resolution that produce high quality samples indistinguishable to human experts.
arXiv Detail & Related papers (2023-04-14T14:40:32Z) - Learning-based estimation of in-situ wind speed from underwater
acoustics [58.293528982012255]
We introduce a deep learning approach for the retrieval of wind speed time series from underwater acoustics.
Our approach bridges data assimilation and learning-based frameworks to benefit both from prior physical knowledge and computational efficiency.
arXiv Detail & Related papers (2022-08-18T15:27:40Z) - Energy Aware Deep Reinforcement Learning Scheduling for Sensors
Correlated in Time and Space [62.39318039798564]
We propose a scheduling mechanism capable of taking advantage of correlated information.
The proposed mechanism is capable of determining the frequency with which sensors should transmit their updates.
We show that our solution can significantly extend the sensors' lifetime.
arXiv Detail & Related papers (2020-11-19T09:53:27Z) - Short term solar energy prediction by machine learning algorithms [0.47791962198275073]
We report daily prediction of solar energy by exploiting the strength of machine learning techniques.
Forecast models of base line regressors including linear, ridge, lasso, decision tree, random forest and artificial neural networks have been implemented.
It has been observed that improved accuracy is achieved through random forest and ridge regressor for both grid sizes.
arXiv Detail & Related papers (2020-10-25T17:56:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.