SolarGPT-QA: A Domain-Adaptive Large Language Model for Educational Question Answering in Space Weather and Heliophysics
- URL: http://arxiv.org/abs/2601.12131v1
- Date: Sat, 17 Jan 2026 18:18:11 GMT
- Title: SolarGPT-QA: A Domain-Adaptive Large Language Model for Educational Question Answering in Space Weather and Heliophysics
- Authors: Santosh Chapagain, MohammadReza EskandariNasab, Onur Vural, Shah Muhammad Hamdi, Soukaina Filali Boubrahimi,
- Abstract summary: Solar activity, including solar flares, coronal mass ejections (CMEs), and geomagnetic storms, can significantly impact satellites, aviation, power grids, data centers, and space missions.<n>We introduce SolarGPT-QA, a question answering system based on a domain-adapted large language model built on the LLaMA-3 base model.<n>Human pairwise evaluations show that SolarGPT-QA outperforms general-purpose models in zero-shot settings.
- Score: 0.2770822269241973
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Solar activity, including solar flares, coronal mass ejections (CMEs), and geomagnetic storms, can significantly impact satellites, aviation, power grids, data centers, and space missions. Extreme solar events can cause substantial economic damage if not predicted in advance, highlighting the importance of accurate forecasting and effective education in space science. Although large language models (LLMs) perform well on general tasks, they often lack domain-specific knowledge and pedagogical capability to clearly explain complex space science concepts. We introduce SolarGPT-QA, a question answering system based on a domain-adapted large language model built on the LLaMA-3 base model. The model is trained using scientific literature and large-scale question-answer data generated with GPT-4 and refined using Grok-3 in a student-friendly storytelling style. Human pairwise evaluations show that SolarGPT-QA outperforms general-purpose models in zero-shot settings and achieves competitive performance compared to instruction-tuned models for educational explanations in space weather and heliophysics. A small pilot student comprehension study further suggests improved clarity and accessibility of the generated explanations. Ablation experiments indicate that combining domain-adaptive pretraining with pedagogical fine-tuning is important for balancing scientific accuracy and educational effectiveness. This work represents an initial step toward a broader SolarGPT framework for space science education and forecasting.
Related papers
- ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning [118.46980291324148]
ATLAS is a large-scale, high-difficulty, and cross-disciplinary evaluation suite composed of approximately 800 original problems.<n>Its key features include: High Originality and Contamination Resistance, with all questions newly created or substantially adapted to prevent test data leakage.<n>Preliminary results on leading models demonstrate ATLAS's effectiveness in differentiating their advanced scientific reasoning capabilities.
arXiv Detail & Related papers (2025-11-18T11:13:06Z) - CORONA-Fields: Leveraging Foundation Models for Classification of Solar Wind Phenomena [0.0]
Major space weather contributors are the solar wind and coronal mass ejections.<n>We adapt a foundation model for solar physics to create embeddings suitable for solar wind structure analysis.<n>As a first proof-of-concept, it lays the groundwork for future improvements toward more reliable space weather predictions.
arXiv Detail & Related papers (2025-11-13T00:56:00Z) - Ultra-short-term solar power forecasting by deep learning and data reconstruction [60.200987006598524]
We propose a deep-learning based ultra-short-term solar power prediction with data reconstruction.<n>We employ deep-learning models to capture long- and short-term dependencies towards the target prediction period.
arXiv Detail & Related papers (2025-09-21T14:22:35Z) - Surya: Foundation Model for Heliophysics [3.573991705302358]
We introduce Surya, a 366M parameter foundation model for heliophysics designed to learn general-purpose solar representations.<n>We show its ability to forecast solar dynamics and flare events, while downstream fine-tuning with parameter-efficient Low-temporal AdaptationRank (LoRA) shows strong performance.<n>Its novel architecture and performance suggest that the model is able to learn the underlying physics behind solar evolution.
arXiv Detail & Related papers (2025-08-18T05:44:25Z) - Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution [0.0]
We introduce a novel method to predict the evolution of the solar line-of-sight (LoS) magnetogram using image-to-image translation with Denoising Diffusion Probabilistic Models (DDPMs)
Our approach combines "computer science metrics" for image quality and "physics metrics" for physical accuracy to evaluate model performance.
The results indicate that DDPMs are effective in maintaining the structural integrity, the dynamic range of solar magnetic fields, the magnetic flux and other physical features such as the size of the active regions, surpassing traditional persistence models, also in flaring situation.
arXiv Detail & Related papers (2024-07-16T12:28:10Z) - SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models [70.01883340129204]
spatial reasoning is a crucial component of both biological and artificial intelligence.
We present a comprehensive study of the capability of current state-of-the-art large language models (LLMs) on spatial reasoning.
arXiv Detail & Related papers (2024-06-07T01:06:34Z) - A Foundation Model for the Earth System [82.73624748093333]
We introduce Aurora, a large-scale foundation model for the Earth system trained on over a million hours of diverse data.
Aurora outperforms operational forecasts for air quality, ocean waves, tropical cyclone tracks, and high-resolution weather forecasting at orders of magnitude smaller computational expense than dedicated existing systems.
arXiv Detail & Related papers (2024-05-20T14:45:18Z) - FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems [56.0640340392818]
We introduce a framework, FREE, that enables the use of varying features and available information to train a universal model.<n>The core idea is to map available environmental data into a text space and then convert the traditional predictive modeling task in environmental science to a semantic recognition problem.<n>Our evaluation on two societally important real-world applications, stream water temperature prediction and crop yield prediction, demonstrates the superiority of FREE over multiple baselines.
arXiv Detail & Related papers (2023-11-17T00:53:09Z) - Harnessing the Power of Adversarial Prompting and Large Language Models
for Robust Hypothesis Generation in Astronomy [0.0]
We employ in-context prompting, supplying the model with up to 1000 papers from the NASA Astrophysics Data System.
Our findings point towards a substantial boost in hypothesis generation when using in-context prompting.
We illustrate how adversarial prompting empowers GPT-4 to extract essential details from a vast knowledge base to produce meaningful hypotheses.
arXiv Detail & Related papers (2023-06-20T16:16:56Z) - Improving day-ahead Solar Irradiance Time Series Forecasting by
Leveraging Spatio-Temporal Context [46.72071291175356]
Solar power harbors immense potential in mitigating climate change by substantially reducing CO$_2$ emissions.
However, the inherent variability of solar irradiance poses a significant challenge for seamlessly integrating solar power into the electrical grid.
In this paper, we put forth a deep learning architecture designed to harnesstemporal context using satellite data.
arXiv Detail & Related papers (2023-06-01T19:54:39Z) - A Comparative Study on Generative Models for High Resolution Solar
Observation Imaging [59.372588316558826]
This work investigates capabilities of current state-of-the-art generative models to accurately capture the data distribution behind observed solar activity states.
Using distributed training on supercomputers, we are able to train generative models for up to 1024x1024 resolution that produce high quality samples indistinguishable to human experts.
arXiv Detail & Related papers (2023-04-14T14:40:32Z) - Computational Solar Energy -- Ensemble Learning Methods for Prediction
of Solar Power Generation based on Meteorological Parameters in Eastern India [0.0]
It is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location.
In this paper, the impact of weather parameters on solar PV power generation is estimated by several Ensemble ML (EML) models like Bagging, Boosting, Stacking, and Voting.
The results demonstrate greater prediction accuracy of around 96% for Stacking and Voting models.
arXiv Detail & Related papers (2023-01-21T19:16:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.