A Hybrid Autoencoder-Transformer Model for Robust Day-Ahead Electricity Price Forecasting under Extreme Conditions
- URL: http://arxiv.org/abs/2511.06898v1
- Date: Mon, 10 Nov 2025 09:47:24 GMT
- Title: A Hybrid Autoencoder-Transformer Model for Robust Day-Ahead Electricity Price Forecasting under Extreme Conditions
- Authors: Boyan Tang, Xuanhao Ren, Peng Xiao, Shunbo Lei, Xiaorong Sun, Jianghua Wu,
- Abstract summary: This paper proposes a novel hybrid deep learning framework that integrates a Distilled Attention Transformer model and an Autoencoder Self-regression Model.<n>Experiments on datasets sampled from California and Shandong Province demonstrate that our framework significantly outperforms state-of-the-art methods in prediction accuracy, robustness, and computational efficiency.
- Score: 2.2360725546624267
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate day-ahead electricity price forecasting (DAEPF) is critical for the efficient operation of power systems, but extreme condition and market anomalies pose significant challenges to existing forecasting methods. To overcome these challenges, this paper proposes a novel hybrid deep learning framework that integrates a Distilled Attention Transformer (DAT) model and an Autoencoder Self-regression Model (ASM). The DAT leverages a self-attention mechanism to dynamically assign higher weights to critical segments of historical data, effectively capturing both long-term trends and short-term fluctuations. Concurrently, the ASM employs unsupervised learning to detect and isolate anomalous patterns induced by extreme conditions, such as heavy rain, heat waves, or human festivals. Experiments on datasets sampled from California and Shandong Province demonstrate that our framework significantly outperforms state-of-the-art methods in prediction accuracy, robustness, and computational efficiency. Our framework thus holds promise for enhancing grid resilience and optimizing market operations in future power systems.
Related papers
- COTN: A Chaotic Oscillatory Transformer Network for Complex Volatile Systems under Extreme Conditions [4.606846373731374]
Accurate prediction of financial and electricity markets, especially under extreme conditions, remains a significant challenge.<n>We propose the Chaoticy Transformer Network (COTN), which combines a Transformer architecture with a novel Lee activation function.<n>COTN incorporates an Autoencoder Self-Regressive (ASR) module to detect and isolate abnormal market patterns.<n>Our approach outperforms state-of-the-art deep learning models like Informer by up to 17% and traditional statistical methods like GARCH by as much as 17%.
arXiv Detail & Related papers (2025-11-09T08:17:19Z) - Adaptive Online Learning with LSTM Networks for Energy Price Prediction [0.5514902789425196]
This study focuses on developing a predictive model to forecast day-ahead electricity prices in the California energy market.<n>The model incorporates a variety of features, including historical price data, weather conditions, and the energy generation mix.<n>The results demonstrate that the custom loss function can improve the model's performance, aligning predicted prices more closely with actual values.
arXiv Detail & Related papers (2025-10-19T15:48:38Z) - Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting [52.6508222408558]
We introduce Elucidated Rolling Diffusion Models (ERDM)<n>ERDM is the first framework to unify a rolling forecast structure with the principled, performant design of Elucidated Diffusion Models (EDM)<n>On 2D Navier-Stokes simulations and ERA5 global weather forecasting at 1.5circ resolution, ERDM consistently outperforms key diffusion-based baselines.
arXiv Detail & Related papers (2025-06-24T21:44:31Z) - Deep learning for predicting hauling fleet production capacity under uncertainties in open pit mines using real and simulated data [0.0]
We propose a deep-learning framework that blends real-world operational records with synthetically generated mechanical-breakdown scenarios.<n>We evaluate two architectures: an XGBoost regressor achieving a median absolute error (MedAE) of 14.3 per cent and a Long Short-Term Memory network with a MedAE of 15.1 per cent.
arXiv Detail & Related papers (2025-06-04T12:12:56Z) - Open-set Anomaly Segmentation in Complex Scenarios [88.11076112792992]
This paper introduces ComsAmy, a benchmark for open-set anomaly segmentation in complex scenarios.<n>ComsAmy encompasses a wide spectrum of adverse weather conditions, dynamic driving environments, and diverse anomaly types.<n>We propose a novel energy-entropy learning (EEL) strategy that integrates the complementary information from energy and entropy.
arXiv Detail & Related papers (2025-04-28T12:00:10Z) - Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions [16.705621552594643]
Existing models often struggle with capturing the complex relationships between target variables and covariates.<n>We propose a novel model architecture that leverages the iTransformer for feature extraction from target variables.<n>A cross-attention mechanism is integrated to fuse the outputs of both models, followed by a Kolmogorov-Arnold network mapping.<n>Results demonstrate that the proposed model effectively capture seasonal variations in PV power generation and improve forecasting accuracy.
arXiv Detail & Related papers (2024-12-03T09:16:13Z) - Anomaly Detection in California Electricity Price Forecasting: Enhancing Accuracy and Reliability Using Principal Component Analysis [0.0]
This study focuses on enhancing electricity price forecasting in California's grid.<n>We analyze CAISO's hourly electricity prices and demand from 2016-2021 to improve day-ahead forecasting accuracy.
arXiv Detail & Related papers (2024-11-25T20:55:25Z) - On conditional diffusion models for PDE simulations [53.01911265639582]
We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
arXiv Detail & Related papers (2024-10-21T18:31:04Z) - ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast.
We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples.
Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z) - Your Autoregressive Generative Model Can be Better If You Treat It as an
Energy-Based One [83.5162421521224]
We propose a unique method termed E-ARM for training autoregressive generative models.
E-ARM takes advantage of a well-designed energy-based learning objective.
We show that E-ARM can be trained efficiently and is capable of alleviating the exposure bias problem.
arXiv Detail & Related papers (2022-06-26T10:58:41Z) - Stochastically forced ensemble dynamic mode decomposition for
forecasting and analysis of near-periodic systems [65.44033635330604]
We introduce a novel load forecasting method in which observed dynamics are modeled as a forced linear system.
We show that its use of intrinsic linear dynamics offers a number of desirable properties in terms of interpretability and parsimony.
Results are presented for a test case using load data from an electrical grid.
arXiv Detail & Related papers (2020-10-08T20:25:52Z) - No MCMC for me: Amortized sampling for fast and stable training of
energy-based models [62.1234885852552]
Energy-Based Models (EBMs) present a flexible and appealing way to represent uncertainty.
We present a simple method for training EBMs at scale using an entropy-regularized generator to amortize the MCMC sampling.
Next, we apply our estimator to the recently proposed Joint Energy Model (JEM), where we match the original performance with faster and stable training.
arXiv Detail & Related papers (2020-10-08T19:17:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.