EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models
- URL: http://arxiv.org/abs/2407.13538v1
- Date: Thu, 18 Jul 2024 14:10:50 GMT
- Title: EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models
- Authors: Nan Lin, Peter Palensky, Pedro P. Vergara,
- Abstract summary: High-resolution time series data are crucial for operation and planning in energy systems.
Due to data collection costs and privacy concerns, such data is often unavailable or insufficient for downstream tasks.
We propose EnergyDiff, a universal data generation framework for energy time series data.
- Score: 2.677325229270716
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: High-resolution time series data are crucial for operation and planning in energy systems such as electrical power systems and heating systems. However, due to data collection costs and privacy concerns, such data is often unavailable or insufficient for downstream tasks. Data synthesis is a potential solution for this data scarcity. With the recent development of generative AI, we propose EnergyDiff, a universal data generation framework for energy time series data. EnergyDiff builds on state-of-the-art denoising diffusion probabilistic models, utilizing a proposed denoising network dedicated to high-resolution time series data and introducing a novel Marginal Calibration technique. Our extensive experimental results demonstrate that EnergyDiff achieves significant improvement in capturing temporal dependencies and marginal distributions compared to baselines, particularly at the 1-minute resolution. Additionally, EnergyDiff consistently generates high-quality time series data across diverse energy domains, time resolutions, and at both customer and transformer levels with reduced computational need.
Related papers
- Tackling Data Heterogeneity in Federated Time Series Forecasting [61.021413959988216]
Time series forecasting plays a critical role in various real-world applications, including energy consumption prediction, disease transmission monitoring, and weather forecasting.
Most existing methods rely on a centralized training paradigm, where large amounts of data are collected from distributed devices to a central cloud server.
We propose a novel framework, Fed-TREND, to address data heterogeneity by generating informative synthetic data as auxiliary knowledge carriers.
arXiv Detail & Related papers (2024-11-24T04:56:45Z) - Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models [69.06149482021071]
We propose a novel EHR data generation model called EHRPD.
It is a diffusion-based model designed to predict the next visit based on the current one while also incorporating time interval estimation.
We conduct experiments on two public datasets and evaluate EHRPD from fidelity, privacy, and utility perspectives.
arXiv Detail & Related papers (2024-06-20T02:20:23Z) - A Gap in Time: The Challenge of Processing Heterogeneous IoT Data in Digitalized Buildings [15.525789412274587]
This study investigates the diverse dimensions of IoT data heterogeneity in both intra-building and inter-building contexts.
The results emphasize the critical need for multi-modal data integration, domain-informed modeling, and automated data engineering pipelines.
arXiv Detail & Related papers (2024-05-23T07:45:48Z) - Creating synthetic energy meter data using conditional diffusion and building metadata [0.0]
The study proposes a conditional diffusion model for generating high-quality synthetic energy data using relevant metadata.
Using a dataset comprising 1,828 power meters from various buildings and countries, this model is compared with traditional methods.
Results demonstrate the proposed diffusion model's superior performance, with a 36% reduction in Frechet Inception Distance (FID) score and a 13% decrease in Kullback-Leibler divergence (KL divergence)
arXiv Detail & Related papers (2024-03-31T01:58:38Z) - Analysis and Optimization of Wireless Federated Learning with Data
Heterogeneity [72.85248553787538]
This paper focuses on performance analysis and optimization for wireless FL, considering data heterogeneity, combined with wireless resource allocation.
We formulate the loss function minimization problem, under constraints on long-term energy consumption and latency, and jointly optimize client scheduling, resource allocation, and the number of local training epochs (CRE)
Experiments on real-world datasets demonstrate that the proposed algorithm outperforms other benchmarks in terms of the learning accuracy and energy consumption.
arXiv Detail & Related papers (2023-08-04T04:18:01Z) - Towards Long-Term Time-Series Forecasting: Feature, Pattern, and
Distribution [57.71199089609161]
Long-term time-series forecasting (LTTF) has become a pressing demand in many applications, such as wind power supply planning.
Transformer models have been adopted to deliver high prediction capacity because of the high computational self-attention mechanism.
We propose an efficient Transformerbased model, named Conformer, which differentiates itself from existing methods for LTTF in three aspects.
arXiv Detail & Related papers (2023-01-05T13:59:29Z) - Combining Embeddings and Fuzzy Time Series for High-Dimensional Time
Series Forecasting in Internet of Energy Applications [0.0]
Fuzzy Time Series (FTS) models stand out as data-driven non-parametric models of easy implementation and high accuracy.
We present a new methodology for handling high-dimensional time series, by projecting the original high-dimensional data into a low dimensional embedding space.
arXiv Detail & Related papers (2021-12-03T19:50:09Z) - Data-Driven Time Series Reconstruction for Modern Power Systems Research [11.447394702830408]
This paper proposes a data-driven framework for reconstructing high-fidelity time series using publicly-available grid snapshots and historical data published by transmission system operators.
Thereby, synthetic but highly realistic time series data, spanning multiple years with a 5-minute granularity, is generated at the individual component level.
arXiv Detail & Related papers (2021-10-26T15:26:38Z) - PIETS: Parallelised Irregularity Encoders for Forecasting with
Heterogeneous Time-Series [5.911865723926626]
Heterogeneity and irregularity of multi-source data sets present a significant challenge to time-series analysis.
In this work, we design a novel architecture, PIETS, to model heterogeneous time-series.
We show that PIETS is able to effectively model heterogeneous temporal data and outperforms other state-of-the-art approaches in the prediction task.
arXiv Detail & Related papers (2021-09-30T20:01:19Z) - Energy-Efficient Model Compression and Splitting for Collaborative
Inference Over Time-Varying Channels [52.60092598312894]
We propose a technique to reduce the total energy bill at the edge device by utilizing model compression and time-varying model split between the edge and remote nodes.
Our proposed solution results in minimal energy consumption and $CO$ emission compared to the considered baselines.
arXiv Detail & Related papers (2021-06-02T07:36:27Z) - Deep Cellular Recurrent Network for Efficient Analysis of Time-Series
Data with Spatial Information [52.635997570873194]
This work proposes a novel deep cellular recurrent neural network (DCRNN) architecture to process complex multi-dimensional time series data with spatial information.
The proposed architecture achieves state-of-the-art performance while utilizing substantially less trainable parameters when compared to comparable methods in the literature.
arXiv Detail & Related papers (2021-01-12T20:08:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.