Building power consumption datasets: Survey, taxonomy and future
directions
- URL: http://arxiv.org/abs/2009.08192v1
- Date: Thu, 17 Sep 2020 10:19:21 GMT
- Title: Building power consumption datasets: Survey, taxonomy and future
directions
- Authors: Yassine Himeur and Abdullah Alsalemi and Faycal Bensaali and Abbes
Amira
- Abstract summary: This work surveys, studies and visualizes the numerical and methodological nature of building energy consumption datasets.
A total of thirty-one databases are examined and compared in terms of several features, such as the geographical location, period of collection, number of monitored households, sampling rate of collected data, number of sub-metered appliances, extracted features and release date.
A novel dataset is presented, namely the Qatar University dataset, an annotated dataset for power consumption anomaly detection.
- Score: 2.389598109913753
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the last decade, extensive efforts have been devoted to energy efficiency.
Several energy consumption datasets have since been published, each
varying in properties, uses and limitations. For instance, building
energy consumption patterns are shaped by several factors, including ambient
conditions, user occupancy, weather conditions and consumer preferences. Thus,
a proper understanding of the available datasets will result in a strong basis
for improving energy efficiency. Motivated by the need for a comprehensive
review of existing databases, this work surveys, studies and
visualizes the numerical and methodological nature of building energy
consumption datasets. A total of thirty-one databases are examined and compared
in terms of several features, such as the geographical location, period of
collection, number of monitored households, sampling rate of collected data,
number of sub-metered appliances, extracted features and release date.
Furthermore, data collection platforms and related modules for data
transmission, data storage and privacy concerns used in different datasets are
also analyzed and compared. Based on this analytical study, a novel dataset is
presented, namely the Qatar University dataset, an annotated power
consumption anomaly detection dataset. It is intended for
training and testing anomaly detection algorithms, and hence for reducing wasted
energy. Moving forward, a set of recommendations is derived to improve dataset
collection, such as the adoption of multi-modal data collection, smart Internet
of Things data collection, low-cost hardware platforms, and privacy and security
mechanisms. In addition, future directions to improve dataset exploitation and
utilization are identified, including the use of novel machine learning
solutions, innovative visualization tools and explainable recommender systems.
Related papers
- Occupancy Detection Based on Electricity Consumption [0.0]
This article presents a new methodology for extracting intervals when a home is vacant from low-frequency electricity consumption data.
It shows encouraging results on both simulated and real consumption curves.
arXiv Detail & Related papers (2023-12-13T21:49:09Z)
- Benchmarks and Custom Package for Energy Forecasting [55.460452605056894]
Energy forecasting aims to minimize the cost of subsequent tasks such as power grid dispatch.
In this paper, we collected large-scale load datasets and released a new renewable energy dataset.
We conducted extensive experiments with 21 forecasting methods on these energy datasets at different levels under 11 evaluation metrics.
arXiv Detail & Related papers (2023-07-14T06:50:02Z)
- LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting [65.71129509623587]
Road traffic forecasting plays a critical role in smart city initiatives and has experienced significant advancements thanks to the power of deep learning.
However, the promising results achieved on current public datasets may not be applicable to practical scenarios.
We introduce the LargeST benchmark dataset, which includes a total of 8,600 sensors in California with a 5-year time coverage.
arXiv Detail & Related papers (2023-06-14T05:48:36Z)
- infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information [68.76707843019886]
infoVerse is a universal framework for dataset characterization.
infoVerse captures multidimensional characteristics of datasets by incorporating various model-driven meta-information.
In three real-world applications (data pruning, active learning, and data annotation), the samples chosen on infoVerse space consistently outperform strong baselines.
arXiv Detail & Related papers (2023-05-30T18:12:48Z)
- Self-supervised Activity Representation Learning with Incremental Data: An Empirical Study [7.782045150068569]
This research examines the impact of using a self-supervised representation learning model for time series classification tasks.
We analyzed the effect of varying the size, distribution, and source of the unlabeled data on the final classification performance across four public datasets.
arXiv Detail & Related papers (2023-05-01T01:39:55Z)
- A Comprehensive Survey of Dataset Distillation [73.15482472726555]
It has become challenging to handle the unlimited growth of data with limited computing power.
Deep learning technology has developed unprecedentedly in the last decade.
This paper provides a holistic understanding of dataset distillation from multiple aspects.
arXiv Detail & Related papers (2023-01-13T15:11:38Z)
- TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z)
- DC-BENCH: Dataset Condensation Benchmark [79.18718490863908]
This work provides the first large-scale standardized benchmark on dataset condensation.
It consists of a suite of evaluations to comprehensively reflect the generability and effectiveness of condensation methods.
The benchmark library is open-sourced to facilitate future research and application.
arXiv Detail & Related papers (2022-07-20T03:54:05Z)
- LEAD1.0: A Large-scale Annotated Dataset for Energy Anomaly Detection in Commercial Buildings [0.0]
We release a well-annotated version of a publicly available ASHRAE Great Energy Predictor III data set containing 1,413 smart electricity meter time series spanning over one year.
We benchmark the performance of eight state-of-the-art anomaly detection methods on our dataset and compare their performance.
arXiv Detail & Related papers (2022-03-30T07:30:59Z)
- A Collection and Categorization of Open-Source Wind and Wind Power Datasets [0.0]
We show that there are publicly available datasets sufficient for wind power forecasting tasks.
We also discuss the properties of the different data groups to enable researchers to choose appropriate open-source datasets.
arXiv Detail & Related papers (2022-02-17T08:53:09Z)
- Energy Disaggregation with Semi-supervised Sparse Coding [0.0]
Energy disaggregation research aims to decompose the aggregated energy consumption data into its component appliances.
In this paper, a discriminative disaggregation model based on sparse coding is evaluated on a large-scale household power usage dataset for energy conservation.
arXiv Detail & Related papers (2020-04-20T21:05:25Z)
This list is automatically generated from the titles and abstracts of the papers on this site.