BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark
for Short-Term Load Forecasting
- URL: http://arxiv.org/abs/2307.00142v3
- Date: Wed, 10 Jan 2024 15:07:03 GMT
- Title: BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark
for Short-Term Load Forecasting
- Authors: Patrick Emami, Abhijeet Sahu, Peter Graf
- Abstract summary: Data-driven short-term load forecasting (STLF) has suffered from a lack of open, large-scale datasets with high building diversity.
We present BuildingsBench, which consists of: 1) Buildings-900K, a large-scale dataset of 900K simulated buildings representing the U.S. building stock; and 2) an evaluation platform with over 1,900 real residential and commercial buildings from 7 open datasets.
- Score: 3.8489470606363514
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Short-term forecasting of residential and commercial building energy
consumption is widely used in power systems and continues to grow in
importance. Data-driven short-term load forecasting (STLF), although promising,
has suffered from a lack of open, large-scale datasets with high building
diversity. This has hindered exploring the pretrain-then-fine-tune paradigm for
STLF. To help address this, we present BuildingsBench, which consists of: 1)
Buildings-900K, a large-scale dataset of 900K simulated buildings representing
the U.S. building stock; and 2) an evaluation platform with over 1,900 real
residential and commercial buildings from 7 open datasets. BuildingsBench
benchmarks two under-explored tasks: zero-shot STLF, where a pretrained model
is evaluated on unseen buildings without fine-tuning, and transfer learning,
where a pretrained model is fine-tuned on a target building. The main finding
of our benchmark analysis is that synthetically pretrained models generalize
surprisingly well to real commercial buildings. An exploration of the effect of
increasing dataset size and diversity on zero-shot commercial building
performance reveals a power-law with diminishing returns. We also show that
fine-tuning pretrained models on real commercial and residential buildings
improves performance for a majority of target buildings. We hope that
BuildingsBench encourages and facilitates future research on generalizable
STLF. All datasets and code can be accessed from
https://github.com/NREL/BuildingsBench.
Related papers
- GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation [90.53485251837235]
GIFT-Eval is a pioneering benchmark aimed at promoting evaluation across diverse datasets.
GIFT-Eval encompasses 28 datasets over 144,000 time series and 177 million data points.
We also provide a non-leaking pretraining dataset containing approximately 230 billion data points.
arXiv Detail & Related papers (2024-10-14T11:29:38Z) - Predicting building types and functions at transnational scale [0.0]
We train a graph neural network (GNN) classifier on a large-scale graph dataset consisting of OpenStreetMap (OSM) buildings across the EU, Norway, Switzerland, and the UK.
A graph transformer model achieves a high Cohen's kappa coefficient of 0.754 when classifying buildings into 9 classes, and a very high Cohen's kappa coefficient of 0.844 when classifying buildings into the residential and non-residential classes.
arXiv Detail & Related papers (2024-09-15T11:02:45Z) - Pushing the Limits of Pre-training for Time Series Forecasting in the
CloudOps Domain [54.67888148566323]
We introduce three large-scale time series forecasting datasets from the cloud operations domain.
We show it is a strong zero-shot baseline and benefits from further scaling, both in model and dataset size.
Accompanying these datasets and results is a suite of comprehensive benchmark results comparing classical and deep learning baselines to our pre-trained method.
arXiv Detail & Related papers (2023-10-08T08:09:51Z) - Semi-supervised Learning from Street-View Images and OpenStreetMap for
Automatic Building Height Estimation [59.6553058160943]
We propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OpenStreetMap data.
The proposed method leads to a clear performance boosting in estimating building heights with a Mean Absolute Error (MAE) around 2.1 meters.
The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data.
arXiv Detail & Related papers (2023-07-05T18:16:30Z) - Points for Energy Renovation (PointER): A LiDAR-Derived Point Cloud
Dataset of One Million English Buildings Linked to Energy Characteristics [0.705947228027401]
This paper presents a building point cloud dataset that promotes a data-driven, large-scale understanding of the 3D representation of buildings and their energy characteristics.
We generate building point clouds by intersecting building footprints with geo-referenced LiDAR data and link them with attributes from UK's energy performance database.
To achieve a representative sample, we select one million buildings from a range of rural and urban regions across England, of which half a million are linked to energy characteristics.
arXiv Detail & Related papers (2023-06-28T08:48:22Z) - Retrieval-Enhanced Contrastive Vision-Text Models [61.783728119255365]
We propose to equip vision-text models with the ability to refine their embedding with cross-modal retrieved information from a memory at inference time.
Remarkably, we show that this can be done with a light-weight, single-layer, fusion transformer on top of a frozen CLIP.
Our experiments validate that our retrieval-enhanced contrastive (RECO) training improves CLIP performance substantially on several challenging fine-grained tasks.
arXiv Detail & Related papers (2023-06-12T15:52:02Z) - Building Coverage Estimation with Low-resolution Remote Sensing Imagery [65.95520230761544]
We propose a method for estimating building coverage using only publicly available low-resolution satellite imagery.
Our model achieves a coefficient of determination as high as 0.968 on predicting building coverage in regions of different levels of development around the world.
arXiv Detail & Related papers (2023-01-04T05:19:33Z) - Estimating Building Energy Efficiency From Street View Imagery, Aerial
Imagery, and Land Surface Temperature Data [0.0]
This work proposes a new method which can estimate a building's energy efficiency using purely remotely sensed data.
We find that in the binary setting of distinguishing efficient from inefficient buildings, our end-to-end deep learning model achieves a macro-averaged F1-score of 62.06%.
arXiv Detail & Related papers (2022-06-05T21:04:20Z) - Zero Shot Learning for Predicting Energy Usage of Buildings in
Sustainable Design [2.929237637363991]
The 2030 Challenge is aimed at making all new buildings and major renovations carbon neutral by 2030.
It is important to understand how the various building factors contribute to energy usage of a building, right at design time.
Rich training datasets are needed for AI-based solutions to achieve good prediction accuracy.
arXiv Detail & Related papers (2022-02-10T18:08:58Z) - Lifelong Property Price Prediction: A Case Study for the Toronto Real
Estate Market [75.28009817291752]
We present Luce, the first life-long predictive model for automated property valuation.
Luce addresses two critical issues of property valuation: the lack of recent sold prices and the sparsity of house data.
We demonstrate the benefit of Luce by applying it to large, real-life datasets obtained from the Toronto real estate market.
arXiv Detail & Related papers (2020-08-12T07:32:16Z) - Holistic Parameteric Reconstruction of Building Models from Point Clouds [9.93322840476651]
We propose a holistic parametric reconstruction method which means taking into consideration the entire point clouds of one building simultaneously.
We first use a well-designed deep neural network to segment and identify primitives in the given building point clouds.
A holistic optimization strategy is then introduced to simultaneously determine the parameters of a segmented primitive.
The achieved overall quality of reconstruction is 0.08 meters for point-surface-distance or 0.7 times RMSE of the input LiDAR points.
arXiv Detail & Related papers (2020-05-19T05:42:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.