Dynamic Time Warping Clustering to Discover Socio-Economic
Characteristics in Smart Water Meter Data
- URL: http://arxiv.org/abs/2112.13778v2
- Date: Tue, 28 Dec 2021 20:17:05 GMT
- Title: Dynamic Time Warping Clustering to Discover Socio-Economic
Characteristics in Smart Water Meter Data
- Authors: D. B. Steffelbauer, E. J. M. Blokker, S. G. Buchberger, A. Knobbe, E.
Abraham
- Abstract summary: This paper aims to link smart water meter data to socio-economic user characteristics by applying a novel clustering algorithm.
The approach is tested on simulated and measured single family home datasets.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Socio-economic characteristics are influencing the temporal and spatial
variability of water demand - the biggest source of uncertainties within water
distribution system modeling. Improving our knowledge on these influences can
be utilized to decrease demand uncertainties. This paper aims to link smart
water meter data to socio-economic user characteristics by applying a novel
clustering algorithm that uses dynamic time warping on daily demand patterns.
The approach is tested on simulated and measured single family home datasets.
We show that the novel algorithm performs better compared to commonly used
clustering methods, both, in finding the right number of clusters as well as
assigning patterns correctly. Additionally, the methodology can be used to
identify outliers within clusters of demand patterns. Furthermore, this study
investigates which socio-economic characteristics (e.g. employment status,
number of residents) are prevalent within single clusters and, consequently,
can be linked to the shape of the cluster's barycenters. In future, the
proposed methods in combination with stochastic demand models can be used to
fill data-gaps in hydraulic models.
Related papers
- Statistical Jump Model for Mixed-Type Data with Missing Data Imputation [0.0]
We introduce the statistical jump model for mixed-type data.
This framework incorporates regime persistence, enhancing interpretability and reducing the frequency of state switches.
We validate our approach through extensive simulation studies and an empirical application to air quality data.
arXiv Detail & Related papers (2024-09-02T12:36:35Z) - A Machine Learning-Based Framework for Clustering Residential
Electricity Load Profiles to Enhance Demand Response Programs [0.0]
We present a novel machine learning based framework in order to achieve optimal load profiling through a real case study.
In this paper, we present a novel machine learning based framework in order to achieve optimal load profiling through a real case study.
arXiv Detail & Related papers (2023-10-31T11:23:26Z) - Causal Feature Selection via Transfer Entropy [59.999594949050596]
Causal discovery aims to identify causal relationships between features with observational data.
We introduce a new causal feature selection approach that relies on the forward and backward feature selection procedures.
We provide theoretical guarantees on the regression and classification errors for both the exact and the finite-sample cases.
arXiv Detail & Related papers (2023-10-17T08:04:45Z) - Optimization of a Hydrodynamic Computational Reservoir through Evolution [58.720142291102135]
We interface with a model of a hydrodynamic system, under development by a startup, as a computational reservoir.
We optimized the readout times and how inputs are mapped to the wave amplitude or frequency using an evolutionary search algorithm.
Applying evolutionary methods to this reservoir system substantially improved separability on an XNOR task, in comparison to implementations with hand-selected parameters.
arXiv Detail & Related papers (2023-04-20T19:15:02Z) - Targeted demand response for flexible energy communities using
clustering techniques [2.572906392867547]
The goal is to alter the consumption behavior of the prosumers within a distributed energy community in Italy.
Three popular machine learning algorithms are employed, namely k-means, k-medoids and agglomerative clustering.
We evaluate the methods using multiple metrics including a novel metric proposed within this study, namely peak performance score (PPS)
arXiv Detail & Related papers (2023-03-01T02:29:30Z) - HyperImpute: Generalized Iterative Imputation with Automatic Model
Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z) - Selecting the suitable resampling strategy for imbalanced data
classification regarding dataset properties [62.997667081978825]
In many application domains such as medicine, information retrieval, cybersecurity, social media, etc., datasets used for inducing classification models often have an unequal distribution of the instances of each class.
This situation, known as imbalanced data classification, causes low predictive performance for the minority class examples.
Oversampling and undersampling techniques are well-known strategies to deal with this problem by balancing the number of examples of each class.
arXiv Detail & Related papers (2021-12-15T18:56:39Z) - A Spatial-temporal Graph Deep Learning Model for Urban Flood Nowcasting
Leveraging Heterogeneous Community Features [1.2599533416395765]
The objective of this study is to develop and test a novel structured deep-learning modeling framework for urban flood nowcasting.
We present a new computational modeling framework including an attention-based spatial-temporal graph convolution network (ASTGCN) model.
Results indicate that the model provides superior performance for the nowcasting of urban flood inundation at the census tract level.
arXiv Detail & Related papers (2021-11-09T15:35:05Z) - Uncover Residential Energy Consumption Patterns Using Socioeconomic and
Smart Meter Data [2.9340691207364786]
We analyze the real-world smart meter data and extract load patterns using K-Medoids clustering.
We develop an analytical framework with feature selection and deep learning models to estimate the relationship between load patterns and socioeconomic features.
arXiv Detail & Related papers (2021-04-12T01:57:14Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Topology-based Clusterwise Regression for User Segmentation and Demand
Forecasting [63.78344280962136]
Using a public and a novel proprietary data set of commercial data, this research shows that the proposed system enables analysts to both cluster their user base and plan demand at a granular level.
This work seeks to introduce TDA-based clustering of time series and clusterwise regression with matrix factorization methods as viable tools for the practitioner.
arXiv Detail & Related papers (2020-09-08T12:10:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.