Filling in the Blanks: Applying Data Imputation in incomplete Water Metering Data
- URL: http://arxiv.org/abs/2506.08882v1
- Date: Tue, 10 Jun 2025 15:13:05 GMT
- Title: Filling in the Blanks: Applying Data Imputation in incomplete Water Metering Data
- Authors: Dimitrios Amaxilatis, Themistoklis Sarantakos, Ioannis Chatzigiannakis, Georgios Mylonas,
- Abstract summary: We study the application of recent data imputation techniques to enhance monitoring and management of water distribution networks using smart water meters.<n>Our results indicate that effective data imputation can substantially enhance the quality of the insights derived from water consumption data.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we explore the application of recent data imputation techniques to enhance monitoring and management of water distribution networks using smart water meters, based on data derived from a real-world IoT water grid monitoring deployment. Despite the detailed data produced by such meters, data gaps due to technical issues can significantly impact operational decisions and efficiency. Our results, by comparing various imputation methods, such as k-Nearest Neighbors, MissForest, Transformers, and Recurrent Neural Networks, indicate that effective data imputation can substantially enhance the quality of the insights derived from water consumption data as we study their effect on accuracy and reliability of water metering data to provide solutions in applications like leak detection and predictive maintenance scheduling.
Related papers
- Efficient Conformance Checking of Rich Data-Aware Declare Specifications (Extended) [49.46686813437884]
We show that it is possible to compute data-aware optimal alignments in a rich setting with general data types and data conditions.<n>This is achieved by carefully combining the two best-known approaches to deal with control flow and data dependencies.
arXiv Detail & Related papers (2025-06-30T10:16:21Z) - A Nonlinear Low-rank Representation Model with Convolutional Neural Network for Imputing Water Quality Data [12.655766751647985]
Integrity of Water Quality Data (WQD) is critical in environmental monitoring for scientific decision-making and ecological protection.<n>Traditional data imputation methods are difficult to depict the potential dynamics and fail to capture the deep data features.<n>This paper proposes a Low-rank Representation model (NLR) with Convolutional Neural Networks (CNN) for imputing missing WQD.
arXiv Detail & Related papers (2025-06-30T08:48:19Z) - Challenges, Methods, Data -- a Survey of Machine Learning in Water Distribution Networks [5.185604886838128]
This work presents the main tasks in water distribution networks and discusses how they relate to machine learning.
It analyses how the particularities of the domain pose challenges to and can be leveraged by machine learning approaches.
arXiv Detail & Related papers (2024-10-16T11:21:07Z) - Dataset Growth [59.68869191071907]
InfoGrowth is an efficient online algorithm for data cleaning and selection.
It can improve data quality/efficiency on both single-modal and multi-modal tasks.
arXiv Detail & Related papers (2024-05-28T16:43:57Z) - Importance-Aware Adaptive Dataset Distillation [53.79746115426363]
Development of deep learning models is enabled by the availability of large-scale datasets.
dataset distillation aims to synthesize a compact dataset that retains the essential information from the large original dataset.
We propose an importance-aware adaptive dataset distillation (IADD) method that can improve distillation performance.
arXiv Detail & Related papers (2024-01-29T03:29:39Z) - GraphGuard: Detecting and Counteracting Training Data Misuse in Graph
Neural Networks [69.97213941893351]
The emergence of Graph Neural Networks (GNNs) in graph data analysis has raised critical concerns about data misuse during model training.
Existing methodologies address either data misuse detection or mitigation, and are primarily designed for local GNN models.
This paper introduces a pioneering approach called GraphGuard, to tackle these challenges.
arXiv Detail & Related papers (2023-12-13T02:59:37Z) - Estimating irregular water demands with physics-informed machine
learning to inform leakage detection [0.0]
Leakages in drinking water distribution networks pose significant challenges to water utilities.
We present a physics-informed machine learning algorithm that analyses pressure data and estimates unknown irregular water demands.
arXiv Detail & Related papers (2023-09-06T11:55:16Z) - Deep learning-based flow disaggregation for short-term hydropower plant
operations [2.4874453414078896]
High temporal resolution data plays a vital role in effective short-term hydropower plant operations.
In this study, we propose a deep learning-based time series disaggregation model to derive hourly inflow data from daily inflow data for short-term hydropower plant operations.
arXiv Detail & Related papers (2023-08-11T10:52:43Z) - Non-intrusive Water Usage Classification Considering Limited Training
Data [5.935761705025763]
Smart metering of domestic water consumption to continuously monitor the usage of different appliances has been shown to have an impact on people's behavior towards water conservation.
The installation of multiple sensors to monitor each appliance currently has a high initial cost and as a result, monitoring consumption from different appliances using sensors is not cost-effective.
We propose a new algorithm for classifying single and overlapping household water usage events, using the total domestic consumption measurements.
arXiv Detail & Related papers (2023-01-02T11:57:36Z) - Deep Reinforcement Learning Assisted Federated Learning Algorithm for
Data Management of IIoT [82.33080550378068]
The continuous expanded scale of the industrial Internet of Things (IIoT) leads to IIoT equipments generating massive amounts of user data every moment.
How to manage these time series data in an efficient and safe way in the field of IIoT is still an open issue.
This paper studies the FL technology applications to manage IIoT equipment data in wireless network environments.
arXiv Detail & Related papers (2022-02-03T07:12:36Z) - DeepTimeAnomalyViz: A Tool for Visualizing and Post-processing Deep
Learning Anomaly Detection Results for Industrial Time-Series [88.12892448747291]
We introduce the DeTAVIZ interface, which is a web browser based visualization tool for quick exploration and assessment of feasibility of DL based anomaly detection in a given problem.
DeTAVIZ allows the user to easily and quickly iterate through multiple post processing options and compare different models, and allows for manual optimisation towards a chosen metric.
arXiv Detail & Related papers (2021-09-21T10:38:26Z) - Vessel and Port Efficiency Metrics through Validated AIS data [0.0]
We propose a machine learning-based data-driven methodology to detect and correct erroneous AIS data.
We also propose a metric that can be used by vessel operators and ports to express numerically their business and environmental efficiency.
arXiv Detail & Related papers (2021-04-30T19:51:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.