AI for Water Sustainability: Global Water Quality Assessment and Prediction with Explainable AI with LLM Chatbot for Insights
- URL: http://arxiv.org/abs/2409.10898v3
- Date: Sun, 26 Oct 2025 16:01:09 GMT
- Title: AI for Water Sustainability: Global Water Quality Assessment and Prediction with Explainable AI with LLM Chatbot for Insights
- Authors: Biplov Paneru, Bishwash Paneru
- Abstract summary: This paper introduces various hybrid deep learning models to predict on the CCME dataset with multiple water quality parameters from Canada, China, the UK, the USA, and Ireland. CatBoost, XGBoost, and Extra Trees Regressor predicted Water Quality Index (WQI) values with an average RMSE of 1.2 and an R squared score of 0.99.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Ensuring safe water supplies requires effective water quality monitoring, especially in developing countries like Nepal, where contamination risks are high. This paper introduces various hybrid deep learning models to predict on the CCME dataset with multiple water quality parameters from Canada, China, the UK, the USA, and Ireland, with 2.82 million data records feature-engineered and evaluated using them. Models such as CatBoost, XGBoost, and Extra Trees, along with neural networks combining CNN and LSTM layers, are used to capture temporal and spatial patterns in the data. The model demonstrated notable accuracy improvements, aiding proactive water quality control. CatBoost, XGBoost, and Extra Trees Regressor predicted Water Quality Index (WQI) values with an average RMSE of 1.2 and an R squared score of 0.99. Additionally, classifiers achieved 99% accuracy, cross-validated across models. SHAP analysis showed the importance of indicators like F.R.C. and orthophosphate levels in hybrid architectures' classification decisions. The practical application is demonstrated along with a chatbot application for water quality insights.
Related papers
- Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction [69.38041171537573]
Water quality is foundational to environmental sustainability, ecosystem resilience, and public health. Deep learning offers transformative potential for large-scale water quality prediction and scientific insights generation. Their widespread adoption in high-stakes operational decision-making, such as pollution mitigation and equitable resource allocation, is prevented by unresolved trustworthiness challenges.
arXiv Detail & Related papers (2025-03-13T01:50:50Z)
- Integrating Boosted learning with Differential Evolution (DE) Optimizer: A Prediction of Groundwater Quality Risk Assessment in Odisha [0.0]
This study developed a machine learning-based predictive model to evaluate the Groundwater Quality Index (GWQI).
This was achieved with the help of a hybrid machine learning model, LCBoost Fusion.
arXiv Detail & Related papers (2025-02-25T07:47:41Z)
- Leveraging graph neural networks and mobility data for COVID-19 forecasting [37.9506001142702]
The COVID-19 pandemic has victimized over 7 million people to date, prompting diverse research efforts. Spatio-temporal models combining mobility data with machine learning have gained attention for disease forecasting. Here, we explore Graph Convolutional Recurrent Network (GCRN) and Graph Convolutional Long Short-Term Memory (GTM). The aim is to forecast future values of COVID-19 cases in Brazil and China by leveraging human mobility networks.
arXiv Detail & Related papers (2025-01-20T19:52:31Z)
- Backdoor Attacks against No-Reference Image Quality Assessment Models via a Scalable Trigger [76.36315347198195]
No-Reference Image Quality Assessment (NR-IQA) plays a critical role in evaluating and optimizing computer vision systems.
Recent research indicates that NR-IQA models are susceptible to adversarial attacks.
We present a novel poisoning-based backdoor attack against NR-IQA (BAIQA).
arXiv Detail & Related papers (2024-12-10T08:07:19Z)
- Towards an Autonomous Surface Vehicle Prototype for Artificial Intelligence Applications of Water Quality Monitoring [68.41400824104953]
This paper presents a vehicle prototype that addresses the use of Artificial Intelligence algorithms and enhanced sensing techniques for water quality monitoring.
The vehicle is fully equipped with high-quality sensors to measure water quality parameters and water depth.
By means of a stereo-camera, it also can detect and locate macro-plastics in real environments.
arXiv Detail & Related papers (2024-10-08T10:35:32Z)
- SEN12-WATER: A New Dataset for Hydrological Applications and its Benchmarking [40.996860106131244]
Climate and increasing droughts pose significant challenges to water resource management around the world.
We present a new dataset, SEN12-WATER, along with a benchmark using an end-to-end Deep Learning framework for proactive drought-related analysis.
arXiv Detail & Related papers (2024-09-25T16:50:59Z)
- Introducing δ-XAI: a novel sensitivity-based method for local AI explanations [42.06878765569675]
High-performing AI/ML models often lack interpretability, hampering clinicians' trust in their predictions.
To address this, XAI techniques are being developed to describe AI/ML predictions in human-understandable terms.
Here, we introduce a novel delta-XAI method that provides local explanations of ML model predictions by extending the delta index.
arXiv Detail & Related papers (2024-07-25T19:07:49Z)
- Machine Learning for ALSFRS-R Score Prediction: Making Sense of the Sensor Data [44.99833362998488]
Amyotrophic Lateral Sclerosis (ALS) is a rapidly progressive neurodegenerative disease that presents individuals with limited treatment options.
The present investigation, spearheaded by the iDPP@CLEF 2024 challenge, focuses on utilizing sensor-derived data obtained through an app.
arXiv Detail & Related papers (2024-07-10T19:17:23Z)
- Graph Neural Networks for Pressure Estimation in Water Distribution Systems [44.99833362998488]
Pressure and flow estimation in Water Distribution Networks (WDN) allows water management companies to optimize their control operations.
We combine physics-based modeling and Graph Neural Networks (GNN), a data-driven approach, to address the pressure estimation problem.
Our GNN-based model estimates the pressure of a large-scale WDN in The Netherlands with a MAE of 1.94 mH$_2$O and a MAPE of 7%.
arXiv Detail & Related papers (2023-11-17T15:30:12Z)
- Modeling groundwater levels in California's Central Valley by hierarchical Gaussian process and neural network regression [9.816891579613628]
A novel machine learning method is formulated for modeling groundwater levels by learning from a 3D lithological texture model of the Central Valley aquifer.
We show how the model predictions may be used to supplement hydrological understanding of aquifer responses in basins with irregular well data.
Our results indicate that on average the 2017 and 2019 wet years in California were largely ineffective in replenishing the groundwater loss caused during previous drought years.
arXiv Detail & Related papers (2023-10-23T04:21:26Z)
- Rapid Flood Inundation Forecast Using Fourier Neural Operator [77.30160833875513]
Flood inundation forecast provides critical information for emergency planning before and during flood events.
High-resolution hydrodynamic modeling has become more accessible in recent years; however, predicting flood extents at the street and building levels in real time is still computationally demanding.
We present a hybrid process-based and data-driven machine learning (ML) approach for flood extent and inundation depth prediction.
arXiv Detail & Related papers (2023-07-29T22:49:50Z)
- DeepAqua: Self-Supervised Semantic Segmentation of Wetland Surface Water Extent with SAR Images using Knowledge Distillation [44.99833362998488]
We present DeepAqua, a self-supervised deep learning model that eliminates the need for manual annotations during the training phase.
We exploit cases where optical- and radar-based water masks coincide, enabling the detection of both open and vegetated water surfaces.
Experimental results show that DeepAqua outperforms other unsupervised methods by improving accuracy by 7%, Intersection Over Union by 27%, and F1 score by 14%.
arXiv Detail & Related papers (2023-05-02T18:06:21Z)
- Continuous time recurrent neural networks: overview and application to forecasting blood glucose in the intensive care unit [56.801856519460465]
Continuous time autoregressive recurrent neural networks (CTRNNs) are a deep learning model that account for irregular observations.
We demonstrate the application of these models to probabilistic forecasting of blood glucose in a critical care setting.
arXiv Detail & Related papers (2023-04-14T09:39:06Z)
- An evaluation of deep learning models for predicting water depth evolution in urban floods [59.31940764426359]
We compare different deep learning models for prediction of water depth at high spatial resolution.
Deep learning models are trained to reproduce the data simulated by the CADDIES cellular-automata flood model.
Our results show that the deep learning models present in general lower errors compared to the other methods.
arXiv Detail & Related papers (2023-02-20T16:08:54Z)
- Deep Learning for Prawn Farming: Forecasting and Anomaly Detection [1.7324358447544173]
We present a decision support system for managing water quality in prawn ponds.
The system uses various sources of data and deep learning models in a novel way to provide 24-hour forecasting and anomaly detection of water quality parameters.
arXiv Detail & Related papers (2022-05-12T20:52:30Z)
- SOUL: An Energy-Efficient Unsupervised Online Learning Seizure Detection Classifier [68.8204255655161]
Implantable devices that record neural activity and detect seizures have been adopted to issue warnings or trigger neurostimulation to suppress seizures.
For an implantable seizure detection system, a low power, at-the-edge, online learning algorithm can be employed to dynamically adapt to neural signal drifts.
SOUL was fabricated in TSMC's 28 nm process occupying 0.1 mm² and achieves 1.5 nJ/classification energy efficiency, which is at least 24x more efficient than state-of-the-art.
arXiv Detail & Related papers (2021-10-01T23:01:20Z)
- Artificial Intelligence Hybrid Deep Learning Model for Groundwater Level Prediction Using MLP-ADAM [0.0]
In this paper, a multi-layer perceptron is applied to simulate groundwater level.
The adaptive moment estimation algorithm is also used for this purpose.
Results indicate that deep learning algorithms can demonstrate a high accuracy prediction.
arXiv Detail & Related papers (2021-07-29T10:11:45Z)
- Coastal water quality prediction based on machine learning with feature interpretation and spatio-temporal analysis [1.1124907412872893]
Poor coastal water quality can harbor pathogens that are dangerous to human health.
Routine monitoring data of Escherichia coli and enterococci across 15 public beaches in Rijeka, Croatia, were used to build machine learning models.
The CatBoost algorithm performed best, with R$^2$ values of 0.71 and 0.68 for predicting E. coli and enterococci, respectively.
arXiv Detail & Related papers (2021-07-07T14:00:14Z)
- Back2Future: Leveraging Backfill Dynamics for Improving Real-time Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
The 'backfill' phenomenon and its effect on model performance have been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z)
- When in Doubt: Neural Non-Parametric Uncertainty Quantification for Epidemic Forecasting [70.54920804222031]
Most existing forecasting models disregard uncertainty quantification, resulting in mis-calibrated predictions.
Recent works in deep neural models for uncertainty-aware time-series forecasting also have several limitations.
We model the forecasting task as a probabilistic generative process and propose a functional neural process model called EPIFNP.
arXiv Detail & Related papers (2021-06-07T18:31:47Z)
- Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates [68.09049111171862]
This work focuses on quantifying, reducing and analyzing regression errors in the NLP model updates.
We formulate the regression-free model updates into a constrained optimization problem.
We empirically analyze how model ensemble reduces regression.
arXiv Detail & Related papers (2021-05-07T03:33:00Z)
- Water Quality Prediction on a Sigfox-compliant IoT Device: The Road Ahead of WaterS [0.27998963147546135]
We focus on an Internet of Things water quality prediction system, namely WaterS, that can remotely communicate the gathered measurements.
The solution addresses the water pollution problem while taking into account the peculiar Internet of Things constraints such as energy efficiency and autonomy.
The source code of WaterS ecosystem has been released as open-source, to encourage and promote research activities from both Industry and Academia.
arXiv Detail & Related papers (2020-07-27T11:21:40Z)
- Federated Learning in the Sky: Aerial-Ground Air Quality Sensing Framework with UAV Swarms [53.38353133198842]
Because air quality significantly affects human health, it is increasingly important to predict the Air Quality Index (AQI) accurately and in a timely manner.
This paper proposes a new federated learning-based aerial-ground air quality sensing framework for fine-grained 3D air quality monitoring and forecasting.
For ground sensing systems, we propose a Graph Convolutional neural network-based Long Short-Term Memory (GC-LSTM) model to achieve accurate, real-time and future AQI inference.
arXiv Detail & Related papers (2020-07-23T13:32:47Z)
- CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z)
- A Hybrid Deep Learning Model for Predictive Flood Warning and Situation Awareness using Channel Network Sensors Data [0.965964228590342]
The study used Harris County, Texas as the testbed, and obtained channel sensor data from three historical flood events.
The model is then tested in predicting the 2019 Imelda flood in Houston and the results show an excellent match with the empirical flood.
arXiv Detail & Related papers (2020-06-15T17:25:34Z)
- A multivariate water quality parameter prediction model using recurrent neural network [0.30458514384586394]
This research aims to develop a water quality prediction model based on water quality parameters.
The model was developed using a recurrent neural network (RNN), Long Short-Term Memory (LSTM) and historical water quality data.
The single step model attained an error of 0.01 mg/L, whilst the multiple step model achieved a Root Mean Squared Error (RMSE) of 0.227 mg/L.
arXiv Detail & Related papers (2020-03-25T16:49:52Z)
- Assessing Graph-based Deep Learning Models for Predicting Flash Point [52.931492216239995]
Graph-based deep learning (GBDL) models were implemented in predicting flash point for the first time.
Average R2 and Mean Absolute Error (MAE) scores of MPNN are, respectively, 2.3% lower and 2.0 K higher than previous comparable studies.
arXiv Detail & Related papers (2020-02-26T06:10:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.