Lightweight ML-Based Air Quality Prediction for IoT and Embedded Applications
- URL: http://arxiv.org/abs/2511.21857v1
- Date: Wed, 26 Nov 2025 19:31:20 GMT
- Title: Lightweight ML-Based Air Quality Prediction for IoT and Embedded Applications
- Authors: Md. Sad Abdullah Sami, Mushfiquzzaman Abid,
- Abstract summary: This study investigates the effectiveness and efficiency of two variants of the XGBoost regression model. The full XGBoost model achieved superior predictive accuracy for both pollutants, while the tiny model, though slightly less precise, offered substantial computational benefits. This makes the tiny XGBoost model suitable for real-time air-quality monitoring in IoT and embedded applications.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This study investigates the effectiveness and efficiency of two variants of the XGBoost regression model, the full-capacity and lightweight (tiny) versions, for predicting the concentrations of carbon monoxide (CO) and nitrogen dioxide (NO₂). Using the AirQualityUCI dataset collected over one year in an urban environment, we conducted a comprehensive evaluation based on widely accepted metrics, including Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Bias Error (MBE), and the coefficient of determination (R²). In addition, we assessed resource-oriented metrics such as inference time, model size, and peak RAM usage. The full XGBoost model achieved superior predictive accuracy for both pollutants, while the tiny model, though slightly less precise, offered substantial computational benefits with significantly reduced inference time and model storage requirements. These results demonstrate the feasibility of deploying simplified models in resource-constrained environments without compromising predictive quality. This makes the tiny XGBoost model suitable for real-time air-quality monitoring in IoT and embedded applications.
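As a rough illustration of the four accuracy metrics named in the abstract, the following sketch computes MAE, RMSE, MBE, and R² from paired observed and predicted values. It is not the authors' evaluation code: the function name `regression_metrics`, the sample values, and the MBE sign convention (predicted minus observed) are assumptions made here for illustration only.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, RMSE, MBE and R^2 for two equal-length sequences.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    # Assumed sign convention: error = predicted - observed,
    # so a positive MBE means the model over-predicts on average.
    errors = [p - t for t, p in zip(y_true, y_pred)]

    mae = sum(abs(e) for e in errors) / n
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)
    mbe = sum(errors) / n

    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    # R^2 is undefined when the observations have zero variance.
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return {"mae": mae, "rmse": rmse, "mbe": mbe, "r2": r2}


if __name__ == "__main__":
    # Made-up concentrations, used only to exercise the function.
    observed = [1.0, 2.0, 3.0, 4.0]
    predicted = [1.5, 2.0, 2.5, 4.0]
    print(regression_metrics(observed, predicted))
```

In a comparison like the one described, the same function would be applied to the predictions of both the full and the tiny model on a held-out test split, alongside separately measured inference time, model size, and peak RAM usage.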
Related papers
- Observationally Informed Adaptive Causal Experimental Design [55.998153710215654]
We propose Active Residual Learning, a new paradigm that leverages the observational model as a foundational prior. This approach shifts the experimental focus from learning target causal quantities from scratch to efficiently estimating the residuals required to correct observational bias. Experiments on synthetic and semi-synthetic benchmarks demonstrate that R-Design significantly outperforms baselines.
arXiv Detail & Related papers (2026-03-04T06:52:37Z) - Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting [1.2744523252873352]
This study investigates whether lightweight additive models -- Facebook Prophet (FBP) and NeuralProphet (NP) -- can deliver competitive forecasts for particulate matter in Beijing, China. Using multi-year pollutant and meteorological data, we applied systematic feature selection (correlation, mutual information, mRMR), leakage-safe scaling, and chronological data splits. Results show that FBP consistently outperformed NP, SARIMAX, and the learning-based baselines, achieving test $R^2$ above 0.94 for both pollutants.
arXiv Detail & Related papers (2025-12-09T19:39:45Z) - China Regional 3km Downscaling Based on Residual Corrective Diffusion Model [39.12803910865843]
This work focuses on statistical downscaling, which establishes statistical relationships between low-resolution and high-resolution historical data. In contrast to the original work of CorrDiff, the region considered in this work is nearly 40 times larger. Deep learning has emerged as a powerful tool for this task, giving rise to various high-performance super-resolution models.
arXiv Detail & Related papers (2025-12-05T02:27:08Z) - Km-scale dynamical downscaling through conformalized latent diffusion models [45.94979929172337]
Dynamical downscaling is crucial for deriving high-resolution meteorological fields from coarse-scale simulations. Generative Diffusion models (DMs) have recently emerged as powerful data-driven tools for this task. However, DMs lack finite-sample guarantees against overconfident predictions, resulting in miscalibrated grid-point-level uncertainty estimates. We tackle this issue by augmenting the downscaling pipeline with a conformal prediction framework.
arXiv Detail & Related papers (2025-10-15T08:41:36Z) - Deep Learning-Enhanced for Amine Emission Monitoring and Performance Analysis in Industrial Carbon Capture Plants [0.6533091401094101]
We present data-driven deep learning models for forecasting and monitoring amine emissions and key performance parameters in amine-based post-combustion carbon capture systems. For emission prediction, models were designed for 2-amino-2-methyl-1-propanol (AMP) and Piperazine emissions measured via FTIR and IMR-MS methods. These models achieved high predictive accuracy exceeding 99% and effectively tracked both steady trends and abrupt fluctuations.
arXiv Detail & Related papers (2025-09-05T16:57:54Z) - Graph-Based Physics-Guided Urban PM2.5 Air Quality Imputation with Constrained Monitoring Data [7.076209890890611]
This work introduces GraPhy, a graph-based, physics-guided learning framework for high-resolution and accurate air quality modeling in urban areas with limited monitoring data. Experiments using data from California's socioeconomically disadvantaged San Joaquin Valley show that GraPhy achieves the overall best performance evaluated by mean squared error (MSE), mean absolute error (MAE), and R-square value (R2), improving the performance by 9%-56% compared to various baseline models.
arXiv Detail & Related papers (2025-06-07T20:33:52Z) - Advancing Air Quality Monitoring: TinyML-Based Real-Time Ozone Prediction with Cost-Effective Edge Devices [0.0]
This paper introduces a novel TinyML-based system designed to predict ozone concentration in real-time. The system employs an Arduino Nano 33 BLE Sense microcontroller equipped with an MQ7 sensor for carbon monoxide (CO) detection and built-in sensors for temperature and pressure measurements.
arXiv Detail & Related papers (2025-04-03T10:48:24Z) - Ultra-Resolution Adaptation with Ease [62.56434979517156]
We propose a set of key guidelines for ultra-resolution adaptation termed URAE. We show that tuning minor components of the weight matrices outperforms widely-used low-rank adapters when synthetic data are unavailable. Experiments validate that URAE achieves comparable 2K-generation performance to state-of-the-art closed-source models like FLUX1.1 [Pro] Ultra with only 3K samples and 2K iterations.
arXiv Detail & Related papers (2025-03-20T16:44:43Z) - Bridging Jensen Gap for Max-Min Group Fairness Optimization in Recommendation [63.66719748453878]
Group max-min fairness (MMF) is commonly used in fairness-aware recommender systems (RS) as an optimization objective. We present an efficient and effective algorithm named FairDual, which utilizes a dual optimization technique to minimize the Jensen gap. Our theoretical analysis demonstrates that FairDual can achieve a sub-linear convergence rate to the globally optimal solution.
arXiv Detail & Related papers (2025-02-13T13:33:45Z) - Can Deep Learning Trigger Alerts from Mobile-Captured Images? [0.0594961162060159]
This research contributes to verification of data augmentation techniques, CNN-based regression modelling for air quality prediction, and user-centric air quality monitoring through mobile technology. The proposed system offers practical solutions for individuals to make informed environmental health and well-being decisions.
arXiv Detail & Related papers (2025-01-07T03:39:43Z) - Air Quality Forecasting Using Machine Learning: A Global perspective with Relevance to Low-Resource Settings [0.0]
Air pollution stands as the fourth leading cause of death globally.
This study proposes a novel machine learning approach for accurate air quality prediction using two months of air quality data.
arXiv Detail & Related papers (2024-01-09T05:52:02Z) - Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions [45.04514545004051]
We provide convergence guarantees for score-based generative models (SGMs).
We also examine SGMs based on the critically damped Langevin diffusion (CLD).
arXiv Detail & Related papers (2022-09-22T17:55:01Z) - Federated Learning in the Sky: Aerial-Ground Air Quality Sensing Framework with UAV Swarms [53.38353133198842]
Because air quality significantly affects human health, it is increasingly important to predict the Air Quality Index (AQI) accurately and in a timely manner.
This paper proposes a new federated learning-based aerial-ground air quality sensing framework for fine-grained 3D air quality monitoring and forecasting.
For ground sensing systems, we propose a Graph Convolutional neural network-based Long Short-Term Memory (GC-LSTM) model to achieve accurate, real-time and future AQI inference.
arXiv Detail & Related papers (2020-07-23T13:32:47Z) - SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples.
We propose a modular acceleration system, called SUOD, to address it.
arXiv Detail & Related papers (2020-03-11T00:22:50Z)