Related papers: Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting

Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting

URL: http://arxiv.org/abs/2512.09076v1
Date: Tue, 09 Dec 2025 19:39:45 GMT
Title: Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting
Authors: Moazzam Umer Gondal, Hamad ul Qudous, Asma Ahmad Farhan,
Abstract summary: This study investigates whether lightweight additive models -- Facebook Prophet (FBP) and NeuralProphet (NP) -- can deliver competitive forecasts for particulate matter in Beijing, China.<n>Using multi-year pollutant and meteorological data, we applied systematic feature selection (correlation, mutual information, mRMR), leakage-safe scaling, and chronological data splits.<n>Results show that FBP consistently outperformed NP, SARIMAX, and the learning-based baselines, achieving test $R2$ above 0.94 for both pollutants.
Score: 1.2744523252873352
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Accurate forecasting of urban air pollution is essential for protecting public health and guiding mitigation policies. While Deep Learning (DL) and hybrid pipelines dominate recent research, their complexity and limited interpretability hinder operational use. This study investigates whether lightweight additive models -- Facebook Prophet (FBP) and NeuralProphet (NP) -- can deliver competitive forecasts for particulate matter (PM$_{2.5}$, PM$_{10}$) in Beijing, China. Using multi-year pollutant and meteorological data, we applied systematic feature selection (correlation, mutual information, mRMR), leakage-safe scaling, and chronological data splits. Both models were trained with pollutant and precursor regressors, with NP additionally leveraging lagged dependencies. For context, two machine learning baselines (LSTM, LightGBM) and one traditional statistical model (SARIMAX) were also implemented. Performance was evaluated on a 7-day holdout using MAE, RMSE, and $R^2$. Results show that FBP consistently outperformed NP, SARIMAX, and the learning-based baselines, achieving test $R^2$ above 0.94 for both pollutants. These findings demonstrate that interpretable additive models remain competitive with both traditional and complex approaches, offering a practical balance of accuracy, transparency, and ease of deployment.

Related papers

Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning [51.87858735871145]
We introduce Iprox, a framework that derives influence-preserving proxies directly from the target model.<n>Iprox consistently outperforms off-the-shelf proxies and baseline methods.
arXiv Detail & Related papers (2026-02-19T20:57:30Z)
ProAct: Agentic Lookahead in Interactive Environments [56.50613398808361]
ProAct is a framework that enables agents to internalize accurate lookahead reasoning through a two-stage training paradigm.<n>We introduce Grounded LookAhead Distillation (GLAD), where the agent undergoes supervised fine-tuning on trajectories derived from environment-based search.<n>We also propose the Monte-Carlo Critic (MC-Critic), a plug-and-play auxiliary value estimator designed to enhance policy-gradient algorithms.
arXiv Detail & Related papers (2026-02-05T05:45:16Z)
Lightweight ML-Based Air Quality Prediction for IoT and Embedded Applications [0.0]
This study investigates the effectiveness and efficiency of two variants of the XGBoost regression model.<n>The full XGBoost model achieved superior predictive accuracy for both pollutants, while the tiny model, though slightly less precise, offered substantial computational benefits.<n>This makes the tiny XGBoost model suitable for real-time air-quality monitoring in IoT and embedded applications.
arXiv Detail & Related papers (2025-11-26T19:31:20Z)
A comparison between geostatistical and machine learning models for spatio-temporal prediction of PM2.5 data [0.0]
Exposure to high concentrations of PM2.5$ have been linked to increased respiratory and cardiovascular hospital admissions, more emergency department visits and deaths.<n>Traditional air quality monitoring systems provide limited spatial and temporal data.<n>The advent of low-cost sensors has dramatically improved the granularity of air quality data, enabling real-time, high-resolution monitoring.<n>This study exploits the extensive data from PurpleAir sensors to assess and compare the effectiveness of various statistical and machine learning models in producing accurate hourly PM$_2.5$ maps across California.
arXiv Detail & Related papers (2025-09-15T15:32:57Z)
When Simpler Wins: Facebooks Prophet vs LSTM for Air Pollution Forecasting in Data-Constrained Northern Nigeria [0.44198435146063364]
This study evaluates Long Short-Term Memory (LSTM) networks and the Facebook Prophet model for forecasting multiple pollutants.<n>Results show that Prophet often matches or exceeds LSTM's accuracy, particularly in series dominated by seasonal and long-term trends.
arXiv Detail & Related papers (2025-08-22T09:23:59Z)
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination [67.67725938962798]
Pre-training on massive web-scale corpora leaves Qwen2.5 susceptible to data contamination in widely used benchmarks.<n>We introduce a generator that creates fully clean arithmetic problems of arbitrary length and difficulty, dubbed RandomCalculation.<n>We show that only accurate reward signals yield steady improvements that surpass the base model's performance boundary.
arXiv Detail & Related papers (2025-07-14T17:55:15Z)
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs [75.72672339168092]
We introduce ReasonFlux-PRM, a novel trajectory-aware PRM to evaluate trajectory-response type of reasoning traces.<n>ReasonFlux-PRM incorporates both step-level and trajectory-level supervision, enabling fine-grained reward assignment aligned with structured chain-of-thought data.<n>Our derived ReasonFlux-PRM-7B yields consistent performance improvements, achieving average gains of 12.1% in supervised fine-tuning, 4.5% in reinforcement learning, and 6.3% in test-time scaling.
arXiv Detail & Related papers (2025-06-23T17:59:02Z)
Machine Learning Models for Reinforced Concrete Pipes Condition Prediction: The State-of-the-Art Using Artificial Neural Networks and Multiple Linear Regression in a Wisconsin Case Study [0.0]
The aging sewer infrastructure in the U.S., covering 2.1 million kilometers, encounters increasing structural issues.<n>Around 75,000 yearly sanitary sewer overflows present serious economic, environmental, and public health hazards.<n>This research intends to enhance predictive accuracy for the condition of sewer pipelines through machine learning models.
arXiv Detail & Related papers (2025-02-01T08:16:08Z)
Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort? [0.0]
Energy communities (ECs) play a key role in enabling local demand shifting and enhancing self-sufficiency.<n>Data-driven forecasting has gained significant attention, but it remains insufficiently explored in many practical contexts.<n>This study evaluates the effectiveness of state-of-the-art deep learning models across various community size, historical data availability, and model complexity.
arXiv Detail & Related papers (2025-01-09T06:29:50Z)
Scaling Laws for Predicting Downstream Performance in LLMs [75.28559015477137]
This work focuses on the pre-training loss as a more computation-efficient metric for performance estimation.<n>We present FLP-M, a fundamental approach for performance prediction that addresses the practical need to integrate datasets from multiple sources during pre-training.
arXiv Detail & Related papers (2024-10-11T04:57:48Z)
Variable importance measure for spatial machine learning models with application to air pollution exposure prediction [2.633085745593072]
The objective is to predict air pollution exposures for study subjects at locations without data in order to optimize our ability to learn about health effects of air pollution. We tackle these challenges in two datasets: sulfur (S) from regulatory United States national PM2.5 sub-species data and ultrafine particles (UFP) from a new Seattle-area traffic-related air pollution dataset. Our key contribution is a leave-one-out approach for variable importance that leads to interpretable and comparable measures for a broad class of models.
arXiv Detail & Related papers (2024-06-04T05:51:36Z)
A Comparative Study of Machine Learning Algorithms for Anomaly Detection in Industrial Environments: Performance and Environmental Impact [62.997667081978825]
This study seeks to address the demands of high-performance machine learning models with environmental sustainability. Traditional machine learning algorithms, such as Decision Trees and Random Forests, demonstrate robust efficiency and performance. However, superior outcomes were obtained with optimised configurations, albeit with a commensurate increase in resource consumption.
arXiv Detail & Related papers (2023-07-01T15:18:00Z)
Learning representations with end-to-end models for improved remaining useful life prognostics [64.80885001058572]
The remaining Useful Life (RUL) of equipment is defined as the duration between the current time and its failure. We propose an end-to-end deep learning model based on multi-layer perceptron and long short-term memory layers (LSTM) to predict the RUL. We will discuss how the proposed end-to-end model is able to achieve such good results and compare it to other deep learning and state-of-the-art methods.
arXiv Detail & Related papers (2021-04-11T16:45:18Z)

This list is automatically generated from the titles and abstracts of the papers in this site.