Air Pollution Forecasting in Bucharest
- URL: http://arxiv.org/abs/2511.00532v1
- Date: Sat, 01 Nov 2025 12:24:11 GMT
- Title: Air Pollution Forecasting in Bucharest
- Authors: Dragoş-Andrei Şerban, Răzvan-Alexandru Smădu, Dumitru-Clementin Cercel,
- Abstract summary: Air pollution, especially the particulate matter 2.5 (PM2.5), has become a growing concern in recent years.<n>This paper aims to design, fine-tune, test, and evaluate machine learning models for predicting future levels of PM2.5.
- Score: 1.7035480256339337
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Air pollution, especially the particulate matter 2.5 (PM2.5), has become a growing concern in recent years, primarily in urban areas. Being exposed to air pollution is linked to developing numerous health problems, like the aggravation of respiratory diseases, cardiovascular disorders, lung function impairment, and even cancer or early death. Forecasting future levels of PM2.5 has become increasingly important over the past few years, as it can provide early warnings and help prevent diseases. This paper aims to design, fine-tune, test, and evaluate machine learning models for predicting future levels of PM2.5 over various time horizons. Our primary objective is to assess and compare the performance of multiple models, ranging from linear regression algorithms and ensemble-based methods to deep learning models, such as advanced recurrent neural networks and transformers, as well as large language models, on this forecasting task.
Related papers
- Investigating the Impact of Histopathological Foundation Models on Regressive Prediction of Homologous Recombination Deficiency [52.50039435394964]
We systematically evaluate foundation models for regression-based tasks.<n>We extract patch-level features from whole slide images (WSI) using five state-of-the-art foundation models.<n>Models are trained to predict continuous HRD scores based on these extracted features across breast, endometrial, and lung cancer cohorts.
arXiv Detail & Related papers (2026-01-29T14:06:50Z) - Synergistic Neural Forecasting of Air Pollution with Stochastic Sampling [50.3911487821783]
Air pollution remains a leading global health and environmental risk, particularly in regions vulnerable to episodic air pollution spikes due to wildfires, urban haze and dust storms.<n>Here, we present SynCast, a high-resolution neural forecasting model that integrates meteorological and air composition data to improve predictions of both average and extreme pollution levels.
arXiv Detail & Related papers (2025-10-28T01:18:00Z) - A comparison between geostatistical and machine learning models for spatio-temporal prediction of PM2.5 data [0.0]
Exposure to high concentrations of PM2.5$ have been linked to increased respiratory and cardiovascular hospital admissions, more emergency department visits and deaths.<n>Traditional air quality monitoring systems provide limited spatial and temporal data.<n>The advent of low-cost sensors has dramatically improved the granularity of air quality data, enabling real-time, high-resolution monitoring.<n>This study exploits the extensive data from PurpleAir sensors to assess and compare the effectiveness of various statistical and machine learning models in producing accurate hourly PM$_2.5$ maps across California.
arXiv Detail & Related papers (2025-09-15T15:32:57Z) - Air Quality PM2.5 Index Prediction Model Based on CNN-LSTM [0.2796197251957245]
We propose an air quality PM2.5 index prediction model based on a hybrid CNN-LSTM architecture.<n>The model effectively combines Convolutional Neural Networks (CNN) for local spatial feature extraction and Long Short-Term Memory (LSTM) networks for modeling temporal dependencies in time series data.<n> Experimental results show that the model achieves a root mean square error (RMSE) of 5.236, outperforming traditional time series models in both accuracy and generalization.
arXiv Detail & Related papers (2025-08-15T04:46:25Z) - Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination [67.67725938962798]
Pre-training on massive web-scale corpora leaves Qwen2.5 susceptible to data contamination in widely used benchmarks.<n>We introduce a generator that creates fully clean arithmetic problems of arbitrary length and difficulty, dubbed RandomCalculation.<n>We show that only accurate reward signals yield steady improvements that surpass the base model's performance boundary.
arXiv Detail & Related papers (2025-07-14T17:55:15Z) - AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment [46.56288727659417]
Air pollution remains a leading global health risk, exacerbated by rapid industrialization and urbanization.<n>We introduce AirCast, a novel multi-variable air pollution forecasting model.<n>AirCast employs a multi-task head architecture that simultaneously forecasts atmospheric conditions and pollutant concentrations.
arXiv Detail & Related papers (2025-02-25T07:34:18Z) - Back to the Future: GNN-based NO$_2$ Forecasting via Future Covariates [49.93577170464313]
We deal with air quality observations in a city-wide network of ground monitoring stations.
We propose a conditioning block that embeds past and future covariates into the current observations.
We find that conditioning on future weather information has a greater impact than considering past traffic conditions.
arXiv Detail & Related papers (2024-04-08T09:13:16Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - Safe AI for health and beyond -- Monitoring to transform a health
service [51.8524501805308]
We will assess the infrastructure required to monitor the outputs of a machine learning algorithm.
We will present two scenarios with examples of monitoring and updates of models.
arXiv Detail & Related papers (2023-03-02T17:27:45Z) - Detecting Elevated Air Pollution Levels by Monitoring Web Search
Queries: Deep Learning-Based Time Series Forecasting [7.978612711536259]
Prior work relied on modeling pollutant concentrations collected from ground-based monitors and meteorological data for long-term forecasting.
This study aims to develop and validate models to nowcast the observed pollution levels using Web search data, which is publicly available in near real-time from major search engines.
We developed novel machine learning-based models using both traditional supervised classification methods and state-of-the-art deep learning methods to detect elevated air pollution levels at the US city level.
arXiv Detail & Related papers (2022-11-09T23:56:35Z) - Evaluation of Time Series Forecasting Models for Estimation of PM2.5
Levels in Air [0.0]
The study adopts ARIMA, FBProphet, and deep learning models such as LSTM, 1D CNN, to estimate the concentration of PM2.5 in the environment.
Our predicted results convey that all adopted methods give comparative outcomes in terms of average root mean squared error.
arXiv Detail & Related papers (2021-04-07T16:24:39Z) - A Novel Prediction Approach for Exploring PM2.5 Spatiotemporal
Propagation Based on Convolutional Recursive Neural Networks [7.131106953836335]
The prediction system of PM2.5 propagation provides more detailed and accurate information as an early warning system to reduce health impacts on the community.
This research was conducted by using dataset of air quality monitoring systems in Taiwan.
In general, the model is able to provide accurate predictive results by considering the bonds among measurement nodes in both spatially and temporally.
arXiv Detail & Related papers (2021-01-15T17:00:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.