AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment
- URL: http://arxiv.org/abs/2502.17919v1
- Date: Tue, 25 Feb 2025 07:34:18 GMT
- Title: AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment
- Authors: Vishal Nedungadi, Muhammad Akhtar Munir, Marc RuĆwurm, Ron Sarafian, Ioannis N. Athanasiadis, Yinon Rudich, Fahad Shahbaz Khan, Salman Khan,
- Abstract summary: Air pollution remains a leading global health risk, exacerbated by rapid industrialization and urbanization.<n>We introduce AirCast, a novel multi-variable air pollution forecasting model.<n>AirCast employs a multi-task head architecture that simultaneously forecasts atmospheric conditions and pollutant concentrations.
- Score: 46.56288727659417
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Air pollution remains a leading global health risk, exacerbated by rapid industrialization and urbanization, contributing significantly to morbidity and mortality rates. In this paper, we introduce AirCast, a novel multi-variable air pollution forecasting model, by combining weather and air quality variables. AirCast employs a multi-task head architecture that simultaneously forecasts atmospheric conditions and pollutant concentrations, improving its understanding of how weather patterns affect air quality. Predicting extreme pollution events is challenging due to their rare occurrence in historic data, resulting in a heavy-tailed distribution of pollution levels. To address this, we propose a novel Frequency-weighted Mean Absolute Error (fMAE) loss, adapted from the class-balanced loss for regression tasks. Informed from domain knowledge, we investigate the selection of key variables known to influence pollution levels. Additionally, we align existing weather and chemical datasets across spatial and temporal dimensions. AirCast's integrated approach, combining multi-task learning, frequency weighted loss and domain informed variable selection, enables more accurate pollution forecasts. Our source code and models are made public here (https://github.com/vishalned/AirCast.git)
Related papers
- Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network [47.699409089023696]
Air quality prediction plays a crucial role in public health and environmental protection.
Existing works underestimate the critical role atmospheric conditions in air quality prediction.
MDSTNet is an encoder framework explicitly that captures atmosphere-pollution dependencies for prediction.
ChinaAirNet is the first dataset combining air quality records with multi-pressure-level meteorological observations.
arXiv Detail & Related papers (2025-04-14T09:18:11Z) - Back to the Future: GNN-based NO$_2$ Forecasting via Future Covariates [49.93577170464313]
We deal with air quality observations in a city-wide network of ground monitoring stations.
We propose a conditioning block that embeds past and future covariates into the current observations.
We find that conditioning on future weather information has a greater impact than considering past traffic conditions.
arXiv Detail & Related papers (2024-04-08T09:13:16Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Multi-scale Digital Twin: Developing a fast and physics-informed
surrogate model for groundwater contamination with uncertain climate models [53.44486283038738]
Climate change exacerbates the long-term soil management problem of groundwater contamination.
We develop a physics-informed machine learning surrogate model using U-Net enhanced Fourier Neural Contaminated (PDENO)
In parallel, we develop a convolutional autoencoder combined with climate data to reduce the dimensionality of climatic region similarities across the United States.
arXiv Detail & Related papers (2022-11-20T06:46:35Z) - Data-driven Real-time Short-term Prediction of Air Quality: Comparison
of ES, ARIMA, and LSTM [0.0]
We use a data-driven approach to predict air quality based on historical data.
Considering prediction accuracy and time complexity, our experiments reveal that for short-term air pollution prediction ES performs better than ARIMA and LSTM.
arXiv Detail & Related papers (2022-11-16T09:37:08Z) - Deciphering Environmental Air Pollution with Large Scale City Data [0.0]
Various factors ranging from emissions from traffic and power plants, household emissions, natural causes are known to be primary causal agents or influencers behind rising air pollution levels.
We introduce a large scale city-wise dataset for exploring the relationships among these agents over a long period of time.
Also, we provide a set of benchmarks for the problem of estimating or forecasting pollutant levels with a set of diverse models and methodologies.
arXiv Detail & Related papers (2021-09-09T22:00:51Z) - Estimation of Air Pollution with Remote Sensing Data: Revealing
Greenhouse Gas Emissions from Space [1.9659095632676094]
Existing models for surface-level air pollution rely on extensive land-use datasets which are often locally restricted and temporally static.
This work proposes a deep learning approach for the prediction of ambient air pollution that only relies on remote sensing data that is globally available and frequently updated.
arXiv Detail & Related papers (2021-08-31T14:58:04Z) - HighAir: A Hierarchical Graph Neural Network-Based Air Quality
Forecasting Method [8.86417830514213]
HighAir is a hierarchical graph neural network-based air quality forecasting method.
We construct a city-level graph and station-level graphs from a hierarchical perspective.
We compare HighAir with the state-of-the-art air quality forecasting methods on the dataset of Yangtze River Delta city group.
arXiv Detail & Related papers (2021-01-12T02:31:14Z) - DeepClimGAN: A High-Resolution Climate Data Generator [60.59639064716545]
Earth system models (ESMs) are often used to generate future projections of climate change scenarios.
As a compromise, emulators are substantially less expensive but may not have all of the complexity of an ESM.
Here we demonstrate the use of a conditional generative adversarial network (GAN) to act as an ESM emulator.
arXiv Detail & Related papers (2020-11-23T20:13:37Z) - Conditional Generative Adversarial Networks to Model Urban Outdoor Air
Pollution [0.8122270502556374]
We propose to train models able to generate synthetic nitrogen dioxide daily time series according to a given classification.
The proposed approach is able to generate accurate and diverse pollution daily time series, while requiring reduced computational time.
arXiv Detail & Related papers (2020-10-05T18:01:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.