A review on development of eco-friendly filters in Nepal for use in cigarettes and masks and Air Pollution Analysis with Machine Learning and SHAP Interpretability
- URL: http://arxiv.org/abs/2501.13369v1
- Date: Thu, 23 Jan 2025 04:16:58 GMT
- Title: A review on development of eco-friendly filters in Nepal for use in cigarettes and masks and Air Pollution Analysis with Machine Learning and SHAP Interpretability
- Authors: Bishwash Paneru, Biplov Paneru, Tanka Mukhiya, Khem Narayan Poudyal,
- Abstract summary: The Air Quality Index (AQI) is predicted in this work using a Random Forest Regressor.
With the lowest Testing RMSE (0.23) and flawless R2 scores (1.00), CatBoost performs better than other models.
This study investigates the Hydrogen-Alpha (HA) biodegradable filter as a novel way to reduce the related health hazards.
- Score: 0.0
- License:
- Abstract: In Nepal, air pollution is a serious public health concern, especially in cities like Kathmandu where particulate matter (PM2.5 and PM10) has a major influence on respiratory health and air quality. The Air Quality Index (AQI) is predicted in this work using a Random Forest Regressor, and the model's predictions are interpreted using SHAP (SHapley Additive exPlanations) analysis. With the lowest Testing RMSE (0.23) and flawless R2 scores (1.00), CatBoost performs better than other models, demonstrating its greater accuracy and generalization which is cross validated using a nested cross validation approach. NowCast Concentration and Raw Concentration are the most important elements influencing AQI values, according to SHAP research, which shows that the machine learning results are highly accurate. Their significance as major contributors to air pollution is highlighted by the fact that high values of these characteristics significantly raise the AQI. This study investigates the Hydrogen-Alpha (HA) biodegradable filter as a novel way to reduce the related health hazards. With removal efficiency of more than 98% for PM2.5 and 99.24% for PM10, the HA filter offers exceptional defense against dangerous airborne particles. These devices, which are biodegradable face masks and cigarette filters, address the environmental issues associated with traditional filters' non-biodegradable trash while also lowering exposure to air contaminants.
Related papers
- Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation [46.148465860465095]
We study the effects of contamination on language models at 1B and 8B scales on the machine translation task.
Our experiments reveal that contamination with both source and target substantially inflates BLEU scores.
In contrast, source-only and target-only contamination generally produce smaller, less consistent over-estimations.
arXiv Detail & Related papers (2025-01-30T21:51:18Z) - Can Explainable AI Assess Personalized Health Risks from Indoor Air Pollution? [0.0594961162060159]
Existing research primarily focused on monitoring, lacking accuracy in pinpointing indoor pollution sources.
A survey of 143 participants revealed limited awareness of indoor air pollution.
Our method integrates Decision Trees, Random Forest, Naive Bayes, and SVM models, excelling at 99.8% accuracy with Decision Trees.
arXiv Detail & Related papers (2025-01-07T03:37:13Z) - Smoke and Mirrors in Causal Downstream Tasks [59.90654397037007]
This paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations.
We compare 6 480 models fine-tuned from state-of-the-art visual backbones, and find that the sampling and modeling choices significantly affect the accuracy of the causal estimate.
Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones.
arXiv Detail & Related papers (2024-05-27T13:26:34Z) - Investigating Data Contamination for Pre-training Language Models [46.335755305642564]
We explore the impact of data contamination at the pre-training stage by pre-training a series of GPT-2 models.
We highlight the effect of both text contamination (textiti.e. input text of the evaluation samples) and ground-truth contamination (textiti.e. the prompts asked on the input and the desired outputs) from evaluation data.
arXiv Detail & Related papers (2024-01-11T17:24:49Z) - Novel Regression and Least Square Support Vector Machine Learning
Technique for Air Pollution Forecasting [0.0]
Improper detection of air pollution benchmarks results in severe complications for humans and living creatures.
A novel technique called, Discretized Regression and Least Square Support Vector (DR-LSSV) based air pollution forecasting is proposed.
The results indicate that the proposed DR-LSSV Technique can efficiently enhance air pollution forecasting performance.
arXiv Detail & Related papers (2023-06-11T06:56:00Z) - Evaluation of the potential of Near Infrared Hyperspectral Imaging for
monitoring the invasive brown marmorated stink bug [53.682955739083056]
The brown marmorated stink bug (BMSB), Halyomorpha halys, is an invasive insect pest of global importance that damages several crops.
The present study consists in a preliminary evaluation at the laboratory level of Near Infrared Hyperspectral Imaging (NIR-HSI) as a possible technology to detect BMSB specimens.
arXiv Detail & Related papers (2023-01-19T11:37:20Z) - AirFormer: Predicting Nationwide Air Quality in China with Transformers [43.48965814702661]
AirFormer is a novel Transformer architecture to collectively predict nationwide air quality in China.
AirFormer reduces prediction errors by 5%8% on 72-hour future predictions.
arXiv Detail & Related papers (2022-11-29T07:22:49Z) - Using Machine Learning to Predict Air Quality Index in New Delhi [0.0]
We use a Support Vector Regression (SVR) model to forecast the levels of various pollutants and the air quality index.
The model predicts levels of various pollutants, like, sulfur dioxide, carbon monoxide, nitrogen dioxide, particulate matter 2.5, and ground-level ozone, at an accuracy of 93.4 percent.
arXiv Detail & Related papers (2021-12-10T00:20:05Z) - Evaluation of Time Series Forecasting Models for Estimation of PM2.5
Levels in Air [0.0]
The study adopts ARIMA, FBProphet, and deep learning models such as LSTM, 1D CNN, to estimate the concentration of PM2.5 in the environment.
Our predicted results convey that all adopted methods give comparative outcomes in terms of average root mean squared error.
arXiv Detail & Related papers (2021-04-07T16:24:39Z) - HVAQ: A High-Resolution Vision-Based Air Quality Dataset [3.9523800511973017]
We present a high temporal and spatial resolution air quality dataset consisting of PM2.5, PM10, temperature, and humidity data.
We evaluate several vision-based state-of-art PM concentration prediction algorithms on our dataset and demonstrate that prediction accuracy increases with sensor density and image.
arXiv Detail & Related papers (2021-02-18T13:42:34Z) - Unassisted Noise Reduction of Chemical Reaction Data Sets [59.127921057012564]
We propose a machine learning-based, unassisted approach to remove chemically wrong entries from data sets.
Our results show an improved prediction quality for models trained on the cleaned and balanced data sets.
arXiv Detail & Related papers (2021-02-02T09:34:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.