The Unpaid Toll: Quantifying and Addressing the Public Health Impact of Data Centers
- URL: http://arxiv.org/abs/2412.06288v2
- Date: Thu, 23 Oct 2025 22:23:59 GMT
- Title: The Unpaid Toll: Quantifying and Addressing the Public Health Impact of Data Centers
- Authors: Yuelin Han, Zhifeng Wu, Pengfei Li, Adam Wierman, Shaolei Ren,
- Abstract summary: surging demand for AI has led to a rapid expansion of energy-intensive data centers.<n>While significant attention has been paid to data centers' growing environmental footprint, the public health burden has been largely overlooked.<n>This paper introduces a principled methodology to model lifecycle pollutant emissions for data centers and computing tasks.
- Score: 32.67029206382299
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The surging demand for AI has led to a rapid expansion of energy-intensive data centers, impacting the environment through escalating carbon emissions and water consumption. While significant attention has been paid to data centers' growing environmental footprint, the public health burden, a hidden toll of data centers, has been largely overlooked. Specifically, data centers' lifecycle, from chip manufacturing to operation, can significantly degrade air quality through emissions of criteria air pollutants such as fine particulate matter, substantially impacting public health. This paper introduces a principled methodology to model lifecycle pollutant emissions for data centers and computing tasks, quantifying the public health impacts. Our findings reveal that training a large AI model comparable to the Llama-3.1 scale can produce air pollutants equivalent to more than 10,000 round trips by car between Los Angeles and New York City. The growing demand for AI is projected to push the total annual public health burden of U.S. data centers up to more than $20 billion in 2028, rivaling that of on-road emissions of California. Further, the public health costs are more felt in disadvantaged communities, where the per-household health burden could be 200x more than that in less-impacted communities. Finally, we propose a health-informed computing framework that explicitly incorporates public health risk as a key metric for scheduling data center workloads across space and time, which can effectively mitigate adverse health impacts while advancing environmental sustainability. More broadly, we also recommend adopting a standard reporting protocol for the public health impacts of data centers and paying attention to all impacted communities.
Related papers
- Predicting Public Health Impacts of Electricity Usage [20.400038737666545]
Electric power sector is a leading source of air pollutant emissions, impacting the public health of nearly every community.<n>We introduce HealthPredictor, a domain-specific AI model that provides an end-to-end pipeline linking electricity use to public health outcomes.
arXiv Detail & Related papers (2025-11-27T02:33:13Z) - Synergistic Neural Forecasting of Air Pollution with Stochastic Sampling [50.3911487821783]
Air pollution remains a leading global health and environmental risk, particularly in regions vulnerable to episodic air pollution spikes due to wildfires, urban haze and dust storms.<n>Here, we present SynCast, a high-resolution neural forecasting model that integrates meteorological and air composition data to improve predictions of both average and extreme pollution levels.
arXiv Detail & Related papers (2025-10-28T01:18:00Z) - SatHealth: A Multimodal Public Health Dataset with Satellite-based Environmental Factors [8.107949747896107]
SatHealth is a dataset combining environmental data, satellite images, all-disease prevalences estimated from medical claims, and social determinants of health indicators.<n>We conducted experiments under two use cases with SatHealth: regional public health modeling and personal disease risk prediction.<n>Results show that living environmental information can significantly improve AI models' performance and temporal-spatial generalizability.
arXiv Detail & Related papers (2025-06-16T16:34:41Z) - An AI-driven framework for the prediction of personalised health response to air pollution [30.858937705130106]
Air pollution poses a significant threat to public health, causing or exacerbating many respiratory and cardiovascular diseases.<n>Recent advances in personal sensing have transformed the collection of behavioural and physiological data.<n>We present a novel workflow for predicting personalised health responses to pollution by integrating physiological data from wearable fitness devices with real-time environmental exposures.
arXiv Detail & Related papers (2025-05-15T17:59:07Z) - AirCast: Improving Air Pollution Forecasting Through Multi-Variable Data Alignment [46.56288727659417]
Air pollution remains a leading global health risk, exacerbated by rapid industrialization and urbanization.
We introduce AirCast, a novel multi-variable air pollution forecasting model.
AirCast employs a multi-task head architecture that simultaneously forecasts atmospheric conditions and pollutant concentrations.
arXiv Detail & Related papers (2025-02-25T07:34:18Z) - From Efficiency Gains to Rebound Effects: The Problem of Jevons' Paradox in AI's Polarized Environmental Debate [69.05573887799203]
Much of this debate has concentrated on direct impact without addressing the significant indirect effects.
This paper examines how the problem of Jevons' Paradox applies to AI, whereby efficiency gains may paradoxically spur increased consumption.
We argue that understanding these second-order impacts requires an interdisciplinary approach, combining lifecycle assessments with socio-economic analyses.
arXiv Detail & Related papers (2025-01-27T22:45:06Z) - Towards Optimal Environmental Policies: Policy Learning under Arbitrary Bipartite Network Interference [0.0]
Emissions-reducing interventions on coal-fired power plants have proven to be an effective, but costly, strategy for reducing pollution-related health burdens.
We introduce novel learning methods to determine the optimal policy under arbitrary network interference (BNI)
We find that annual IHD hospitalization rates could be reduced in a range from 20.66-44.51 per 10,000 person-years through optimal policies under different cost constraints.
arXiv Detail & Related papers (2024-10-10T20:39:53Z) - Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI [67.58673784790375]
We argue that the 'bigger is better' AI paradigm is not only fragile scientifically, but comes with undesirable consequences.
First, it is not sustainable, as its compute demands increase faster than model performance, leading to unreasonable economic requirements and a disproportionate environmental footprint.
Second, it implies focusing on certain problems at the expense of others, leaving aside important applications, e.g. health, education, or the climate.
arXiv Detail & Related papers (2024-09-21T14:43:54Z) - Data Contamination Report from the 2024 CONDA Shared Task [78.50743680642405]
This first compilation paper is based on 566 reported entries over 91 contaminated sources from a total of 23 contributors.
The goal of the shared task and associated database is to assist the community in understanding the extent of the problem and to assist researchers in avoiding reporting evaluation results on known contaminated resources.
arXiv Detail & Related papers (2024-07-31T11:26:57Z) - Generative AI for Low-Carbon Artificial Intelligence of Things with Large Language Models [67.0243099823109]
Generative AI (GAI) holds immense potential to reduce carbon emissions of Artificial Intelligence of Things (AIoT)
In this article, we explore the potential of GAI for carbon emissions reduction and propose a novel GAI-enabled solution for low-carbon AIoT.
We propose a Large Language Model (LLM)-enabled carbon emission optimization framework, in which we design pluggable LLM and Retrieval Augmented Generation (RAG) modules.
arXiv Detail & Related papers (2024-04-28T05:46:28Z) - Environmental Insights: Democratizing Access to Ambient Air Pollution
Data and Predictive Analytics with an Open-Source Python Package [0.0]
Environmental Insights is an open-source Python package designed to democratize access to air pollution concentration data.
This tool enables users to retrieve historical air pollution data and employ a Machine Learning model for forecasting potential future conditions.
arXiv Detail & Related papers (2024-03-06T12:34:50Z) - A Framework for Scalable Ambient Air Pollution Concentration Estimation [0.0]
Ambient air pollution remains a critical issue in the United Kingdom, where data on air pollution concentrations form the foundation for interventions aimed at improving air quality.
We introduce a data-driven supervised machine learning model framework designed to address temporal and spatial data gaps by filling missing measurements.
This approach provides a comprehensive dataset for England throughout 2018 at a 1kmx1km hourly resolution.
arXiv Detail & Related papers (2024-01-16T18:03:07Z) - Novel Regression and Least Square Support Vector Machine Learning
Technique for Air Pollution Forecasting [0.0]
Improper detection of air pollution benchmarks results in severe complications for humans and living creatures.
A novel technique called, Discretized Regression and Least Square Support Vector (DR-LSSV) based air pollution forecasting is proposed.
The results indicate that the proposed DR-LSSV Technique can efficiently enhance air pollution forecasting performance.
arXiv Detail & Related papers (2023-06-11T06:56:00Z) - Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language
Model [72.65502770895417]
We quantify the carbon footprint of BLOOM, a 176-billion parameter language model, across its life cycle.
We estimate that BLOOM's final training emitted approximately 24.7 tonnes ofcarboneqif we consider only the dynamic power consumption.
We conclude with a discussion regarding the difficulty of precisely estimating the carbon footprint of machine learning models.
arXiv Detail & Related papers (2022-11-03T17:13:48Z) - AiR -- An Augmented Reality Application for Visualizing Air Pollution [5.564705758320338]
AiR considers the air quality measured by CPCB, in a locality detected by the user's GPS or in a locality of user's choice, and visualizes various air pollutants present in the locality.
AiR also creates awareness in an interactive manner about the different pollutants, sources, and their impacts on health.
arXiv Detail & Related papers (2020-06-03T10:03:47Z) - HazeDose: Design and Analysis of a Personal Air Pollution Inhaled Dose
Estimation System using Wearable Sensors [6.284628903370058]
We extend the paradigm to HazeDose system, which can personalize the individuals' air pollution exposure.
Users can visualize their personalized air pollution exposure information via a mobile application.
One algorithm is also introduced to balance the execution time and dosage reduction for alternative routes scenarios.
arXiv Detail & Related papers (2020-05-28T02:35:13Z) - COVI White Paper [67.04578448931741]
Contact tracing is an essential tool to change the course of the Covid-19 pandemic.
We present an overview of the rationale, design, ethical considerations and privacy strategy of COVI,' a Covid-19 public peer-to-peer contact tracing and risk awareness mobile application developed in Canada.
arXiv Detail & Related papers (2020-05-18T07:40:49Z) - Digital Ariadne: Citizen Empowerment for Epidemic Control [55.41644538483948]
The COVID-19 crisis represents the most dangerous threat to public health since the H1N1 pandemic of 1918.
Technology-assisted location and contact tracing, if broadly adopted, may help limit the spread of infectious diseases.
We present a tool, called 'diAry' or 'digital Ariadne', based on voluntary location and Bluetooth tracking on personal devices.
arXiv Detail & Related papers (2020-04-16T15:53:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.