A fairness assessment of mobility-based COVID-19 case prediction models
- URL: http://arxiv.org/abs/2210.03901v2
- Date: Thu, 16 Feb 2023 05:18:36 GMT
- Title: A fairness assessment of mobility-based COVID-19 case prediction models
- Authors: Abdolmajid Erfani, Vanessa Frias-Martinez
- Abstract summary: We tested the hypothesis that bias in the mobility data used to train the predictive models might lead to unfairly less accurate predictions for certain demographic groups.
Specifically, the models tend to favor large, highly educated, wealthy young, urban, and non-black-dominated counties.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In light of the outbreak of COVID-19, analyzing and measuring human mobility
has become increasingly important. A wide range of studies have explored
spatiotemporal trends over time, examined associations with other variables,
evaluated non-pharmacologic interventions (NPIs), and predicted or simulated
COVID-19 spread using mobility data. Despite the benefits of publicly available
mobility data, a key question remains unanswered: are models using mobility
data performing equitably across demographic groups? We hypothesize that bias
in the mobility data used to train the predictive models might lead to unfairly
less accurate predictions for certain demographic groups. To test our
hypothesis, we applied two mobility-based COVID infection prediction models at
the county level in the United States using SafeGraph data, and correlated
model performance with sociodemographic traits. Findings revealed that there is
a systematic bias in models performance toward certain demographic
characteristics. Specifically, the models tend to favor large, highly educated,
wealthy, young, urban, and non-black-dominated counties. We hypothesize that
the mobility data currently used by many predictive models tends to capture
less information about older, poorer, non-white, and less educated regions,
which in turn negatively impacts the accuracy of the COVID-19 prediction in
these regions. Ultimately, this study points to the need of improved data
collection and sampling approaches that allow for an accurate representation of
the mobility patterns across demographic groups.
Related papers
- Modeling, Inference, and Prediction in Mobility-Based Compartmental Models for Epidemiology [5.079807662054658]
We introduce individual mobility as a key factor in disease transmission and control.
We characterize disease dynamics using mobility distribution functions for each compartment.
We infer mobility distributions from the time series of the infected population.
arXiv Detail & Related papers (2024-06-17T18:13:57Z) - DemOpts: Fairness corrections in COVID-19 case prediction models [0.24999074238880484]
We show that state of the art deep learning models output mean prediction errors that are significantly different across racial and ethnic groups.
We propose a novel de-biasing method, DemOpts, to increase the fairness of deep learning based forecasting models trained on potentially biased datasets.
arXiv Detail & Related papers (2024-05-15T16:22:46Z) - A Demographic-Conditioned Variational Autoencoder for fMRI Distribution Sampling and Removal of Confounds [49.34500499203579]
We create a variational autoencoder (VAE)-based model, DemoVAE, to decorrelate fMRI features from demographics.
We generate high-quality synthetic fMRI data based on user-supplied demographics.
arXiv Detail & Related papers (2024-05-13T17:49:20Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals [91.59906995214209]
We propose a new evaluation method, Counterfactual Attentiveness Test (CAT)
CAT uses counterfactuals by replacing part of the input with its counterpart from a different example, expecting an attentive model to change its prediction.
We show that GPT3 becomes less attentive with an increased number of demonstrations, while its accuracy on the test data improves.
arXiv Detail & Related papers (2023-11-16T06:27:35Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Forecasting COVID-19 spreading trough an ensemble of classical and
machine learning models: Spain's case study [0.0]
We evaluate the applicability of an ensemble of population models and machine learning models to predict the near future evolution of the COVID-19 pandemic.
We rely solely in open and public datasets, fusing incidence, vaccination, human mobility and weather data.
arXiv Detail & Related papers (2022-07-12T08:16:44Z) - Biases in human mobility data impact epidemic modeling [0.0]
We identify two types of bias caused by unequal access to, and unequal usage of mobile phones.
We find evidence for data generation bias in all examined datasets in that high-wealth individuals are overrepresented.
To mitigate the skew, we present a framework to debias data and show how simple techniques can be used to increase representativeness.
arXiv Detail & Related papers (2021-12-23T13:20:54Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Combining Graph Neural Networks and Spatio-temporal Disease Models to
Predict COVID-19 Cases in Germany [0.0]
Several experts have called for the necessity to account for human mobility to explain the spread of COVID-19.
Most statistical or epidemiological models cannot directly incorporate unstructured data sources, including data that may encode human mobility.
We propose a trade-off between both research directions and present a novel learning approach that combines the advantages of statistical regression and machine learning models.
arXiv Detail & Related papers (2021-01-03T16:39:00Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.