Predicting Zip Code-Level Vaccine Hesitancy in US Metropolitan Areas
Using Machine Learning Models on Public Tweets
- URL: http://arxiv.org/abs/2108.01699v1
- Date: Tue, 3 Aug 2021 18:43:46 GMT
- Title: Predicting Zip Code-Level Vaccine Hesitancy in US Metropolitan Areas
Using Machine Learning Models on Public Tweets
- Authors: Sara Melotte and Mayank Kejriwal
- Abstract summary: We present a methodology and experimental study, using publicly available Twitter data collected over the last year.
Our goal is not to devise novel machine learning algorithms, but to evaluate existing and established models in a comparative framework.
- Score: 10.45742327204133
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Although the recent rise and uptake of COVID-19 vaccines in the United States
has been encouraging, there continues to be significant vaccine hesitancy in
various geographic and demographic clusters of the adult population. Surveys,
such as the one conducted by Gallup over the past year, can be useful in
determining vaccine hesitancy, but can be expensive to conduct and do not
provide real-time data. At the same time, the advent of social media suggests
that it may be possible to get vaccine hesitancy signals at an aggregate level
(such as at the level of zip codes) by using machine learning models and
socioeconomic (and other) features from publicly available sources. It is an
open question at present whether such an endeavor is feasible, and how it
compares to baselines that only use constant priors. To our knowledge, a proper
methodology and evaluation results using real data has also not been presented.
In this article, we present such a methodology and experimental study, using
publicly available Twitter data collected over the last year. Our goal is not
to devise novel machine learning algorithms, but to evaluate existing and
established models in a comparative framework. We show that the best models
significantly outperform constant priors, and can be set up using open-source
tools.
Related papers
- Deep Reinforcement Learning Framework for Thoracic Diseases
Classification via Prior Knowledge Guidance [49.87607548975686]
The scarcity of labeled data for related diseases poses a huge challenge to an accurate diagnosis.
We propose a novel deep reinforcement learning framework, which introduces prior knowledge to direct the learning of diagnostic agents.
Our approach's performance was demonstrated using the well-known NIHX-ray 14 and CheXpert datasets.
arXiv Detail & Related papers (2023-06-02T01:46:31Z) - ASPEST: Bridging the Gap Between Active Learning and Selective
Prediction [56.001808843574395]
Selective prediction aims to learn a reliable model that abstains from making predictions when uncertain.
Active learning aims to lower the overall labeling effort, and hence human dependence, by querying the most informative examples.
In this work, we introduce a new learning paradigm, active selective prediction, which aims to query more informative samples from the shifted target domain.
arXiv Detail & Related papers (2023-04-07T23:51:07Z) - Evaluating COVID-19 vaccine allocation policies using Bayesian $m$-top
exploration [53.122045119395594]
We present a novel technique for evaluating vaccine allocation strategies using a multi-armed bandit framework.
$m$-top exploration allows the algorithm to learn $m$ policies for which it expects the highest utility.
We consider the Belgian COVID-19 epidemic using the individual-based model STRIDE, where we learn a set of vaccination policies.
arXiv Detail & Related papers (2023-01-30T12:22:30Z) - "Double vaccinated, 5G boosted!": Learning Attitudes towards COVID-19
Vaccination from Social Media [4.178929174617172]
We leverage the textual posts on social media to extract and track users' vaccination stances in near real time.
We integrate the recent posts of a user's social network neighbours to help detect the user's genuine attitude.
Based on our annotated dataset from Twitter, the models instantiated from our framework can increase the performance of attitude extraction by up to 23%.
arXiv Detail & Related papers (2022-06-27T17:04:56Z) - A Multilingual Dataset of COVID-19 Vaccination Attitudes on Twitter [4.696697601424039]
We describe the collection and release of a dataset of tweets related to COVID-19 vaccines.
This dataset consists of the IDs of 2,198,090 tweets collected from Western Europe, 17,934 of which are annotated with the originators' vaccination stances.
arXiv Detail & Related papers (2022-06-27T13:44:48Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - Machine learning approaches for localized lockdown during COVID-19: a
case study analysis [0.0]
Sars-CoV-2 emerged as a significant acute respiratory disease that has become a global pandemic.
Brazil has had difficulty in dealing with the virus due to the high socioeconomic difference of states and municipalities.
This study presents a new approach using different machine learning and deep learning algorithms applied to Brazilian COVID-19 data.
arXiv Detail & Related papers (2022-01-03T15:32:06Z) - Can Self Reported Symptoms Predict Daily COVID-19 Cases? [12.029443053416399]
We develop machine learning models to estimate the prevalence of COVID-19 using self-reported symptoms.
Our results indicate a lower error on the local models as opposed to the global model.
This work demonstrates that the models developed on crowd-sourced data, curated via online platforms, can complement the existing epidemiological surveillance infrastructure.
arXiv Detail & Related papers (2021-05-18T07:26:09Z) - Impact of Interventional Policies Including Vaccine on Covid-19
Propagation and Socio-Economic Factors [0.7874708385247353]
This study aims to provide a predictive analytics framework to model, predict and simulate COVID-19 propagation and socio-economic impact.
We have leveraged a recently launched open-source COVID-19 big data platform and used published research to find potentially relevant variables.
An advanced machine learning pipeline has been developed armed with a self-evolving model, deployed on a modern machine learning architecture.
arXiv Detail & Related papers (2021-01-11T15:08:07Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.