COVID-19 Vaccine Misinformation in Middle Income Countries
- URL: http://arxiv.org/abs/2311.18195v1
- Date: Thu, 30 Nov 2023 02:27:34 GMT
- Title: COVID-19 Vaccine Misinformation in Middle Income Countries
- Authors: Jongin Kim, Byeo Rhee Back, Aditya Agrawal, Jiaxi Wu, Veronika J.
Wirtz, Traci Hong, Derry Wijaya
- Abstract summary: This paper introduces a multilingual dataset of COVID-19 vaccine misinformation, consisting of annotated tweets from three middle-income countries: Brazil, Indonesia, and Nigeria.
The dataset includes annotations for 5,952 tweets, assessing their relevance to COVID-19 vaccines, presence of misinformation, and the themes of the misinformation.
- Score: 5.891662430960944
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper introduces a multilingual dataset of COVID-19 vaccine
misinformation, consisting of annotated tweets from three middle-income
countries: Brazil, Indonesia, and Nigeria. The expertly curated dataset
includes annotations for 5,952 tweets, assessing their relevance to COVID-19
vaccines, presence of misinformation, and the themes of the misinformation. To
address challenges posed by domain specificity, the low-resource setting, and
data imbalance, we adopt two approaches for developing COVID-19 vaccine
misinformation detection models: domain-specific pre-training and text
augmentation using a large language model. Our best misinformation detection
models demonstrate improvements ranging from 2.7 to 15.9 percentage points in
macro F1-score compared to the baseline models. Additionally, we apply our
misinformation detection models in a large-scale study of 19 million unlabeled
tweets from the three countries between 2020 and 2022, showcasing the practical
application of our dataset and models for detecting and analyzing vaccine
misinformation in multiple countries and languages. Our analysis indicates that
percentage changes in the number of new COVID-19 cases are positively
associated with COVID-19 vaccine misinformation rates in a staggered manner for
Brazil and Indonesia, and there are significant positive associations between
the misinformation rates across the three countries.
Related papers
- Analyzing COVID-19 Vaccination Sentiments in Nigerian Cyberspace:
Insights from a Manually Annotated Twitter Dataset [2.820717448579396]
We explore the use of transformer-based language models to study people's acceptance of vaccines in Nigeria.
We developed a novel dataset by crawling multi-lingual tweets using relevant hashtags and keywords.
Our analysis and visualizations revealed that most tweets expressed neutral sentiments about COVID-19 vaccines, with some individuals expressing positive views.
arXiv Detail & Related papers (2024-01-23T22:49:19Z) - Leveraging Large Language Models and Weak Supervision for Social Media
data annotation: an evaluation using COVID-19 self-reported vaccination
tweets [1.9988653168573556]
Social media platforms have become a popular medium for discussions on vaccine-related topics.
In this study, we evaluate the usage of Large Language Models, in this case GPT-4, and weak supervision, to identify COVID-19 vaccine-related tweets.
arXiv Detail & Related papers (2023-09-12T18:18:23Z) - Dense Feature Memory Augmented Transformers for COVID-19 Vaccination
Search Classification [60.49594822215981]
This paper presents a classification model for detecting COVID-19 vaccination related search queries.
We propose a novel approach of considering dense features as memory tokens that the model can attend to.
We show that this new modeling approach enables a significant improvement to the Vaccine Search Insights (VSI) task.
arXiv Detail & Related papers (2022-12-16T13:57:41Z) - Visual Exploratory Data Analysis of the Covid-19 Vaccination Progress in
Nigeria [0.0]
Since the covid-19 vaccine came to Nigeria; 18,728,188 people have been fully vaccinated as at May 31st, 2022.
This is roughly 10% of the Nigerian population estimated at 206.7 million.
The data used for this research contain the state-by-state breakdown of Covid-19 vaccine distribution recorded between March 5th, 2021, and May 31st, 2022.
arXiv Detail & Related papers (2022-08-20T09:52:18Z) - A Multilingual Dataset of COVID-19 Vaccination Attitudes on Twitter [4.696697601424039]
We describe the collection and release of a dataset of tweets related to COVID-19 vaccines.
This dataset consists of the IDs of 2,198,090 tweets collected from Western Europe, 17,934 of which are annotated with the originators' vaccination stances.
arXiv Detail & Related papers (2022-06-27T13:44:48Z) - Few-Shot Cross-lingual Transfer for Coarse-grained De-identification of
Code-Mixed Clinical Texts [56.72488923420374]
Pre-trained language models (LMs) have shown great potential for cross-lingual transfer in low-resource settings.
We show the few-shot cross-lingual transfer property of LMs for named recognition (NER) and apply it to solve a low-resource and real-world challenge of code-mixed (Spanish-Catalan) clinical notes de-identification in the stroke.
arXiv Detail & Related papers (2022-04-10T21:46:52Z) - Automatic Detection of COVID-19 Vaccine Misinformation with Graph Link
Prediction [2.0625936401496237]
Vaccine hesitancy fueled by social media misinformation about COVID-19 vaccines became a major hurdle.
We present CoVaxLies, a new dataset of tweets judged relevant to several misinformation targets about COVID-19 vaccines.
Our method organizes CoVaxLies in a Misinformation Knowledge Graph as it casts misinformation detection as a graph link prediction problem.
arXiv Detail & Related papers (2021-08-04T23:27:10Z) - Classification supporting COVID-19 diagnostics based on patient survey
data [82.41449972618423]
logistic regression and XGBoost classifiers, that allow for effective screening of patients for COVID-19 were generated.
The obtained classification models provided the basis for the DECODE service (decode.polsl.pl), which can serve as support in screening patients with COVID-19 disease.
This data set consists of more than 3,000 examples is based on questionnaires collected at a hospital in Poland.
arXiv Detail & Related papers (2020-11-24T17:44:01Z) - TICO-19: the Translation Initiative for Covid-19 [112.5601530395345]
The Translation Initiative for COvid-19 (TICO-19) has made test and development data available to AI and MT researchers in 35 different languages.
The same data is translated into all of the languages represented, meaning that testing or development can be done for any pairing of languages in the set.
arXiv Detail & Related papers (2020-07-03T16:26:17Z) - Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment [90.12602012910465]
We train on Italy's early COVID-19 outbreak through Twitter and transfer to several other countries.
Our experiments show strong results with up to 0.85 Spearman correlation in cross-country predictions.
arXiv Detail & Related papers (2020-06-05T02:04:25Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.