COVID-19 Vaccine Hesitancy on Social Media: Building a Public Twitter
Dataset of Anti-vaccine Content, Vaccine Misinformation and Conspiracies
- URL: http://arxiv.org/abs/2105.05134v2
- Date: Fri, 14 May 2021 21:05:27 GMT
- Title: COVID-19 Vaccine Hesitancy on Social Media: Building a Public Twitter
Dataset of Anti-vaccine Content, Vaccine Misinformation and Conspiracies
- Authors: Goran Muric, Yusong Wu, Emilio Ferrara
- Abstract summary: False claims about COVID-19 vaccines can undermine public trust in ongoing vaccination campaigns.
We present a dataset of Twitter posts that exhibit a strong anti-vaccine stance.
- Score: 10.505633521103018
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: False claims about COVID-19 vaccines can undermine public trust in ongoing
vaccination campaigns, thus posing a threat to global public health.
Misinformation originating from various sources has been spreading online since
the beginning of the COVID-19 pandemic. In this paper, we present a dataset of
Twitter posts that exhibit a strong anti-vaccine stance. The dataset consists
of two parts: a) a streaming keyword-centered data collection with more than
1.8 million tweets, and b) a historical account-level collection with more than
135 million tweets. The former leverages the Twitter streaming API to follow a
set of specific vaccine-related keywords starting from mid-October 2020. The
latter consists of all historical tweets of 70K accounts that were engaged in
the active spreading of anti-vaccine narratives. We present descriptive
analyses showing the volume of activity over time, geographical distributions,
topics, news sources, and inferred account political leaning. This dataset can
be used in studying anti-vaccine misinformation on social media and enable a
better understanding of vaccine hesitancy. In compliance with Twitter's Terms
of Service, our anonymized dataset is publicly available at:
https://github.com/gmuric/avax-tweets-dataset
Related papers
- Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter [3.768191396638854]
Vaccine hesitancy continues to be a main challenge for public health officials during the COVID-19 pandemic.
We present Vax-Culture, a novel Twitter COVID-19 dataset consisting of 6373 vaccine-related tweets.
We hope this can lead to effective and targeted public health communication strategies for reaching individuals with anti-vaccine beliefs.
arXiv Detail & Related papers (2023-04-13T23:04:30Z) - Doctors vs. Nurses: Understanding the Great Divide in Vaccine Hesitancy
among Healthcare Workers [64.1526243118151]
We find that doctors are overall more positive toward the COVID-19 vaccines.
Doctors are more concerned with the effectiveness of the vaccines over newer variants.
Nurses pay more attention to the potential side effects on children.
arXiv Detail & Related papers (2022-09-11T14:22:16Z) - "COVID-19 was a FIFA conspiracy #curropt": An Investigation into the
Viral Spread of COVID-19 Misinformation [60.268682953952506]
We estimate the extent to which misinformation has influenced the course of the COVID-19 pandemic using natural language processing models.
We provide a strategy to combat social media posts that are likely to cause widespread harm.
arXiv Detail & Related papers (2022-06-12T19:41:01Z) - CAVES: A Dataset to facilitate Explainable Classification and
Summarization of Concerns towards COVID Vaccines [18.617543658780367]
We have curated CAVES, the first large-scale dataset containing about 10k COVID-19 anti-vaccine tweets labelled into various specific anti-vaccine concerns.
This is also the first multi-label classification dataset that provides explanations for each of the labels.
arXiv Detail & Related papers (2022-04-28T19:26:54Z) - Manipulating Twitter Through Deletions [64.33261764633504]
Research into influence campaigns on Twitter has mostly relied on identifying malicious activities from tweets obtained via public APIs.
Here, we provide the first exhaustive, large-scale analysis of anomalous deletion patterns involving more than a billion deletions by over 11 million accounts.
We find that a small fraction of accounts delete a large number of tweets daily.
First, limits on tweet volume are circumvented, allowing certain accounts to flood the network with over 26 thousand daily tweets.
Second, coordinated networks of accounts engage in repetitive likes and unlikes of content that is eventually deleted, which can manipulate ranking algorithms.
arXiv Detail & Related papers (2022-03-25T20:07:08Z) - Automated clustering of COVID-19 anti-vaccine discourse on Twitter [0.0]
An observational study of Twitter vaccine discourse is found in Ojea Quintana et al.
This work expands upon Ojea Quintana et al. (2021) with two main contributions from data science.
First, based on the authors' initial network clustering and qualitative analysis techniques, we are able to clearly demarcate and visualize the language patterns used in discourse by Antivaxxers.
Second, using the characteristics of Antivaxxers' tweets, we develop text classifiers to determine the likelihood a given user is employing anti-vaccination language.
arXiv Detail & Related papers (2022-03-03T07:38:31Z) - ArCovidVac: Analyzing Arabic Tweets About COVID-19 Vaccination [7.594204373985492]
We release the first largest manually annotated Arabic tweet dataset, ArCovidVac, for the COVID-19 vaccination campaign.
The dataset is enriched with different layers of annotation, including, (i) Informativeness (more vs. less importance of the tweets); (ii) fine-grained tweet content types (e.g., advice, rumors, restriction, authenticate news/information); and (iii) stance towards vaccination.
arXiv Detail & Related papers (2022-01-17T16:19:21Z) - A Python Package to Detect Anti-Vaccine Users on Twitter [1.1602089225841632]
Anti-vaccine hesitancy has been recently driven by the anti-vaccine narratives shared online.
We introduce a Python package capable of analyzing Twitter profiles to assess how likely that profile is to spread anti-vaccine sentiment.
We leverage the data on such users to understand what are the moral and emotional characteristics of anti-vaccine spreaders.
arXiv Detail & Related papers (2021-10-21T17:59:25Z) - Understanding the Hoarding Behaviors during the COVID-19 Pandemic using
Large Scale Social Media Data [77.34726150561087]
We analyze the hoarding and anti-hoarding patterns of over 42,000 unique Twitter users in the United States from March 1 to April 30, 2020.
We find the percentage of females in both hoarding and anti-hoarding groups is higher than that of the general Twitter user population.
The LIWC anxiety mean for the hoarding-related tweets is significantly higher than the baseline Twitter anxiety mean.
arXiv Detail & Related papers (2020-10-15T16:02:25Z) - Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media
during the COVID-19 Crisis [51.39895377836919]
COVID-19 has sparked racism and hate on social media targeted towards Asian communities.
We study the evolution and spread of anti-Asian hate speech through the lens of Twitter.
We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months.
arXiv Detail & Related papers (2020-05-25T21:58:09Z) - Falling into the Echo Chamber: the Italian Vaccination Debate on Twitter [65.7192861893042]
We examine the extent to which the vaccination debate on Twitter is conductive to potential outreach to the vaccination hesitant.
We discover that the vaccination skeptics, as well as the advocates, reside in their own distinct "echo chambers"
At the center of these echo chambers we find the ardent supporters, for which we build highly accurate network- and content-based classifiers.
arXiv Detail & Related papers (2020-03-26T13:55:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.