Twitter discussions and emotions about COVID-19 pandemic: a machine
learning approach
- URL: http://arxiv.org/abs/2005.12830v2
- Date: Fri, 19 Jun 2020 02:43:13 GMT
- Title: Twitter discussions and emotions about COVID-19 pandemic: a machine
learning approach
- Authors: Jia Xue (University of Toronto), Junxiang Chen (University of
Pittsburgh), Ran Hu (University of Toronto), Chen Chen (University of
Toronto), ChengDa Zheng (University of Toronto), Xiaoqian Liu (Chinese
Academy of Sciences), Tingshao Zhu (China Academy of Science)
- Abstract summary: We analyze 4 million Twitter messages related to the COVID-19 pandemic using a list of 25 hashtags such as "coronavirus," "COVID-19," "quarantine" from March 1 to April 21 in 2020.
We identify 13 discussion topics and categorize them into five different themes, such as "public health measures to slow the spread of COVID-19," "social stigma associated with COVID-19," "coronavirus news cases and deaths," "COVID-19 in the United States," and "coronavirus cases in the rest of the world"
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The objective of the study is to examine coronavirus disease (COVID-19)
related discussions, concerns, and sentiments that emerged from tweets posted
by Twitter users. We analyze 4 million Twitter messages related to the COVID-19
pandemic using a list of 25 hashtags such as "coronavirus," "COVID-19,"
"quarantine" from March 1 to April 21 in 2020. We use a machine learning
approach, Latent Dirichlet Allocation (LDA), to identify popular unigram,
bigrams, salient topics and themes, and sentiments in the collected Tweets.
Popular unigrams include "virus," "lockdown," and "quarantine." Popular bigrams
include "COVID-19," "stay home," "corona virus," "social distancing," and "new
cases." We identify 13 discussion topics and categorize them into five
different themes, such as "public health measures to slow the spread of
COVID-19," "social stigma associated with COVID-19," "coronavirus news cases
and deaths," "COVID-19 in the United States," and "coronavirus cases in the
rest of the world". Across all identified topics, the dominant sentiments for
the spread of coronavirus are anticipation that measures that can be taken,
followed by a mixed feeling of trust, anger, and fear for different topics. The
public reveals a significant feeling of fear when they discuss the coronavirus
new cases and deaths than other topics. The study shows that Twitter data and
machine learning approaches can be leveraged for infodemiology study by
studying the evolving public discussions and sentiments during the COVID-19.
Real-time monitoring and assessment of the Twitter discussion and concerns can
be promising for public health emergency responses and planning. Already
emerged pandemic fear, stigma, and mental health concerns may continue to
influence public trust when there occurs a second wave of COVID-19 or a new
surge of the imminent pandemic.
Related papers
- "COVID-19 was a FIFA conspiracy #curropt": An Investigation into the
Viral Spread of COVID-19 Misinformation [60.268682953952506]
We estimate the extent to which misinformation has influenced the course of the COVID-19 pandemic using natural language processing models.
We provide a strategy to combat social media posts that are likely to cause widespread harm.
arXiv Detail & Related papers (2022-06-12T19:41:01Z) - The relationship between sentiment score and COVID-19 cases in the
United States [0.0]
We consider a framework for extracting sentiment scores and opinions from COVID-19 related tweets.
We connect users' sentiment with COVID-19 cases across the USA and investigate the effect of specific COVID-19 milestones on public sentiment.
arXiv Detail & Related papers (2022-01-09T01:07:13Z) - American Twitter Users Revealed Social Determinants-related Oral Health
Disparities amid the COVID-19 Pandemic [72.44305630014534]
We collected oral health-related tweets during the COVID-19 pandemic from 9,104 Twitter users across 26 states.
Women and younger adults (19-29) are more likely to talk about oral health problems.
People from counties at a higher risk of COVID-19 talk more about tooth decay/gum bleeding and chipped tooth/tooth break.
arXiv Detail & Related papers (2021-09-16T01:10:06Z) - Surveillance of COVID-19 Pandemic using Social Media: A Reddit Study in
North Carolina [5.38552621556164]
We tap into social media to surveil the uptake of mitigation and detection strategies, and capture issues and concerns about the pandemic.
After extracting COVID-related posts from the four largest subreddit communities of North Carolina over six months, we performed NLP-based preprocessing to clean the noisy data.
We observed that'mask', 'flu', and 'testing' are the most prevalent named-entities for "Personal Protective Equipment", "symptoms", and "testing" categories.
arXiv Detail & Related papers (2021-06-07T06:55:25Z) - Understanding the Hoarding Behaviors during the COVID-19 Pandemic using
Large Scale Social Media Data [77.34726150561087]
We analyze the hoarding and anti-hoarding patterns of over 42,000 unique Twitter users in the United States from March 1 to April 30, 2020.
We find the percentage of females in both hoarding and anti-hoarding groups is higher than that of the general Twitter user population.
The LIWC anxiety mean for the hoarding-related tweets is significantly higher than the baseline Twitter anxiety mean.
arXiv Detail & Related papers (2020-10-15T16:02:25Z) - Understanding the temporal evolution of COVID-19 research through
machine learning and natural language processing [66.63200823918429]
The outbreak of the novel coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been continuously affecting human lives and communities around the world.
We used multiple data sources, i.e., PubMed and ArXiv, and built several machine learning models to characterize the landscape of current COVID-19 research.
Our findings confirm the types of research available in PubMed and ArXiv differ significantly, with the former exhibiting greater diversity in terms of COVID-19 related issues.
arXiv Detail & Related papers (2020-07-22T18:02:39Z) - Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment [90.12602012910465]
We train on Italy's early COVID-19 outbreak through Twitter and transfer to several other countries.
Our experiments show strong results with up to 0.85 Spearman correlation in cross-country predictions.
arXiv Detail & Related papers (2020-06-05T02:04:25Z) - Critical Impact of Social Networks Infodemic on Defeating Coronavirus
COVID-19 Pandemic: Twitter-Based Study and Research Directions [1.6571886312953874]
An estimated 2.95 billion people in 2019 used social media worldwide.
The widespread of the Coronavirus COVID-19 resulted with a tsunami of social media.
This paper presents a large-scale study based on data mined from Twitter.
arXiv Detail & Related papers (2020-05-18T15:53:13Z) - Public discourse and sentiment during the COVID-19 pandemic: using
Latent Dirichlet Allocation for topic modeling on Twitter [10.857375706178622]
The study aims to understand Twitter users' discourse and psychological reactions to COVID-19.
We use machine learning techniques to analyze about 1.9 million Tweets related to coronavirus collected from January 23 to March 7, 2020.
Results do not reveal treatments and symptoms related messages as prevalent topics on Twitter.
arXiv Detail & Related papers (2020-05-18T15:50:38Z) - The Ivory Tower Lost: How College Students Respond Differently than the
General Public to the COVID-19 Pandemic [66.80677233314002]
Pandemic of the novel Coronavirus Disease 2019 (COVID-19) has presented governments with ultimate challenges.
In the United States, the country with the highest confirmed COVID-19 infection cases, a nationwide social distancing protocol has been implemented by the President.
This paper aims to discover the social implications of this unprecedented disruption in our interactive society by mining people's opinions on social media.
arXiv Detail & Related papers (2020-04-21T13:02:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.