Fake news agenda in the era of COVID-19: Identifying trends through
fact-checking content
- URL: http://arxiv.org/abs/2012.11004v1
- Date: Sun, 20 Dec 2020 19:35:25 GMT
- Title: Fake news agenda in the era of COVID-19: Identifying trends through
fact-checking content
- Authors: Wilson Ceron, Mathias-Felipe de-Lima-Santos and Marcos G. Quiles
- Abstract summary: We introduce a novel Markov-inspired computational method for identifying topics in tweets.
We collected data from Twitter accounts of two Brazilian fact-checking outlets.
Our method resulted in an important technique to cluster topics in a wide range of scenarios.
- Score: 0.8594140167290099
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The rise of social media has ignited an unprecedented circulation of false
information in our society. It is even more evident in times of crises, such as
the COVID-19 pandemic. Fact-checking efforts have expanded greatly and have
been touted as among the most promising solutions to fake news, especially in
times like these. Several studies have reported the development of
fact-checking organizations in Western societies, albeit little attention has
been given to the Global South. Here, to fill this gap, we introduce a novel
Markov-inspired computational method for identifying topics in tweets. In
contrast to other topic modeling approaches, our method clusters topics and
their current evolution in a predefined time window. Through these, we
collected data from Twitter accounts of two Brazilian fact-checking outlets and
presented the topics debunked by these initiatives in fortnights throughout the
pandemic. By comparing these organizations, we could identify similarities and
differences in what was shared by them. Our method resulted in an important
technique to cluster topics in a wide range of scenarios, including an
infodemic -- a period overabundance of the same information. In particular, the
data clearly revealed a complex intertwining between politics and the health
crisis during this period. We conclude by proposing a generic model which, in
our opinion, is suitable for topic modeling and an agenda for future research.
Related papers
- DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet
Classification via Memory Bank [52.20298962359658]
In crisis events, people often use social media platforms such as Twitter to disseminate information about the situation, warnings, advice, and support.
fully-supervised approaches require annotating vast amounts of data and are impractical due to limited response time.
Semi-supervised models can be biased, performing moderately well for certain classes while performing extremely poorly for others.
We propose a simple but effective debiasing method, DeCrisisMB, that utilizes a Memory Bank to store and perform equal sampling for generated pseudo-labels from each class at each training.
arXiv Detail & Related papers (2023-10-23T05:25:51Z) - Cross-Platform Social Dynamics: An Analysis of ChatGPT and COVID-19
Vaccine Conversations [37.69303106863453]
We analyzed over 12 million posts and news articles related to two significant events: the release of ChatGPT in 2022 and the global discussions about COVID-19 vaccines in 2021.
Data was collected from multiple platforms, including Twitter, Facebook, Instagram, Reddit, YouTube, and GDELT.
We employed topic modeling techniques to uncover the distinct thematic emphases on each platform, which reflect their specific features and target audiences.
arXiv Detail & Related papers (2023-10-17T09:58:55Z) - Understanding Divergent Framing of the Supreme Court Controversies:
Social Media vs. News Outlets [56.67097829383139]
We focus on the nuanced distinctions in framing of social media and traditional media outlets concerning a series of U.S. Supreme Court rulings.
We observe significant polarization in the news media's treatment of affirmative action and abortion rights, whereas the topic of student loans tends to exhibit a greater degree of consensus.
arXiv Detail & Related papers (2023-09-18T06:40:21Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Towards Corpus-Scale Discovery of Selection Biases in News Coverage:
Comparing What Sources Say About Entities as a Start [65.28355014154549]
This paper investigates the challenges of building scalable NLP systems for discovering patterns of media selection biases directly from news content in massive-scale news corpora.
We show the capabilities of the framework through a case study on NELA-2020, a corpus of 1.8M news articles in English from 519 news sources worldwide.
arXiv Detail & Related papers (2023-04-06T23:36:45Z) - Bias or Diversity? Unraveling Fine-Grained Thematic Discrepancy in U.S.
News Headlines [63.52264764099532]
We use a large dataset of 1.8 million news headlines from major U.S. media outlets spanning from 2014 to 2022.
We quantify the fine-grained thematic discrepancy related to four prominent topics - domestic politics, economic issues, social issues, and foreign affairs.
Our findings indicate that on domestic politics and social issues, the discrepancy can be attributed to a certain degree of media bias.
arXiv Detail & Related papers (2023-03-28T03:31:37Z) - Contextualizing Emerging Trends in Financial News Articles [2.9483477138814287]
We focus on emerging trends detection in financial news articles about Microsoft, collected before and during the start of the COVID-19 pandemic.
We make the dataset accessible and propose a strong baseline for exploring the dynamics of similarities between pairs of keywords.
We evaluate against a gold standard (Google Trends) and present noteworthy real-world scenarios regarding the influence of the pandemic on Microsoft.
arXiv Detail & Related papers (2023-01-20T12:56:52Z) - Topic Modeling and Progression of American Digital News Media During the
Onset of the COVID-19 Pandemic [2.798697306330988]
Currently, the world is in the midst of a severe global pandemic, which has affected all aspects of people's lives.
There is a deluge of COVID-related digital media articles published in the United States, due to the disparate effects of the pandemic.
We develop a Natural Language Processing pipeline that is capable of automatically distilling various digital articles into manageable pieces of information.
arXiv Detail & Related papers (2021-05-25T14:27:47Z) - Automatic Monitoring Social Dynamics During Big Incidences: A Case Study
of COVID-19 in Bangladesh [0.26651200086513094]
This study analyzed a large set of Bangladeshi newspaper data related to CO-19 pandemic.
This analysis will help the government and other organizations to figure out the challenges that have arisen in society due to this pandemic.
arXiv Detail & Related papers (2021-01-24T07:46:17Z) - Leveraging Natural Language Processing to Mine Issues on Twitter During
the COVID-19 Pandemic [0.3674863913115431]
The recent global outbreak of the coronavirus disease (COVID-19) has spread to all corners of the globe.
To understand the public concerns and responses to the pandemic, a system that can leverage machine learning techniques to filter out irrelevant tweets is needed.
In this study, we constructed a system to identify the relevant tweets related to the COVID-19 pandemic throughout January 1st, 2020 to April 30th, 2020.
arXiv Detail & Related papers (2020-10-31T22:26:26Z) - Mining Disinformation and Fake News: Concepts, Methods, and Recent
Advancements [55.33496599723126]
disinformation including fake news has become a global phenomenon due to its explosive growth.
Despite the recent progress in detecting disinformation and fake news, it is still non-trivial due to its complexity, diversity, multi-modality, and costs of fact-checking or annotation.
arXiv Detail & Related papers (2020-01-02T21:01:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.