Classification Aware Neural Topic Model and its Application on a New
COVID-19 Disinformation Corpus
- URL: http://arxiv.org/abs/2006.03354v2
- Date: Thu, 11 Mar 2021 12:55:12 GMT
- Title: Classification Aware Neural Topic Model and its Application on a New
COVID-19 Disinformation Corpus
- Authors: Xingyi Song, Johann Petrak, Ye Jiang, Iknoor Singh, Diana Maynard and
Kalina Bontcheva
- Abstract summary: The explosion of disinformation following the COVID-19 pandemic has overloaded fact-checkers and media worldwide.
To help tackle this, we developed computational methods to categorise COVID-19 disinformation.
- Score: 2.492887522265771
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The explosion of disinformation accompanying the COVID-19 pandemic has
overloaded fact-checkers and media worldwide, and brought a new major challenge
to government responses worldwide. Not only is disinformation creating
confusion about medical science amongst citizens, but it is also amplifying
distrust in policy makers and governments. To help tackle this, we developed
computational methods to categorise COVID-19 disinformation. The COVID-19
disinformation categories could be used for a) focusing fact-checking efforts
on the most damaging kinds of COVID-19 disinformation; b) guiding policy makers
who are trying to deliver effective public health messages and counter
effectively COVID-19 disinformation. This paper presents: 1) a corpus
containing what is currently the largest available set of manually annotated
COVID-19 disinformation categories; 2) a classification-aware neural topic
model (CANTM) designed for COVID-19 disinformation category classification and
topic discovery; 3) an extensive analysis of COVID-19 disinformation categories
with respect to time, volume, false type, media type and origin source.
Related papers
- AMIR: Automated MisInformation Rebuttal -- A COVID-19 Vaccination Datasets based Recommendation System [0.05461938536945722]
This work explored how existing information obtained from social media can be harnessed to facilitate automated rebuttal of misinformation at scale.
It leverages two publicly available datasets, FaCov (fact-checked articles) and misleading (social media Twitter) data on COVID-19 Vaccination.
arXiv Detail & Related papers (2023-10-29T13:07:33Z) - A Large-Scale Comparative Study of Accurate COVID-19 Information versus
Misinformation [4.926199465135915]
The COVID-19 pandemic led to an infodemic where an overwhelming amount of COVID-19 related content was being disseminated at high velocity through social media.
This motivated us to carry out a comparative study of the characteristics of COVID-19 misinformation versus those of accurate COVID-19 information through a large-scale computational analysis of over 242 million tweets.
An added contribution of this study is the creation of a COVID-19 misinformation classification dataset.
arXiv Detail & Related papers (2023-04-10T18:44:41Z) - Not cool, calm or collected: Using emotional language to detect COVID-19
misinformation [0.0]
COVID-19 misinformation on social media platforms such as twitter is a threat to effective pandemic management.
We present a novel COVID-19 misinformation model, which uses both a tweet emotion encoder and COVID-19 misinformation encoder to predict whether a tweet contains COVID-19 misinformation.
arXiv Detail & Related papers (2023-03-27T22:24:05Z) - "COVID-19 was a FIFA conspiracy #curropt": An Investigation into the
Viral Spread of COVID-19 Misinformation [60.268682953952506]
We estimate the extent to which misinformation has influenced the course of the COVID-19 pandemic using natural language processing models.
We provide a strategy to combat social media posts that are likely to cause widespread harm.
arXiv Detail & Related papers (2022-06-12T19:41:01Z) - When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning
Framework in Classification of Medical Images on Limited Data: A COVID-19
Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z) - Testing the Generalization of Neural Language Models for COVID-19
Misinformation Detection [6.1204874238049705]
A drastic rise in potentially life-threatening misinformation has been a by-product of the COVID-19 pandemic.
We evaluate fifteen Transformer-based models on five COVID-19 misinformation datasets.
We show tokenizers and models tailored to COVID-19 data do not provide a significant advantage over general-purpose ones.
arXiv Detail & Related papers (2021-11-15T15:01:55Z) - Cross-lingual COVID-19 Fake News Detection [54.125563009333995]
We make the first attempt to detect COVID-19 misinformation in a low-resource language (Chinese) only using the fact-checked news in a high-resource language (English)
We propose a deep learning framework named CrossFake to jointly encode the cross-lingual news body texts and capture the news content.
Empirical results on our dataset demonstrate the effectiveness of CrossFake under the cross-lingual setting.
arXiv Detail & Related papers (2021-10-13T04:44:02Z) - Automatic Detection of COVID-19 Vaccine Misinformation with Graph Link
Prediction [2.0625936401496237]
Vaccine hesitancy fueled by social media misinformation about COVID-19 vaccines became a major hurdle.
We present CoVaxLies, a new dataset of tweets judged relevant to several misinformation targets about COVID-19 vaccines.
Our method organizes CoVaxLies in a Misinformation Knowledge Graph as it casts misinformation detection as a graph link prediction problem.
arXiv Detail & Related papers (2021-08-04T23:27:10Z) - A System for Worldwide COVID-19 Information Aggregation [92.60866520230803]
We build a system for worldwide COVID-19 information aggregation containing reliable articles from 10 regions in 7 languages sorted by topics.
A neural machine translation module translates articles in other languages into Japanese and English.
A BERT-based topic-classifier trained on our article-topic pair dataset helps users find their interested information efficiently.
arXiv Detail & Related papers (2020-07-28T01:33:54Z) - Misinformation Has High Perplexity [55.47422012881148]
We propose to leverage the perplexity to debunk false claims in an unsupervised manner.
First, we extract reliable evidence from scientific and news sources according to sentence similarity to the claims.
Second, we prime a language model with the extracted evidence and finally evaluate the correctness of given claims based on the perplexity scores at debunking time.
arXiv Detail & Related papers (2020-06-08T15:13:44Z) - COVID-Net: A Tailored Deep Convolutional Neural Network Design for
Detection of COVID-19 Cases from Chest X-Ray Images [93.0013343535411]
We introduce COVID-Net, a deep convolutional neural network design tailored for the detection of COVID-19 cases from chest X-ray (CXR) images.
To the best of the authors' knowledge, COVID-Net is one of the first open source network designs for COVID-19 detection from CXR images.
We also introduce COVIDx, an open access benchmark dataset that we generated comprising of 13,975 CXR images across 13,870 patient patient cases.
arXiv Detail & Related papers (2020-03-22T12:26:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.