Fighting an Infodemic: COVID-19 Fake News Dataset
- URL: http://arxiv.org/abs/2011.03327v4
- Date: Wed, 26 May 2021 15:38:55 GMT
- Title: Fighting an Infodemic: COVID-19 Fake News Dataset
- Authors: Parth Patwa, Shivam Sharma, Srinivas Pykl, Vineeth Guptha, Gitanjali
Kumari, Md Shad Akhtar, Asif Ekbal, Amitava Das, Tanmoy Chakraborty
- Abstract summary: Fake news and rumors are rampant on social media.
To tackle this, we curate and release a manually annotated dataset of 10,700 social media posts and articles of real and fake news on COVID-19.
- Score: 40.418407303807456
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Along with COVID-19 pandemic we are also fighting an `infodemic'. Fake news
and rumors are rampant on social media. Believing in rumors can cause
significant harm. This is further exacerbated at the time of a pandemic. To
tackle this, we curate and release a manually annotated dataset of 10,700
social media posts and articles of real and fake news on COVID-19. We benchmark
the annotated dataset with four machine learning baselines - Decision Tree,
Logistic Regression, Gradient Boost, and Support Vector Machine (SVM). We
obtain the best performance of 93.46% F1-score with SVM. The data and code is
available at: https://github.com/parthpatwa/covid19-fake-news-dectection
Related papers
- "COVID-19 was a FIFA conspiracy #curropt": An Investigation into the
Viral Spread of COVID-19 Misinformation [60.268682953952506]
We estimate the extent to which misinformation has influenced the course of the COVID-19 pandemic using natural language processing models.
We provide a strategy to combat social media posts that are likely to cause widespread harm.
arXiv Detail & Related papers (2022-06-12T19:41:01Z) - Cross-lingual COVID-19 Fake News Detection [54.125563009333995]
We make the first attempt to detect COVID-19 misinformation in a low-resource language (Chinese) only using the fact-checked news in a high-resource language (English)
We propose a deep learning framework named CrossFake to jointly encode the cross-lingual news body texts and capture the news content.
Empirical results on our dataset demonstrate the effectiveness of CrossFake under the cross-lingual setting.
arXiv Detail & Related papers (2021-10-13T04:44:02Z) - MMCoVaR: Multimodal COVID-19 Vaccine Focused Data Repository for Fake
News Detection and a Baseline Architecture for Classification [1.587767395906846]
We provide a new multimodal labeled dataset containing news articles and tweets on the COVID-19 vaccine.
We combine ratings from three news media ranking sites to classify the news dataset into two levels of credibility: reliable and unreliable.
We find that the proposed architecture has an F-Score of 0.919 and accuracy of 0.882 for fake news detection.
arXiv Detail & Related papers (2021-09-14T03:57:50Z) - Tackling COVID-19 Infodemic using Deep Learning [0.0]
We try to detect and classify fake news on online media to detect fake information relating to COVID-19 and coronavirus.
The dataset contained fake posts, articles and news gathered from fact checking websites like politifact whereas real tweets were taken from verified twitter handles.
arXiv Detail & Related papers (2021-07-01T11:07:47Z) - AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech
Detection Dataset [0.0]
"AraCOVID19-MFH" is a manually annotated multi-label Arabic COVID-19 fake news and hate speech detection dataset.
Our dataset contains 10,828 Arabic tweets annotated with 10 different labels.
It can also be used for hate speech detection, opinion/news classification, dialect identification, and many other tasks.
arXiv Detail & Related papers (2021-05-07T09:52:44Z) - ExcavatorCovid: Extracting Events and Relations from Text Corpora for
Temporal and Causal Analysis for COVID-19 [63.72766553648224]
ExcavatorCovid is a machine reading system that ingests open-source text documents.
It extracts COVID19 related events and relations between them, and builds a Temporal and Causal Analysis Graph.
arXiv Detail & Related papers (2021-05-05T01:18:46Z) - User Preference-aware Fake News Detection [61.86175081368782]
Existing fake news detection algorithms focus on mining news content for deceptive signals.
We propose a new framework, UPFD, which simultaneously captures various signals from user preferences by joint content and graph modeling.
arXiv Detail & Related papers (2021-04-25T21:19:24Z) - Misinfo Belief Frames: A Case Study on Covid & Climate News [49.979419711713795]
We propose a formalism for understanding how readers perceive the reliability of news and the impact of misinformation.
We introduce the Misinfo Belief Frames (MBF) corpus, a dataset of 66k inferences over 23.5k headlines.
Our results using large-scale language modeling to predict misinformation frames show that machine-generated inferences can influence readers' trust in news headlines.
arXiv Detail & Related papers (2021-04-18T09:50:11Z) - Transformer based Automatic COVID-19 Fake News Detection System [9.23545668304066]
Misinformation is especially prevalent in the ongoing coronavirus disease (COVID-19) pandemic.
We report a methodology to analyze the reliability of information shared on social media pertaining to the COVID-19 pandemic.
Our system obtained 0.9855 f1-score on testset and ranked 5th among 160 teams.
arXiv Detail & Related papers (2021-01-01T06:49:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.