Detecting Chinese Fake News on Twitter during the COVID-19 Pandemic
- URL: http://arxiv.org/abs/2304.03454v1
- Date: Fri, 7 Apr 2023 03:03:11 GMT
- Title: Detecting Chinese Fake News on Twitter during the COVID-19 Pandemic
- Authors: Yongjun Zhang, Sijia Liu, Yi Wang, Xinguang Fan
- Abstract summary: The outbreak of COVID-19 has led to a global surge of Sinophobia partly because of the spread of misinformation, disinformation, and fake news on China.
We report on the creation of a novel classifier that detects whether Chinese-language social media posts from Twitter are related to fake news about China.
We provide the final model and a new training dataset with 18,425 tweets for researchers to study fake news in the Chinese language during the COVID-19 pandemic.
- Score: 14.572408454393042
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The outbreak of COVID-19 has led to a global surge of Sinophobia partly
because of the spread of misinformation, disinformation, and fake news on
China. In this paper, we report on the creation of a novel classifier that
detects whether Chinese-language social media posts from Twitter are related to
fake news about China. The classifier achieves an F1 score of 0.64 and an
accuracy rate of 93%. We provide the final model and a new training dataset
with 18,425 tweets for researchers to study fake news in the Chinese language
during the COVID-19 pandemic. We also introduce a new dataset generated by our
classifier that tracks the dynamics of fake news in the Chinese language during
the early pandemic.
Related papers
- MCFEND: A Multi-source Benchmark Dataset for Chinese Fake News Detection [5.288018460787191]
Methods trained on purely one single news source can hardly be applicable to real-world scenarios.
We constructed the first multi-source benchmark dataset for Chinese fake news detection, termed MCFEND.
MCFEND, as a benchmark dataset, aims to advance Chinese fake news detection approaches in real-world scenarios.
arXiv Detail & Related papers (2024-03-14T04:32:13Z) - Adapting Fake News Detection to the Era of Large Language Models [48.5847914481222]
We study the interplay between machine-(paraphrased) real news, machine-generated fake news, human-written fake news, and human-written real news.
Our experiments reveal an interesting pattern that detectors trained exclusively on human-written articles can indeed perform well at detecting machine-generated fake news, but not vice versa.
arXiv Detail & Related papers (2023-11-02T08:39:45Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - Cross-lingual COVID-19 Fake News Detection [54.125563009333995]
We make the first attempt to detect COVID-19 misinformation in a low-resource language (Chinese) only using the fact-checked news in a high-resource language (English)
We propose a deep learning framework named CrossFake to jointly encode the cross-lingual news body texts and capture the news content.
Empirical results on our dataset demonstrate the effectiveness of CrossFake under the cross-lingual setting.
arXiv Detail & Related papers (2021-10-13T04:44:02Z) - User Preference-aware Fake News Detection [61.86175081368782]
Existing fake news detection algorithms focus on mining news content for deceptive signals.
We propose a new framework, UPFD, which simultaneously captures various signals from user preferences by joint content and graph modeling.
arXiv Detail & Related papers (2021-04-25T21:19:24Z) - How does Truth Evolve into Fake News? An Empirical Study of Fake News
Evolution [55.27685924751459]
We present the Fake News Evolution dataset: a new dataset tracking the fake news evolution process.
Our dataset is composed of 950 paired data, each of which consists of articles representing the truth, the fake news, and the evolved fake news.
We observe the features during the evolution and they are the disinformation techniques, text similarity, top 10 keywords, classification accuracy, parts of speech, and sentiment properties.
arXiv Detail & Related papers (2021-03-10T09:01:34Z) - Cross-SEAN: A Cross-Stitch Semi-Supervised Neural Attention Model for
COVID-19 Fake News Detection [14.771202995527315]
COVID-19 related fake news has been spreading faster than the facts.
We introduce CTF, the first COVID-19 Twitter fake news dataset with labeled genuine and fake tweets.
We also propose Cross-SEAN, a cross-stitch based semi-supervised end-to-end neural attention model.
arXiv Detail & Related papers (2021-02-17T18:30:43Z) - Evaluating Deep Learning Approaches for Covid19 Fake News Detection [0.0]
We look at automated techniques for fake news detection from a data mining perspective.
We evaluate different supervised text classification algorithms on Contraint@AAAI 2021 Covid-19 Fake news detection dataset.
We report the best accuracy of 98.41% on the Covid-19 Fake news detection dataset.
arXiv Detail & Related papers (2021-01-11T16:39:03Z) - Transformer based Automatic COVID-19 Fake News Detection System [9.23545668304066]
Misinformation is especially prevalent in the ongoing coronavirus disease (COVID-19) pandemic.
We report a methodology to analyze the reliability of information shared on social media pertaining to the COVID-19 pandemic.
Our system obtained 0.9855 f1-score on testset and ranked 5th among 160 teams.
arXiv Detail & Related papers (2021-01-01T06:49:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.