Arabic Fake News Detection Based on Deep Contextualized Embedding Models
- URL: http://arxiv.org/abs/2205.03114v1
- Date: Fri, 6 May 2022 09:54:35 GMT
- Title: Arabic Fake News Detection Based on Deep Contextualized Embedding Models
- Authors: Ali Bou Nassif, Ashraf Elnagar, Omar Elgendy, Yaman Afadar
- Abstract summary: Many studies have been conducted to help detect fake news in English, but research conducted on fake news detection in the Arabic language is scarce.
We have constructed a large and diverse Arabic fake news dataset.
We have developed and evaluated transformer-based classifiers to identify fake news while utilizing eight state-of-the-art Arabic contextualized embedding models.
We conduct a thorough analysis of the state-of-the-art Arabic contextualized embedding models as well as comparison with similar fake news detection systems.
- Score: 3.425727850372357
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Social media is becoming a source of news for many people due to its ease and
freedom of use. As a result, fake news has been spreading quickly and easily
regardless of its credibility, especially in the last decade. Fake news
publishers take advantage of critical situations such as the Covid-19 pandemic
and the American presidential elections to affect societies negatively. Fake
news can seriously impact society in many fields including politics, finance,
sports, etc. Many studies have been conducted to help detect fake news in
English, but research conducted on fake news detection in the Arabic language
is scarce. Our contribution is twofold: first, we have constructed a large and
diverse Arabic fake news dataset. Second, we have developed and evaluated
transformer-based classifiers to identify fake news while utilizing eight
state-of-the-art Arabic contextualized embedding models. The majority of these
models had not been previously used for Arabic fake news detection. We conduct
a thorough analysis of the state-of-the-art Arabic contextualized embedding
models as well as comparison with similar fake news detection systems.
Experimental results confirm that these state-of-the-art models are robust,
with accuracy exceeding 98%.
Related papers
- Adapting Fake News Detection to the Era of Large Language Models [48.5847914481222]
We study the interplay between machine-(paraphrased) real news, machine-generated fake news, human-written fake news, and human-written real news.
Our experiments reveal an interesting pattern that detectors trained exclusively on human-written articles can indeed perform well at detecting machine-generated fake news, but not vice versa.
arXiv Detail & Related papers (2023-11-02T08:39:45Z) - Fake News Detection and Behavioral Analysis: Case of COVID-19 [0.22940141855172028]
"Infodemic" due to spread of fake news regarding the pandemic has been a global issue.
Readers could mistake fake news for real news, and consequently have less access to authentic information.
It is challenging to accurately identify fake news data in social media posts.
arXiv Detail & Related papers (2023-05-25T13:42:08Z) - Nothing Stands Alone: Relational Fake News Detection with Hypergraph
Neural Networks [49.29141811578359]
We propose to leverage a hypergraph to represent group-wise interaction among news, while focusing on important news relations with its dual-level attention mechanism.
Our approach yields remarkable performance and maintains the high performance even with a small subset of labeled news data.
arXiv Detail & Related papers (2022-12-24T00:19:32Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - FALSE: Fake News Automatic and Lightweight Solution [0.20999222360659603]
In this paper, R code have been used to study and visualize a modern fake news dataset.
We use clustering, classification, correlation and various plots to analyze and present the data.
arXiv Detail & Related papers (2022-08-16T11:53:30Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - Stance Detection with BERT Embeddings for Credibility Analysis of
Information on Social Media [1.7616042687330642]
We propose a model for detecting fake news using stance as one of the features along with the content of the article.
Our work interprets the content with automatic feature extraction and the relevance of the text pieces.
The experiment conducted on the real-world dataset indicates that our model outperforms the previous work and enables fake news detection with an accuracy of 95.32%.
arXiv Detail & Related papers (2021-05-21T10:46:43Z) - User Preference-aware Fake News Detection [61.86175081368782]
Existing fake news detection algorithms focus on mining news content for deceptive signals.
We propose a new framework, UPFD, which simultaneously captures various signals from user preferences by joint content and graph modeling.
arXiv Detail & Related papers (2021-04-25T21:19:24Z) - How does Truth Evolve into Fake News? An Empirical Study of Fake News
Evolution [55.27685924751459]
We present the Fake News Evolution dataset: a new dataset tracking the fake news evolution process.
Our dataset is composed of 950 paired data, each of which consists of articles representing the truth, the fake news, and the evolved fake news.
We observe the features during the evolution and they are the disinformation techniques, text similarity, top 10 keywords, classification accuracy, parts of speech, and sentiment properties.
arXiv Detail & Related papers (2021-03-10T09:01:34Z) - Fake or Real? A Study of Arabic Satirical Fake News [3.007949058551534]
This study conducts several exploratory analyses to identify the linguistic properties of Arabic fake news with satirical content.
We exploit these features to build a number of machine learning models capable of identifying satirical fake news with an accuracy of up to 98.6%.
arXiv Detail & Related papers (2020-11-01T08:56:56Z) - Fake News Spreader Detection on Twitter using Character N-Grams.
Notebook for PAN at CLEF 2020 [0.0]
This notebook describes our profiling system for the fake news detection task on Twitter.
We conduct different feature extraction techniques and learning experiments from a multilingual perspective.
Our models achieve an overall accuracy of 73% and 79% on the English and Spanish official test set.
arXiv Detail & Related papers (2020-09-29T08:32:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.