It's All in the Embedding! Fake News Detection Using Document Embeddings
- URL: http://arxiv.org/abs/2304.07781v1
- Date: Sun, 16 Apr 2023 13:30:06 GMT
- Title: It's All in the Embedding! Fake News Detection Using Document Embeddings
- Authors: Ciprian-Octavian Truic\u{a} and Elena-Simona Apostol
- Abstract summary: We propose a new approach that uses document embeddings to build multiple models that accurately label news articles as reliable or fake.
We also present a benchmark on different architectures that detect fake news using binary or multi-labeled classification.
- Score: 0.6091702876917281
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: With the current shift in the mass media landscape from journalistic rigor to
social media, personalized social media is becoming the new norm. Although the
digitalization progress of the media brings many advantages, it also increases
the risk of spreading disinformation, misinformation, and malformation through
the use of fake news. The emergence of this harmful phenomenon has managed to
polarize society and manipulate public opinion on particular topics, e.g.,
elections, vaccinations, etc. Such information propagated on social media can
distort public perceptions and generate social unrest while lacking the rigor
of traditional journalism. Natural Language Processing and Machine Learning
techniques are essential for developing efficient tools that can detect fake
news. Models that use the context of textual data are essential for resolving
the fake news detection problem, as they manage to encode linguistic features
within the vector representation of words. In this paper, we propose a new
approach that uses document embeddings to build multiple models that accurately
label news articles as reliable or fake. We also present a benchmark on
different architectures that detect fake news using binary or multi-labeled
classification. We evaluated the models on five large news corpora using
accuracy, precision, and recall. We obtained better results than more complex
state-of-the-art Deep Neural Network models. We observe that the most important
factor for obtaining high accuracy is the document encoding, not the
classification model's complexity.
Related papers
- Adapting Fake News Detection to the Era of Large Language Models [48.5847914481222]
We study the interplay between machine-(paraphrased) real news, machine-generated fake news, human-written fake news, and human-written real news.
Our experiments reveal an interesting pattern that detectors trained exclusively on human-written articles can indeed perform well at detecting machine-generated fake news, but not vice versa.
arXiv Detail & Related papers (2023-11-02T08:39:45Z) - An Interactive Framework for Profiling News Media Sources [26.386860411085053]
We propose an interactive framework for news media profiling.
It combines the strengths of graph based news media profiling models, Pre-trained Large Language Models, and human insight.
With as little as 5 human interactions, our framework can rapidly detect fake and biased news media.
arXiv Detail & Related papers (2023-09-14T02:03:45Z) - Nothing Stands Alone: Relational Fake News Detection with Hypergraph
Neural Networks [49.29141811578359]
We propose to leverage a hypergraph to represent group-wise interaction among news, while focusing on important news relations with its dual-level attention mechanism.
Our approach yields remarkable performance and maintains the high performance even with a small subset of labeled news data.
arXiv Detail & Related papers (2022-12-24T00:19:32Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - Modelling Social Context for Fake News Detection: A Graph Neural Network
Based Approach [0.39146761527401425]
Detection of fake news is crucial to ensure the authenticity of information and maintain the news ecosystems reliability.
This paper has analyzed the social context of fake news detection with a hybrid graph neural network based approach.
arXiv Detail & Related papers (2022-07-27T12:58:33Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - User Preference-aware Fake News Detection [61.86175081368782]
Existing fake news detection algorithms focus on mining news content for deceptive signals.
We propose a new framework, UPFD, which simultaneously captures various signals from user preferences by joint content and graph modeling.
arXiv Detail & Related papers (2021-04-25T21:19:24Z) - Viable Threat on News Reading: Generating Biased News Using Natural
Language Models [49.90665530780664]
We show that publicly available language models can reliably generate biased news content based on an input original news.
We also show that a large number of high-quality biased news articles can be generated using controllable text generation.
arXiv Detail & Related papers (2020-10-05T16:55:39Z) - Modeling the spread of fake news on Twitter [2.7910505923792637]
We propose a point process model of the spread of fake news on Twitter.
We show that the proposed model is superior to the current state-of-the-art methods in accurately predicting the evolution of the spread of a fake news item.
The proposed model contributes to understanding the dynamics of the spread of fake news on social media.
arXiv Detail & Related papers (2020-07-28T08:28:16Z) - Fake News Detection on News-Oriented Heterogeneous Information Networks
through Hierarchical Graph Attention [12.250335118888891]
We propose a novel fake news detection framework, namely Hierarchical Graph Attention Network(HGAT)
HGAT uses a novel hierarchical attention mechanism to perform node representation learning in HIN, and then detects fake news by classifying news article nodes.
Experiments on two real-world fake news datasets show that HGAT can outperform text-based models and other network-based models.
arXiv Detail & Related papers (2020-02-05T19:09:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.