Birds of a Feather Flock Together: Satirical News Detection via Language
Model Differentiation
- URL: http://arxiv.org/abs/2007.02164v1
- Date: Sat, 4 Jul 2020 18:46:36 GMT
- Title: Birds of a Feather Flock Together: Satirical News Detection via Language
Model Differentiation
- Authors: Yigeng Zhang, Fan Yang, Yifan Zhang, Eduard Dragut and Arjun Mukherjee
- Abstract summary: In satirical news, the lexical and pragmatical attributes of the context are the key factors in amusing the readers.
We propose a method that differentiates the satirical news and true news.
- Score: 7.556286423133077
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Satirical news is regularly shared in modern social media because it is
entertaining with smartly embedded humor. However, it can be harmful to society
because it can sometimes be mistaken as factual news, due to its deceptive
character. We found that in satirical news, the lexical and pragmatical
attributes of the context are the key factors in amusing the readers. In this
work, we propose a method that differentiates the satirical news and true news.
It takes advantage of satirical writing evidence by leveraging the difference
between the prediction loss of two language models, one trained on true news
and the other on satirical news, when given a new news article. We compute
several statistical metrics of language model prediction loss as features,
which are then used to conduct downstream classification. The proposed method
is computationally effective because the language models capture the language
usage differences between satirical news documents and traditional news
documents, and are sensitive when applied to documents outside their domains.
Related papers
- SCStory: Self-supervised and Continual Online Story Discovery [53.72745249384159]
SCStory helps people digest rapidly published news article streams in real-time without human annotations.
SCStory employs self-supervised and continual learning with a novel idea of story-indicative adaptive modeling of news article streams.
arXiv Detail & Related papers (2023-11-27T04:50:01Z) - Adapting Fake News Detection to the Era of Large Language Models [48.5847914481222]
We study the interplay between machine-(paraphrased) real news, machine-generated fake news, human-written fake news, and human-written real news.
Our experiments reveal an interesting pattern that detectors trained exclusively on human-written articles can indeed perform well at detecting machine-generated fake news, but not vice versa.
arXiv Detail & Related papers (2023-11-02T08:39:45Z) - Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks [60.14025705964573]
SheepDog is a style-robust fake news detector that prioritizes content over style in determining news veracity.
SheepDog achieves this resilience through (1) LLM-empowered news reframings that inject style diversity into the training process by customizing articles to match different styles; (2) a style-agnostic training scheme that ensures consistent veracity predictions across style-diverse reframings; and (3) content-focused attributions that distill content-centric guidelines from LLMs for debunking fake news.
arXiv Detail & Related papers (2023-10-16T21:05:12Z) - It's All in the Embedding! Fake News Detection Using Document Embeddings [0.6091702876917281]
We propose a new approach that uses document embeddings to build multiple models that accurately label news articles as reliable or fake.
We also present a benchmark on different architectures that detect fake news using binary or multi-labeled classification.
arXiv Detail & Related papers (2023-04-16T13:30:06Z) - Predictive linguistic cues for fake news: a societal artificial
intelligence problem [9.40467099889021]
We present linguistic characteristics of media news items to differentiate between fake news and real news using machine learning algorithms.
We use neural networks which mainly control distributional features rather than evidence.
Features unique, negative, positive, and cardinal numbers with high values on the metrics are observed to provide a high area under the curve (AUC) and F1-score.
arXiv Detail & Related papers (2022-11-26T07:50:01Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - Fake or Real? A Study of Arabic Satirical Fake News [3.007949058551534]
This study conducts several exploratory analyses to identify the linguistic properties of Arabic fake news with satirical content.
We exploit these features to build a number of machine learning models capable of identifying satirical fake news with an accuracy of up to 98.6%.
arXiv Detail & Related papers (2020-11-01T08:56:56Z) - Viable Threat on News Reading: Generating Biased News Using Natural
Language Models [49.90665530780664]
We show that publicly available language models can reliably generate biased news content based on an input original news.
We also show that a large number of high-quality biased news articles can be generated using controllable text generation.
arXiv Detail & Related papers (2020-10-05T16:55:39Z) - Modeling the spread of fake news on Twitter [2.7910505923792637]
We propose a point process model of the spread of fake news on Twitter.
We show that the proposed model is superior to the current state-of-the-art methods in accurately predicting the evolution of the spread of a fake news item.
The proposed model contributes to understanding the dynamics of the spread of fake news on social media.
arXiv Detail & Related papers (2020-07-28T08:28:16Z) - Satirical News Detection with Semantic Feature Extraction and
Game-theoretic Rough Sets [5.326582776477692]
We propose a semantic feature based approach to detect satirical news tweets.
Features are extracted by exploring inconsistencies in phrases, entities, and between main and relative clauses.
We apply game-theoretic rough set model to detect satirical news, in which probabilistic thresholds are derived by game equilibrium and repetition learning mechanism.
arXiv Detail & Related papers (2020-04-08T03:22:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.