GREENER: Graph Neural Networks for News Media Profiling
- URL: http://arxiv.org/abs/2211.05533v1
- Date: Thu, 10 Nov 2022 12:46:29 GMT
- Title: GREENER: Graph Neural Networks for News Media Profiling
- Authors: Panayot Panayotov, Utsav Shukla, Husrev Taha Sencar, Mohamed Nabeel,
Preslav Nakov
- Abstract summary: We study the problem of profiling news media on the Web with respect to their factuality of reporting and bias.
Our main focus is on modeling the similarity between media outlets based on the overlap of their audience.
Prediction accuracy is found to improve by 2.5-27 macro-F1 points for the two tasks.
- Score: 24.675574340841163
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We study the problem of profiling news media on the Web with respect to their
factuality of reporting and bias. This is an important but under-studied
problem related to disinformation and "fake news" detection, but it addresses
the issue at a coarser granularity compared to looking at an individual article
or an individual claim. This is useful as it allows to profile entire media
outlets in advance. Unlike previous work, which has focused primarily on text
(e.g.,~on the text of the articles published by the target website, or on the
textual description in their social media profiles or in Wikipedia), here our
main focus is on modeling the similarity between media outlets based on the
overlap of their audience. This is motivated by homophily considerations,
i.e.,~the tendency of people to have connections to people with similar
interests, which we extend to media, hypothesizing that similar types of media
would be read by similar kinds of users. In particular, we propose GREENER
(GRaph nEural nEtwork for News mEdia pRofiling), a model that builds a graph of
inter-media connections based on their audience overlap, and then uses graph
neural networks to represent each medium. We find that such representations are
quite useful for predicting the factuality and the bias of news media outlets,
yielding improvements over state-of-the-art results reported on two datasets.
When augmented with conventionally used representations obtained from news
articles, Twitter, YouTube, Facebook, and Wikipedia, prediction accuracy is
found to improve by 2.5-27 macro-F1 points for the two tasks.
Related papers
- Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions [0.7249731529275342]
We propose an extension to a recently presented news media reliability estimation method.
We assess the classification performance of four reinforcement learning strategies on a large news media hyperlink graph.
Our experiments, targeting two challenging bias descriptors, factual reporting and political bias, showed a significant performance improvement at the source media level.
arXiv Detail & Related papers (2024-10-23T08:18:26Z) - DocNet: Semantic Structure in Inductive Bias Detection Models [0.4779196219827508]
In this paper, we explore an often overlooked aspect of bias detection in documents: the semantic structure of news articles.
We present DocNet, a novel, inductive, and low-resource document embedding and bias detection model.
We also demonstrate that the semantic structure of news articles from opposing partisan sides, as represented in document-level graph embeddings, have significant similarities.
arXiv Detail & Related papers (2024-06-16T14:51:12Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Towards Corpus-Scale Discovery of Selection Biases in News Coverage:
Comparing What Sources Say About Entities as a Start [65.28355014154549]
This paper investigates the challenges of building scalable NLP systems for discovering patterns of media selection biases directly from news content in massive-scale news corpora.
We show the capabilities of the framework through a case study on NELA-2020, a corpus of 1.8M news articles in English from 519 news sources worldwide.
arXiv Detail & Related papers (2023-04-06T23:36:45Z) - Computational Assessment of Hyperpartisanship in News Titles [55.92100606666497]
We first adopt a human-guided machine learning framework to develop a new dataset for hyperpartisan news title detection.
Overall the Right media tends to use proportionally more hyperpartisan titles.
We identify three major topics including foreign issues, political systems, and societal issues that are suggestive of hyperpartisanship in news titles.
arXiv Detail & Related papers (2023-01-16T05:56:58Z) - Unveiling the Hidden Agenda: Biases in News Reporting and Consumption [59.55900146668931]
We build a six-year dataset on the Italian vaccine debate and adopt a Bayesian latent space model to identify narrative and selection biases.
We found a nonlinear relationship between biases and engagement, with higher engagement for extreme positions.
Analysis of news consumption on Twitter reveals common audiences among news outlets with similar ideological positions.
arXiv Detail & Related papers (2023-01-14T18:58:42Z) - Inference of Media Bias and Content Quality Using Natural-Language
Processing [6.092956184948962]
We present a framework to infer both political bias and content quality of media outlets from text.
We apply a bidirectional long short-term memory (LSTM) neural network to a data set of more than 1 million tweets.
Our results illustrate the importance of leveraging word order into machine-learning methods in text analysis.
arXiv Detail & Related papers (2022-12-01T03:04:55Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - MBIC -- A Media Bias Annotation Dataset Including Annotator
Characteristics [0.0]
Media bias, or slanted news coverage, can have a substantial impact on public perception of events.
In this poster, we present a matrix-based methodology to crowdsource such data using a self-developed annotation platform.
We also present MBIC - the first sample of 1,700 statements representing various media bias instances.
arXiv Detail & Related papers (2021-05-20T15:05:17Z) - Content-based Analysis of the Cultural Differences between TikTok and
Douyin [95.32409577885645]
Short-form video social media shifts away from the traditional media paradigm by telling the audience a dynamic story to attract their attention.
In particular, different combinations of everyday objects can be employed to represent a unique scene that is both interesting and understandable.
Offered by the same company, TikTok and Douyin are popular examples of such new media that has become popular in recent years.
The hypothesis that they express cultural differences together with media fashion and social idiosyncrasy is the primary target of our research.
arXiv Detail & Related papers (2020-11-03T01:47:49Z) - How-to Present News on Social Media: A Causal Analysis of Editing News
Headlines for Boosting User Engagement [14.829079057399838]
We analyze media outlets' current practices using a data-driven approach.
We build a parallel corpus of original news articles and their corresponding tweets that eight media outlets shared.
Then, we explore how those media edited tweets against original headlines and the effects of such changes.
arXiv Detail & Related papers (2020-09-17T06:39:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.