Classification of news spreading barriers
- URL: http://arxiv.org/abs/2304.08167v1
- Date: Mon, 10 Apr 2023 20:13:54 GMT
- Title: Classification of news spreading barriers
- Authors: Abdul Sittar, Dunja Mladenic, Marko Grobelnik
- Abstract summary: We propose an approach to barrier classification where we infer the semantics of news articles through Wikipedia concepts.
We collect news articles and annotated them for different kinds of barriers using the metadata of news publishers.
We utilize the Wikipedia concepts along with the body text of news articles as features to infer the news-spreading barriers.
- Score: 3.0036519884678894
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: News media is one of the most effective mechanisms for spreading information
internationally, and many events from different areas are internationally
relevant. However, news coverage for some news events is limited to a specific
geographical region because of information spreading barriers, which can be
political, geographical, economic, cultural, or linguistic. In this paper, we
propose an approach to barrier classification where we infer the semantics of
news articles through Wikipedia concepts. To that end, we collected news
articles and annotated them for different kinds of barriers using the metadata
of news publishers. Then, we utilize the Wikipedia concepts along with the body
text of news articles as features to infer the news-spreading barriers. We
compare our approach to the classical text classification methods, deep
learning, and transformer-based methods. The results show that the proposed
approach using Wikipedia concepts based semantic knowledge offers better
performance than the usual for classifying the news-spreading barriers.
Related papers
- A Multilingual Similarity Dataset for News Article Frame [14.977682986280998]
We introduce an extended version of a large labeled news article dataset with 16,687 new labeled pairs.
Our method frees the work of manual identification of frame classes in traditional news frame analysis studies.
Overall we introduce the most extensive cross-lingual news article similarity dataset available to date with 26,555 labeled news article pairs across 10 languages.
arXiv Detail & Related papers (2024-05-22T01:01:04Z) - Tracking the Newsworthiness of Public Documents [107.12303391111014]
This work focuses on news coverage of local public policy in the San Francisco Bay Area by the San Francisco Chronicle.
First, we gather news articles, public policy documents and meeting recordings and link them using probabilistic relational modeling.
Second, we define a new task: newsworthiness prediction, to predict if a policy item will get covered.
arXiv Detail & Related papers (2023-11-16T10:05:26Z) - Profiling the news spreading barriers using news headlines [3.0036519884678894]
We consider five barriers including cultural, economic, political, linguistic, and geographical.
We label the news headlines automatically for the barriers using the metadata of news publishers.
We then utilize the extracted commonsense inferences and sentiments as features to detect the news spreading barriers.
arXiv Detail & Related papers (2023-04-07T10:16:15Z) - Towards Corpus-Scale Discovery of Selection Biases in News Coverage:
Comparing What Sources Say About Entities as a Start [65.28355014154549]
This paper investigates the challenges of building scalable NLP systems for discovering patterns of media selection biases directly from news content in massive-scale news corpora.
We show the capabilities of the framework through a case study on NELA-2020, a corpus of 1.8M news articles in English from 519 news sources worldwide.
arXiv Detail & Related papers (2023-04-06T23:36:45Z) - Unveiling the Hidden Agenda: Biases in News Reporting and Consumption [59.55900146668931]
We build a six-year dataset on the Italian vaccine debate and adopt a Bayesian latent space model to identify narrative and selection biases.
We found a nonlinear relationship between biases and engagement, with higher engagement for extreme positions.
Analysis of news consumption on Twitter reveals common audiences among news outlets with similar ideological positions.
arXiv Detail & Related papers (2023-01-14T18:58:42Z) - Using the profile of publishers to predict barriers across news articles [0.685316573653194]
We present an approach to barrier detection in news spreading by utilizing Wikipedia-concepts and metadata associated with each barrier.
We believe that our approach can serve to provide useful insights which pave the way for the future development of a system for predicting information spreading barriers over the news.
arXiv Detail & Related papers (2023-01-13T13:32:42Z) - Beyond Discrete Genres: Mapping News Items onto a Multidimensional
Framework of Genre Cues [0.0]
We propose a non-discrete framework for mapping news items in terms of genre cues.
To automatically analyze a large amount of news items, we deliver two computational models for predicting news sentences.
This proposed approach helps in deepening our insight into the evolving nature of news genres.
arXiv Detail & Related papers (2022-12-08T10:54:31Z) - Multiverse: Multilingual Evidence for Fake News Detection [71.51905606492376]
Multiverse is a new feature based on multilingual evidence that can be used for fake news detection.
The hypothesis of the usage of cross-lingual evidence as a feature for fake news detection is confirmed.
arXiv Detail & Related papers (2022-11-25T18:24:17Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - Cross-Domain Learning for Classifying Propaganda in Online Contents [67.10699378370752]
We present an approach to leverage cross-domain learning, based on labeled documents and sentences from news and tweets, as well as political speeches with a clear difference in their degrees of being propagandistic.
Our experiments demonstrate the usefulness of this approach, and identify difficulties and limitations in various configurations of sources and targets for the transfer step.
arXiv Detail & Related papers (2020-11-13T10:19:13Z) - A multi-layer approach to disinformation detection on Twitter [4.663548775064491]
We employ a multi-layer representation of Twitter diffusion networks, and we compute for each layer a set of global network features.
Experimental results with two large-scale datasets, corresponding to diffusion cascades of news shared respectively in the United States and Italy, show that a simple Logistic Regression model is able to classify disinformation vs mainstream networks with high accuracy.
We believe that our network-based approach provides useful insights which pave the way to the future development of a system to detect misleading and harmful information spreading on social media.
arXiv Detail & Related papers (2020-02-28T09:25:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.