Improving and Evaluating the Detection of Fragmentation in News
Recommendations with the Clustering of News Story Chains
- URL: http://arxiv.org/abs/2309.06192v2
- Date: Mon, 18 Sep 2023 13:05:08 GMT
- Title: Improving and Evaluating the Detection of Fragmentation in News
Recommendations with the Clustering of News Story Chains
- Authors: Alessandra Polimeno and Myrthe Reuver and Sanne Vrijenhoek and Antske
Fokkens
- Abstract summary: The Fragmentation metric quantifies the degree of fragmentation of information streams in news recommendations.
This paper presents an investigation of various approaches for quantifying Fragmentation in news recommendations.
- Score: 46.44827993583994
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: News recommender systems play an increasingly influential role in shaping
information access within democratic societies. However, tailoring
recommendations to users' specific interests can result in the divergence of
information streams. Fragmented access to information poses challenges to the
integrity of the public sphere, thereby influencing democracy and public
discourse. The Fragmentation metric quantifies the degree of fragmentation of
information streams in news recommendations. Accurate measurement of this
metric requires the application of Natural Language Processing (NLP) to
identify distinct news events, stories, or timelines. This paper presents an
extensive investigation of various approaches for quantifying Fragmentation in
news recommendations. These approaches are evaluated both intrinsically, by
measuring performance on news story clustering, and extrinsically, by assessing
the Fragmentation scores of different simulated news recommender scenarios. Our
findings demonstrate that agglomerative hierarchical clustering coupled with
SentenceBERT text representation is substantially better at detecting
Fragmentation than earlier implementations. Additionally, the analysis of
simulated scenarios yields valuable insights and recommendations for
stakeholders concerning the measurement and interpretation of Fragmentation.
Related papers
- Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding [118.75567341513897]
Existing methods typically analyze target text in isolation or solely with non-member contexts.
We propose Con-ReCall, a novel approach that leverages the asymmetric distributional shifts induced by member and non-member contexts.
arXiv Detail & Related papers (2024-09-05T09:10:38Z) - Large Language Model Enhanced Clustering for News Event Detection [1.7094064195431142]
This paper presents an event detection framework that leverages Large Language Models (LLMs) combined with clustering analysis.
The framework enhances event clustering through both pre-event detection tasks and post-event detection tasks.
We introduce a novel Cluster Stability Assessment Index (CSAI) to assess the validity and robustness of clustering results.
arXiv Detail & Related papers (2024-06-15T08:13:47Z) - Enhancing Topic Extraction in Recommender Systems with Entropy
Regularization [2.7286395031146062]
This paper introduces a novel approach called entropy regularization to address the issue of low explainability of recommender systems.
Experiment results show a significant improvement in topic coherence, which is quantified by cosine similarity on word embeddings.
arXiv Detail & Related papers (2023-06-12T20:05:09Z) - Open-vocabulary Panoptic Segmentation with Embedding Modulation [71.15502078615587]
Open-vocabulary image segmentation is attracting increasing attention due to its critical applications in the real world.
Traditional closed-vocabulary segmentation methods are not able to characterize novel objects, whereas several recent open-vocabulary attempts obtain unsatisfactory results.
We propose OPSNet, an omnipotent and data-efficient framework for Open-vocabulary Panopticon.
arXiv Detail & Related papers (2023-03-20T17:58:48Z) - FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality
Assessment [93.09267863425492]
We argue that understanding both high-level semantics and internal temporal structures of actions in competitive sports videos is the key to making predictions accurate and interpretable.
We construct a new fine-grained dataset, called FineDiving, developed on diverse diving events with detailed annotations on action procedures.
arXiv Detail & Related papers (2022-04-07T17:59:32Z) - A Closer Look at Debiased Temporal Sentence Grounding in Videos:
Dataset, Metric, and Approach [53.727460222955266]
Temporal Sentence Grounding in Videos (TSGV) aims to ground a natural language sentence in an untrimmed video.
Recent studies have found that current benchmark datasets may have obvious moment annotation biases.
We introduce a new evaluation metric "dR@n,IoU@m" that discounts the basic recall scores to alleviate the inflating evaluation caused by biased datasets.
arXiv Detail & Related papers (2022-03-10T08:58:18Z) - Twitter Referral Behaviours on News Consumption with Ensemble Clustering
of Click-Stream Data in Turkish Media [2.9005223064604078]
This study investigates the readers' click activities in the organizations' websites to identify news consumption patterns following referrals from Twitter.
The investigation is widened to a broad perspective by linking the log data with news content to enrich the insights.
arXiv Detail & Related papers (2022-02-04T09:57:13Z) - Neural News Recommendation with Event Extraction [0.0]
A key challenge of online news recommendation is to help users find articles they are interested in.
Traditional news recommendation methods usually use single news information, which is insufficient to encode news and user representation.
We propose an Event Extraction-based News Recommendation framework to overcome these shortcomings.
arXiv Detail & Related papers (2021-11-09T11:56:38Z) - Salience Estimation with Multi-Attention Learning for Abstractive Text
Summarization [86.45110800123216]
In the task of text summarization, salience estimation for words, phrases or sentences is a critical component.
We propose a Multi-Attention Learning framework which contains two new attention learning components for salience estimation.
arXiv Detail & Related papers (2020-04-07T02:38:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.