Discord Questions: A Computational Approach To Diversity Analysis in
News Coverage
- URL: http://arxiv.org/abs/2211.05007v1
- Date: Wed, 9 Nov 2022 16:37:55 GMT
- Title: Discord Questions: A Computational Approach To Diversity Analysis in
News Coverage
- Authors: Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Xiang 'Anthony'
Chen, Caiming Xiong
- Abstract summary: We propose a new framework to assist readers in identifying source differences and gaining an understanding of news coverage diversity.
The framework is based on the generation of Discord Questions: questions with a diverse answer pool.
- Score: 84.55145223950427
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: There are many potential benefits to news readers accessing diverse sources.
Modern news aggregators do the hard work of organizing the news, offering
readers a plethora of source options, but choosing which source to read remains
challenging. We propose a new framework to assist readers in identifying source
differences and gaining an understanding of news coverage diversity. The
framework is based on the generation of Discord Questions: questions with a
diverse answer pool, explicitly illustrating source differences. To assemble a
prototype of the framework, we focus on two components: (1) discord question
generation, the task of generating questions answered differently by sources,
for which we propose an automatic scoring method, and create a model that
improves performance from current question generation (QG) methods by 5%, (2)
answer consolidation, the task of grouping answers to a question that are
semantically similar, for which we collect data and repurpose a method that
achieves 81% balanced accuracy on our realistic test set. We illustrate the
framework's feasibility through a prototype interface. Even though model
performance at discord QG still lags human performance by more than 15%,
generated questions are judged to be more interesting than factoid questions
and can reveal differences in the level of detail, sentiment, and reasoning of
sources in news coverage.
Related papers
- Improving Question Generation with Multi-level Content Planning [70.37285816596527]
This paper addresses the problem of generating questions from a given context and an answer, specifically focusing on questions that require multi-hop reasoning across an extended context.
We propose MultiFactor, a novel QG framework based on multi-level content planning. Specifically, MultiFactor includes two components: FA-model, which simultaneously selects key phrases and generates full answers, and Q-model which takes the generated full answer as an additional input to generate questions.
arXiv Detail & Related papers (2023-10-20T13:57:01Z) - Evaluating and Modeling Attribution for Cross-Lingual Question Answering [80.4807682093432]
This work is the first to study attribution for cross-lingual question answering.
We collect data in 5 languages to assess the attribution level of a state-of-the-art cross-lingual QA system.
We find that a substantial portion of the answers is not attributable to any retrieved passages.
arXiv Detail & Related papers (2023-05-23T17:57:46Z) - Towards Corpus-Scale Discovery of Selection Biases in News Coverage:
Comparing What Sources Say About Entities as a Start [65.28355014154549]
This paper investigates the challenges of building scalable NLP systems for discovering patterns of media selection biases directly from news content in massive-scale news corpora.
We show the capabilities of the framework through a case study on NELA-2020, a corpus of 1.8M news articles in English from 519 news sources worldwide.
arXiv Detail & Related papers (2023-04-06T23:36:45Z) - DIFFQG: Generating Questions to Summarize Factual Changes [41.142542919449355]
We propose representing factual changes between paired documents as question-answer pairs.
DIFFQG consists of 759 QA pairs and 1153 examples of paired passages with no factual change.
arXiv Detail & Related papers (2023-03-01T05:45:48Z) - CONSISTENT: Open-Ended Question Generation From News Articles [38.41162895492449]
We propose CONSISTENT, a new end-to-end system for generating open-ended questions.
We demonstrate our model's strength over several baselines using both automatic and human=based evaluations.
We discuss potential downstream applications for news media organizations.
arXiv Detail & Related papers (2022-10-20T19:10:07Z) - MixQG: Neural Question Generation with Mixed Answer Types [54.23205265351248]
We propose a neural question generator, MixQG, to bridge this gap.
We combine 9 question answering datasets with diverse answer types, including yes/no, multiple-choice, extractive, and abstractive answers.
Our model outperforms existing work in both seen and unseen domains.
arXiv Detail & Related papers (2021-10-15T16:03:40Z) - "Don't quote me on that": Finding Mixtures of Sources in News Articles [85.92467549469147]
We construct an ontological labeling system for sources based on each source's textitaffiliation and textitrole
We build a probabilistic model to infer these attributes for named sources and to describe news articles as mixtures of these sources.
arXiv Detail & Related papers (2021-04-19T21:57:11Z) - Generating Diversified Comments via Reader-Aware Topic Modeling and
Saliency Detection [25.16392119801612]
We propose a reader-aware topic modeling and saliency information detection framework to enhance the quality of generated comments.
For reader-aware topic modeling, we design a variational generative clustering algorithm for latent semantic learning and topic mining from reader comments.
For saliency information detection, we introduce Bernoulli distribution estimating on news content to select saliency information.
arXiv Detail & Related papers (2021-02-13T03:50:31Z) - Multi-hop Inference for Question-driven Summarization [39.08269647808958]
We propose a novel question-driven abstractive summarization method, Multi-hop Selective Generator (MSG)
MSG incorporates multi-hop reasoning into question-driven summarization and, meanwhile, provide justifications for the generated summaries.
Experimental results show that the proposed method consistently outperforms state-of-the-art methods on two non-factoid QA datasets.
arXiv Detail & Related papers (2020-10-08T02:36:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.