Framing the News:From Human Perception to Large Language Model
Inferences
- URL: http://arxiv.org/abs/2304.14456v1
- Date: Thu, 27 Apr 2023 18:30:18 GMT
- Title: Framing the News:From Human Perception to Large Language Model
Inferences
- Authors: David Alonso del Barrio and Daniel Gatica-Perez
- Abstract summary: Identifying the frames of news is important to understand the articles' vision, intention, message to be conveyed, and which aspects of the news are emphasized.
We develop a protocol for human labeling of frames for 1786 headlines of No-Vax movement articles of European newspapers from 5 countries.
We investigate two approaches for frame inference of news headlines: first with a GPT-3.5 fine-tuning approach, and second with GPT-3.5 prompt-engineering.
- Score: 8.666172545138272
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identifying the frames of news is important to understand the articles'
vision, intention, message to be conveyed, and which aspects of the news are
emphasized. Framing is a widely studied concept in journalism, and has emerged
as a new topic in computing, with the potential to automate processes and
facilitate the work of journalism professionals. In this paper, we study this
issue with articles related to the Covid-19 anti-vaccine movement. First, to
understand the perspectives used to treat this theme, we developed a protocol
for human labeling of frames for 1786 headlines of No-Vax movement articles of
European newspapers from 5 countries. Headlines are key units in the written
press, and worth of analysis as many people only read headlines (or use them to
guide their decision for further reading.) Second, considering advances in
Natural Language Processing (NLP) with large language models, we investigated
two approaches for frame inference of news headlines: first with a GPT-3.5
fine-tuning approach, and second with GPT-3.5 prompt-engineering. Our work
contributes to the study and analysis of the performance that these models have
to facilitate journalistic tasks like classification of frames, while
understanding whether the models are able to replicate human perception in the
identification of these frames.
Related papers
- AI as a Tool for Fair Journalism: Case Studies from Malta [0.07499722271664144]
Two projects focus on media monitoring and present tools designed to analyse potential biases in news articles and television news segments.
The first project uses Computer Vision and Natural Language Processing techniques to analyse the coherence between images in news articles and their corresponding captions, headlines, and article bodies.
The second project employs computer vision techniques to track individuals on-screen time or visual exposure in news videos, providing queryable data.
arXiv Detail & Related papers (2024-07-08T15:02:39Z) - A Multilingual Similarity Dataset for News Article Frame [14.977682986280998]
We introduce an extended version of a large labeled news article dataset with 16,687 new labeled pairs.
Our method frees the work of manual identification of frame classes in traditional news frame analysis studies.
Overall we introduce the most extensive cross-lingual news article similarity dataset available to date with 26,555 labeled news article pairs across 10 languages.
arXiv Detail & Related papers (2024-05-22T01:01:04Z) - Language Models: A Guide for the Perplexed [51.88841610098437]
This tutorial aims to help narrow the gap between those who study language models and those who are intrigued and want to learn more.
We offer a scientific viewpoint that focuses on questions amenable to study through experimentation.
We situate language models as they are today in the context of the research that led to their development.
arXiv Detail & Related papers (2023-11-29T01:19:02Z) - Identifying Informational Sources in News Articles [109.70475599552523]
We build the largest and widest-ranging annotated dataset of informational sources used in news writing.
We introduce a novel task, source prediction, to study the compositionality of sources in news articles.
arXiv Detail & Related papers (2023-05-24T08:56:35Z) - Towards Corpus-Scale Discovery of Selection Biases in News Coverage:
Comparing What Sources Say About Entities as a Start [65.28355014154549]
This paper investigates the challenges of building scalable NLP systems for discovering patterns of media selection biases directly from news content in massive-scale news corpora.
We show the capabilities of the framework through a case study on NELA-2020, a corpus of 1.8M news articles in English from 519 news sources worldwide.
arXiv Detail & Related papers (2023-04-06T23:36:45Z) - The COVID That Wasn't: Counterfactual Journalism Using GPT [0.5076419064097734]
We use a language model trained prior to 2020 to artificially generate news articles concerning COVID-19.
We then compare stylistic qualities of our artificially generated corpus with a news corpus.
We find our artificially generated articles exhibits a considerably more negative attitude towards COVID and a significantly lower reliance on geopolitical framing.
arXiv Detail & Related papers (2022-10-13T00:50:24Z) - NewsEdits: A News Article Revision Dataset and a Document-Level
Reasoning Challenge [122.37011526554403]
NewsEdits is the first publicly available dataset of news revision histories.
It contains 1.2 million articles with 4.6 million versions from over 22 English- and French-language newspaper sources.
arXiv Detail & Related papers (2022-06-14T18:47:13Z) - What's New? Summarizing Contributions in Scientific Literature [85.95906677964815]
We introduce a new task of disentangled paper summarization, which seeks to generate separate summaries for the paper contributions and the context of the work.
We extend the S2ORC corpus of academic articles by adding disentangled "contribution" and "context" reference labels.
We propose a comprehensive automatic evaluation protocol which reports the relevance, novelty, and disentanglement of generated outputs.
arXiv Detail & Related papers (2020-11-06T02:23:01Z) - Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News [57.9843300852526]
We introduce the more realistic and challenging task of defending against machine-generated news that also includes images and captions.
To identify the possible weaknesses that adversaries can exploit, we create a NeuralNews dataset composed of 4 different types of generated articles.
In addition to the valuable insights gleaned from our user study experiments, we provide a relatively effective approach based on detecting visual-semantic inconsistencies.
arXiv Detail & Related papers (2020-09-16T14:13:15Z) - XREF: Entity Linking for Chinese News Comments with Supplementary
Article Reference [19.811371589597382]
We study the problem of entity linking for Chinese news comments given mentions' spans.
We propose a novel model, XREF, that leverages attention mechanisms to pinpoint relevant context.
We develop a weakly supervised training scheme to utilize the large-scale unlabeled corpus.
arXiv Detail & Related papers (2020-06-24T19:42:54Z) - Generating Representative Headlines for News Stories [31.67864779497127]
Grouping articles that are reporting the same event into news stories is a common way of assisting readers in their news consumption.
It remains a challenging research problem to efficiently and effectively generate a representative headline for each story.
We develop a distant supervision approach to train large-scale generation models without any human annotation.
arXiv Detail & Related papers (2020-01-26T02:08:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.