CompRes: A Dataset for Narrative Structure in News
- URL: http://arxiv.org/abs/2007.04874v2
- Date: Tue, 7 Nov 2023 13:29:06 GMT
- Title: CompRes: A Dataset for Narrative Structure in News
- Authors: Effi Levi, Guy Mor, Shaul Shenhav, Tamir Sheafer
- Abstract summary: We introduce CompRes -- the first dataset for narrative structure in news media.
We use the annotated dataset to train several supervised models to identify the different narrative elements.
- Score: 2.4578723416255754
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper addresses the task of automatically detecting narrative structures
in raw texts. Previous works have utilized the oral narrative theory by Labov
and Waletzky to identify various narrative elements in personal stories texts.
Instead, we direct our focus to news articles, motivated by their growing
social impact as well as their role in creating and shaping public opinion.
We introduce CompRes -- the first dataset for narrative structure in news
media. We describe the process in which the dataset was constructed: first, we
designed a new narrative annotation scheme, better suited for news media, by
adapting elements from the narrative theory of Labov and Waletzky (Complication
and Resolution) and adding a new narrative element of our own (Success); then,
we used that scheme to annotate a set of 29 English news articles (containing
1,099 sentences) collected from news and partisan websites. We use the
annotated dataset to train several supervised models to identify the different
narrative elements, achieving an $F_1$ score of up to 0.7. We conclude by
suggesting several promising directions for future work.
Related papers
- BookWorm: A Dataset for Character Description and Analysis [59.186325346763184]
We define two tasks: character description, which generates a brief factual profile, and character analysis, which offers an in-depth interpretation.
We introduce the BookWorm dataset, pairing books from the Gutenberg Project with human-written descriptions and analyses.
Our findings show that retrieval-based approaches outperform hierarchical ones in both tasks.
arXiv Detail & Related papers (2024-10-14T10:55:58Z) - Mapping News Narratives Using LLMs and Narrative-Structured Text Embeddings [0.0]
We introduce a numerical narrative representation grounded in structuralist linguistic theory.
We extract the actants using an open-source LLM and integrate them into a Narrative-Structured Text Embedding.
We demonstrate the analytical insights of the method on the example of 5000 full-text news articles from Al Jazeera and The Washington Post on the Israel-Palestine conflict.
arXiv Detail & Related papers (2024-09-10T14:15:30Z) - SCStory: Self-supervised and Continual Online Story Discovery [53.72745249384159]
SCStory helps people digest rapidly published news article streams in real-time without human annotations.
SCStory employs self-supervised and continual learning with a novel idea of story-indicative adaptive modeling of news article streams.
arXiv Detail & Related papers (2023-11-27T04:50:01Z) - ReelFramer: Human-AI Co-Creation for News-to-Video Translation [18.981919581170175]
We introduce ReelFramer, a human-AI co-creative system that helps journalists translate print articles into scripts and storyboards.
narrative framing introduces the necessary diversity to translate various articles into reels, and establishes details helps generate scripts that are more relevant and coherent.
arXiv Detail & Related papers (2023-04-19T13:44:35Z) - Detecting Narrative Elements in Informational Text [0.0]
We introduce NEAT (Narrative Elements AnnoTation) - a novel NLP task for detecting narrative elements in raw text.
We use this scheme to annotate a new dataset of 2,209 sentences, compiled from 46 news articles from various category domains.
We trained a number of supervised models in several different setups over the annotated dataset to identify the different narrative elements, achieving an average F1 score of up to 0.77.
arXiv Detail & Related papers (2022-10-06T16:23:33Z) - WikiDes: A Wikipedia-Based Dataset for Generating Short Descriptions
from Paragraphs [66.88232442007062]
We introduce WikiDes, a dataset to generate short descriptions of Wikipedia articles.
The dataset consists of over 80k English samples on 6987 topics.
Our paper shows a practical impact on Wikipedia and Wikidata since there are thousands of missing descriptions.
arXiv Detail & Related papers (2022-09-27T01:28:02Z) - End-to-End Segmentation-based News Summarization [15.549631631269198]
We introduce the task of segmenting a news article into multiple sections and generating the corresponding summary to each section.
First, we create and make available a dataset, SegNews, consisting of 27k news articles with sections and aligned heading-style section summaries.
Second, we propose a novel segmentation-based language generation model adapted from pre-trained language models.
arXiv Detail & Related papers (2021-10-15T04:17:26Z) - News Article Retrieval in Context for Event-centric Narrative Creation [45.50837121213255]
Given an incomplete narrative, we aim to retrieve news articles that discuss relevant events that would enable the continuation of the narrative.
Experiments show that state-of-the-art lexical and semantic rankers are not sufficient for this task.
We show that combining those with a ranker that ranks articles by reverse chronological order outperforms those rankers alone.
arXiv Detail & Related papers (2021-06-30T13:27:54Z) - Abstractive Summarization of Spoken and Written Instructions with BERT [66.14755043607776]
We present the first application of the BERTSum model to conversational language.
We generate abstractive summaries of narrated instructional videos across a wide variety of topics.
We envision this integrated as a feature in intelligent virtual assistants, enabling them to summarize both written and spoken instructional content upon request.
arXiv Detail & Related papers (2020-08-21T20:59:34Z) - Screenplay Summarization Using Latent Narrative Structure [78.45316339164133]
We propose to explicitly incorporate the underlying structure of narratives into general unsupervised and supervised extractive summarization models.
We formalize narrative structure in terms of key narrative events (turning points) and treat it as latent in order to summarize screenplays.
Experimental results on the CSI corpus of TV screenplays, which we augment with scene-level summarization labels, show that latent turning points correlate with important aspects of a CSI episode.
arXiv Detail & Related papers (2020-04-27T11:54:19Z) - Learning to Select Bi-Aspect Information for Document-Scale Text Content
Manipulation [50.01708049531156]
We focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer.
In detail, the input is a set of structured records and a reference text for describing another recordset.
The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference.
arXiv Detail & Related papers (2020-02-24T12:52:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.