CompRes: A Dataset for Narrative Structure in News
- URL: http://arxiv.org/abs/2007.04874v2
- Date: Tue, 7 Nov 2023 13:29:06 GMT
- Title: CompRes: A Dataset for Narrative Structure in News
- Authors: Effi Levi, Guy Mor, Shaul Shenhav, Tamir Sheafer
- Abstract summary: We introduce CompRes -- the first dataset for narrative structure in news media.
We use the annotated dataset to train several supervised models to identify the different narrative elements.
- Score: 2.4578723416255754
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper addresses the task of automatically detecting narrative structures
in raw texts. Previous works have utilized the oral narrative theory by Labov
and Waletzky to identify various narrative elements in personal stories texts.
Instead, we direct our focus to news articles, motivated by their growing
social impact as well as their role in creating and shaping public opinion.
We introduce CompRes -- the first dataset for narrative structure in news
media. We describe the process in which the dataset was constructed: first, we
designed a new narrative annotation scheme, better suited for news media, by
adapting elements from the narrative theory of Labov and Waletzky (Complication
and Resolution) and adding a new narrative element of our own (Success); then,
we used that scheme to annotate a set of 29 English news articles (containing
1,099 sentences) collected from news and partisan websites. We use the
annotated dataset to train several supervised models to identify the different
narrative elements, achieving an $F_1$ score of up to 0.7. We conclude by
suggesting several promising directions for future work.
Related papers
- SCStory: Self-supervised and Continual Online Story Discovery [53.72745249384159]
SCStory helps people digest rapidly published news article streams in real-time without human annotations.
SCStory employs self-supervised and continual learning with a novel idea of story-indicative adaptive modeling of news article streams.
arXiv Detail & Related papers (2023-11-27T04:50:01Z) - ReelFramer: Human-AI Co-Creation for News-to-Video Translation [18.981919581170175]
We introduce ReelFramer, a human-AI co-creative system that helps journalists translate print articles into scripts and storyboards.
narrative framing introduces the necessary diversity to translate various articles into reels, and establishes details helps generate scripts that are more relevant and coherent.
arXiv Detail & Related papers (2023-04-19T13:44:35Z) - NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization [26.80378373420446]
NarraSum is a large-scale narrative summarization dataset.
It contains 122K narrative documents, which are collected from plot descriptions of movies and TV episodes with diverse genres, and their corresponding abstractive summaries.
Experiments show that there is a large performance gap between humans and the state-of-the-art summarization models on NarraSum.
arXiv Detail & Related papers (2022-12-02T22:51:51Z) - Detecting Narrative Elements in Informational Text [0.0]
We introduce NEAT (Narrative Elements AnnoTation) - a novel NLP task for detecting narrative elements in raw text.
We use this scheme to annotate a new dataset of 2,209 sentences, compiled from 46 news articles from various category domains.
We trained a number of supervised models in several different setups over the annotated dataset to identify the different narrative elements, achieving an average F1 score of up to 0.77.
arXiv Detail & Related papers (2022-10-06T16:23:33Z) - WikiDes: A Wikipedia-Based Dataset for Generating Short Descriptions
from Paragraphs [66.88232442007062]
We introduce WikiDes, a dataset to generate short descriptions of Wikipedia articles.
The dataset consists of over 80k English samples on 6987 topics.
Our paper shows a practical impact on Wikipedia and Wikidata since there are thousands of missing descriptions.
arXiv Detail & Related papers (2022-09-27T01:28:02Z) - End-to-End Segmentation-based News Summarization [15.549631631269198]
We introduce the task of segmenting a news article into multiple sections and generating the corresponding summary to each section.
First, we create and make available a dataset, SegNews, consisting of 27k news articles with sections and aligned heading-style section summaries.
Second, we propose a novel segmentation-based language generation model adapted from pre-trained language models.
arXiv Detail & Related papers (2021-10-15T04:17:26Z) - News Article Retrieval in Context for Event-centric Narrative Creation [45.50837121213255]
Given an incomplete narrative, we aim to retrieve news articles that discuss relevant events that would enable the continuation of the narrative.
Experiments show that state-of-the-art lexical and semantic rankers are not sufficient for this task.
We show that combining those with a ranker that ranks articles by reverse chronological order outperforms those rankers alone.
arXiv Detail & Related papers (2021-06-30T13:27:54Z) - STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story
Generation [48.56586847883825]
We introduce a dataset and evaluation platform built from STORIUM, an online collaborative storytelling community.
Our dataset contains 6K lengthy stories with fine-grained natural language annotations interspersed throughout each narrative.
We evaluate language models fine-tuned on our dataset by integrating them onto STORIUM, where real authors can query a model for suggested story continuations and then edit them.
arXiv Detail & Related papers (2020-10-04T23:26:09Z) - Abstractive Summarization of Spoken and Written Instructions with BERT [66.14755043607776]
We present the first application of the BERTSum model to conversational language.
We generate abstractive summaries of narrated instructional videos across a wide variety of topics.
We envision this integrated as a feature in intelligent virtual assistants, enabling them to summarize both written and spoken instructional content upon request.
arXiv Detail & Related papers (2020-08-21T20:59:34Z) - Screenplay Summarization Using Latent Narrative Structure [78.45316339164133]
We propose to explicitly incorporate the underlying structure of narratives into general unsupervised and supervised extractive summarization models.
We formalize narrative structure in terms of key narrative events (turning points) and treat it as latent in order to summarize screenplays.
Experimental results on the CSI corpus of TV screenplays, which we augment with scene-level summarization labels, show that latent turning points correlate with important aspects of a CSI episode.
arXiv Detail & Related papers (2020-04-27T11:54:19Z) - Learning to Select Bi-Aspect Information for Document-Scale Text Content
Manipulation [50.01708049531156]
We focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer.
In detail, the input is a set of structured records and a reference text for describing another recordset.
The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference.
arXiv Detail & Related papers (2020-02-24T12:52:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.