EntSUM: A Data Set for Entity-Centric Summarization
- URL: http://arxiv.org/abs/2204.02213v1
- Date: Tue, 5 Apr 2022 13:45:54 GMT
- Title: EntSUM: A Data Set for Entity-Centric Summarization
- Authors: Mounica Maddela, Mayank Kulkarni and Daniel Preotiuc-Pietro
- Abstract summary: Controllable summarization aims to provide summaries that take into account user-specified aspects and preferences.
We introduce a human-annotated data setSUM for controllable summarization with a focus on named entities as the aspects to control.
- Score: 27.845014142019917
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Controllable summarization aims to provide summaries that take into account
user-specified aspects and preferences to better assist them with their
information need, as opposed to the standard summarization setup which build a
single generic summary of a document. We introduce a human-annotated data set
EntSUM for controllable summarization with a focus on named entities as the
aspects to control. We conduct an extensive quantitative analysis to motivate
the task of entity-centric summarization and show that existing methods for
controllable summarization fail to generate entity-centric summaries. We
propose extensions to state-of-the-art summarization approaches that achieve
substantially better results on our data set. Our analysis and results show the
challenging nature of this task and of the proposed data set.
Related papers
- SummIt: Iterative Text Summarization via ChatGPT [12.966825834765814]
We propose SummIt, an iterative text summarization framework based on large language models like ChatGPT.
Our framework enables the model to refine the generated summary iteratively through self-evaluation and feedback.
We also conduct a human evaluation to validate the effectiveness of the iterative refinements and identify a potential issue of over-correction.
arXiv Detail & Related papers (2023-05-24T07:40:06Z) - UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot
Summarization [54.59104881168188]
textscUniSumm is a unified few-shot summarization model pre-trained with multiple summarization tasks.
textscSummZoo is a new benchmark to better evaluate few-shot summarizers.
arXiv Detail & Related papers (2022-11-17T18:54:47Z) - MACSum: Controllable Summarization with Mixed Attributes [56.685735509260276]
MACSum is the first human-annotated summarization dataset for controlling mixed attributes.
We propose two simple and effective parameter-efficient approaches for the new task of mixed controllable summarization.
arXiv Detail & Related papers (2022-11-09T17:17:37Z) - Guided Exploration of Data Summaries [24.16170440895994]
A useful summary contains k individually uniform sets that are collectively diverse to be representative.
Finding such as summary is a difficult task when data is highly diverse and large.
We examine the applicability of Exploratory Data Analysis (EDA) to data summarization and formalize Eda4Sum.
arXiv Detail & Related papers (2022-05-27T13:06:27Z) - Unsupervised Summarization with Customized Granularities [76.26899748972423]
We propose the first unsupervised multi-granularity summarization framework, GranuSum.
By inputting different numbers of events, GranuSum is capable of producing multi-granular summaries in an unsupervised manner.
arXiv Detail & Related papers (2022-01-29T05:56:35Z) - Controllable Summarization with Constrained Markov Decision Process [50.04321779376415]
We study controllable text summarization which allows users to gain control on a particular attribute.
We propose a novel training framework based on Constrained Markov Decision Process (CMDP)
Our framework can be applied to control important attributes of summarization, including length, covered entities, and abstractiveness.
arXiv Detail & Related papers (2021-08-07T09:12:53Z) - AgreeSum: Agreement-Oriented Multi-Document Summarization [3.4743618614284113]
Given a cluster of articles, the goal is to provide abstractive summaries that represent information common and faithful to all input articles.
We create a dataset for AgreeSum, and provide annotations on articlesummary entailment relations for a subset of the clusters in the dataset.
arXiv Detail & Related papers (2021-06-04T06:17:49Z) - Entity-level Factual Consistency of Abstractive Text Summarization [26.19686599842915]
Key challenge for abstractive summarization is ensuring factual consistency of the generated summary with respect to the original document.
We propose a set of new metrics to quantify the entity-level factual consistency of generated summaries.
arXiv Detail & Related papers (2021-02-18T03:07:28Z) - Abstractive Query Focused Summarization with Query-Free Resources [60.468323530248945]
In this work, we consider the problem of leveraging only generic summarization resources to build an abstractive QFS system.
We propose Marge, a Masked ROUGE Regression framework composed of a novel unified representation for summaries and queries.
Despite learning from minimal supervision, our system achieves state-of-the-art results in the distantly supervised setting.
arXiv Detail & Related papers (2020-12-29T14:39:35Z) - CTRLsum: Towards Generic Controllable Text Summarization [54.69190421411766]
We presentsum, a novel framework for controllable summarization.
Our approach enables users to control multiple aspects of generated summaries by interacting with the summarization system.
Using a single unified model,sum is able to achieve a broad scope of summary manipulation at inference time.
arXiv Detail & Related papers (2020-12-08T08:54:36Z) - Self-Supervised and Controlled Multi-Document Opinion Summarization [16.674646504295687]
We propose a self-supervised setup that considers an individual document as a target summary for a set of similar documents.
We address the problem of hallucinations through the use of control codes.
Our benchmarks on two datasets against graph-based and recent neural abstractive unsupervised models show that our proposed method generates summaries with a superior quality and relevance.
arXiv Detail & Related papers (2020-04-30T13:20:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.