Extracting Summary Knowledge Graphs from Long Documents
- URL: http://arxiv.org/abs/2009.09162v2
- Date: Mon, 14 Jun 2021 05:55:46 GMT
- Title: Extracting Summary Knowledge Graphs from Long Documents
- Authors: Zeqiu Wu, Rik Koncel-Kedziorski, Mari Ostendorf, Hannaneh Hajishirzi
- Abstract summary: We introduce a new text-to-graph task of predicting summarized knowledge graphs from long documents.
We develop a dataset of 200k document/graph pairs using automatic and human annotations.
- Score: 48.92130466606231
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Knowledge graphs capture entities and relations from long documents and can
facilitate reasoning in many downstream applications. Extracting compact
knowledge graphs containing only salient entities and relations is important
but challenging for understanding and summarizing long documents. We introduce
a new text-to-graph task of predicting summarized knowledge graphs from long
documents. We develop a dataset of 200k document/graph pairs using automatic
and human annotations. We also develop strong baselines for this task based on
graph learning and text summarization, and provide quantitative and qualitative
studies of their effect.
Related papers
- Doc2Graph: a Task Agnostic Document Understanding Framework based on
Graph Neural Networks [0.965964228590342]
We propose Doc2Graph, a task-agnostic document understanding framework based on a GNN model.
We evaluate our approach on two challenging datasets for key information extraction in form understanding, invoice layout analysis and table detection.
arXiv Detail & Related papers (2022-08-23T19:48:10Z) - Knowledge Graph Enhanced Event Extraction in Financial Documents [0.12891210250935145]
We propose a first event extraction framework that embeds a knowledge graph through a Graph Neural Network.
For extracting events from Chinese financial announcements, our method outperforms the state-of-the-art method by 5.3% in F1-score.
arXiv Detail & Related papers (2021-09-06T16:35:15Z) - BASS: Boosting Abstractive Summarization with Unified Semantic Graph [49.48925904426591]
BASS is a framework for Boosting Abstractive Summarization based on a unified Semantic graph.
A graph-based encoder-decoder model is proposed to improve both the document representation and summary generation process.
Empirical results show that the proposed architecture brings substantial improvements for both long-document and multi-document summarization tasks.
arXiv Detail & Related papers (2021-05-25T16:20:48Z) - Enhancing Extractive Text Summarization with Topic-Aware Graph Neural
Networks [21.379555672973975]
This paper proposes a graph neural network (GNN)-based extractive summarization model.
Our model integrates a joint neural topic model (NTM) to discover latent topics, which can provide document-level features for sentence selection.
The experimental results demonstrate that our model achieves substantially state-of-the-art results on CNN/DM and NYT datasets.
arXiv Detail & Related papers (2020-10-13T09:30:04Z) - Leveraging Graph to Improve Abstractive Multi-Document Summarization [50.62418656177642]
We develop a neural abstractive multi-document summarization (MDS) model which can leverage well-known graph representations of documents.
Our model utilizes graphs to encode documents in order to capture cross-document relations, which is crucial to summarizing long documents.
Our model can also take advantage of graphs to guide the summary generation process, which is beneficial for generating coherent and concise summaries.
arXiv Detail & Related papers (2020-05-20T13:39:47Z) - ENT-DESC: Entity Description Generation by Exploring Knowledge Graph [53.03778194567752]
In practice, the input knowledge could be more than enough, since the output description may only cover the most significant knowledge.
We introduce a large-scale and challenging dataset to facilitate the study of such a practical scenario in KG-to-text.
We propose a multi-graph structure that is able to represent the original graph information more comprehensively.
arXiv Detail & Related papers (2020-04-30T14:16:19Z) - Exploiting Structured Knowledge in Text via Graph-Guided Representation
Learning [73.0598186896953]
We present two self-supervised tasks learning over raw text with the guidance from knowledge graphs.
Building upon entity-level masked language models, our first contribution is an entity masking scheme.
In contrast to existing paradigms, our approach uses knowledge graphs implicitly, only during pre-training.
arXiv Detail & Related papers (2020-04-29T14:22:42Z) - Heterogeneous Graph Neural Networks for Extractive Document
Summarization [101.17980994606836]
Cross-sentence relations are a crucial step in extractive document summarization.
We present a graph-based neural network for extractive summarization (HeterSumGraph)
We introduce different types of nodes into graph-based neural networks for extractive document summarization.
arXiv Detail & Related papers (2020-04-26T14:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.