Scientific Paper Extractive Summarization Enhanced by Citation Graphs
- URL: http://arxiv.org/abs/2212.04214v1
- Date: Thu, 8 Dec 2022 11:53:12 GMT
- Title: Scientific Paper Extractive Summarization Enhanced by Citation Graphs
- Authors: Xiuying Chen, Mingzhe Li, Shen Gao, Rui Yan, Xin Gao, Xiangliang Zhang
- Abstract summary: We focus on leveraging citation graphs to improve scientific paper extractive summarization under different settings.
Preliminary results demonstrate that citation graph is helpful even in a simple unsupervised framework.
Motivated by this, we propose a Graph-based Supervised Summarization model (GSS) to achieve more accurate results on the task when large-scale labeled data are available.
- Score: 50.19266650000948
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In a citation graph, adjacent paper nodes share related scientific terms and
topics. The graph thus conveys unique structure information of document-level
relatedness that can be utilized in the paper summarization task, for exploring
beyond the intra-document information. In this work, we focus on leveraging
citation graphs to improve scientific paper extractive summarization under
different settings. We first propose a Multi-granularity Unsupervised
Summarization model (MUS) as a simple and low-cost solution to the task. MUS
finetunes a pre-trained encoder model on the citation graph by link prediction
tasks. Then, the abstract sentences are extracted from the corresponding paper
considering multi-granularity information. Preliminary results demonstrate that
citation graph is helpful even in a simple unsupervised framework. Motivated by
this, we next propose a Graph-based Supervised Summarization model (GSS) to
achieve more accurate results on the task when large-scale labeled data are
available. Apart from employing the link prediction as an auxiliary task, GSS
introduces a gated sentence encoder and a graph information fusion module to
take advantage of the graph information to polish the sentence representation.
Experiments on a public benchmark dataset show that MUS and GSS bring
substantial improvements over the prior state-of-the-art model.
Related papers
- Explanation Graph Generation via Generative Pre-training over Synthetic
Graphs [6.25568933262682]
The generation of explanation graphs is a significant task that aims to produce explanation graphs in response to user input.
Current research commonly fine-tunes a text-based pre-trained language model on a small downstream dataset that is annotated with labeled graphs.
We propose a novel pre-trained framework EG3P(for Explanation Graph Generation via Generative Pre-training over synthetic graphs) for the explanation graph generation task.
arXiv Detail & Related papers (2023-06-01T13:20:22Z) - Contrastive Hierarchical Discourse Graph for Scientific Document
Summarization [14.930704950433324]
CHANGES is a contrastive hierarchical graph neural network for extractive scientific paper summarization.
We also propose a graph contrastive learning module to learn global theme-aware sentence representations.
arXiv Detail & Related papers (2023-05-31T20:54:43Z) - Rethinking Explaining Graph Neural Networks via Non-parametric Subgraph
Matching [68.35685422301613]
We propose a novel non-parametric subgraph matching framework, dubbed MatchExplainer, to explore explanatory subgraphs.
It couples the target graph with other counterpart instances and identifies the most crucial joint substructure by minimizing the node corresponding-based distance.
Experiments on synthetic and real-world datasets show the effectiveness of our MatchExplainer by outperforming all state-of-the-art parametric baselines with significant margins.
arXiv Detail & Related papers (2023-01-07T05:14:45Z) - Unsupervised Extractive Summarization with Heterogeneous Graph
Embeddings for Chinese Document [5.9630342951482085]
We propose an unsupervised extractive summarizaiton method with heterogeneous graph embeddings (HGEs) for Chinese document.
Experimental results demonstrate that our method consistently outperforms the strong baseline in three summarization datasets.
arXiv Detail & Related papers (2022-11-09T06:07:31Z) - CitationIE: Leveraging the Citation Graph for Scientific Information
Extraction [89.33938657493765]
We use the citation graph of referential links between citing and cited papers.
We observe a sizable improvement in end-to-end information extraction over the state-of-the-art.
arXiv Detail & Related papers (2021-06-03T03:00:12Z) - BASS: Boosting Abstractive Summarization with Unified Semantic Graph [49.48925904426591]
BASS is a framework for Boosting Abstractive Summarization based on a unified Semantic graph.
A graph-based encoder-decoder model is proposed to improve both the document representation and summary generation process.
Empirical results show that the proposed architecture brings substantial improvements for both long-document and multi-document summarization tasks.
arXiv Detail & Related papers (2021-05-25T16:20:48Z) - Enhancing Scientific Papers Summarization with Citation Graph [78.65955304229863]
We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
arXiv Detail & Related papers (2021-04-07T11:13:35Z) - Enhancing Extractive Text Summarization with Topic-Aware Graph Neural
Networks [21.379555672973975]
This paper proposes a graph neural network (GNN)-based extractive summarization model.
Our model integrates a joint neural topic model (NTM) to discover latent topics, which can provide document-level features for sentence selection.
The experimental results demonstrate that our model achieves substantially state-of-the-art results on CNN/DM and NYT datasets.
arXiv Detail & Related papers (2020-10-13T09:30:04Z) - Leveraging Graph to Improve Abstractive Multi-Document Summarization [50.62418656177642]
We develop a neural abstractive multi-document summarization (MDS) model which can leverage well-known graph representations of documents.
Our model utilizes graphs to encode documents in order to capture cross-document relations, which is crucial to summarizing long documents.
Our model can also take advantage of graphs to guide the summary generation process, which is beneficial for generating coherent and concise summaries.
arXiv Detail & Related papers (2020-05-20T13:39:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.