Enhancing Scientific Papers Summarization with Citation Graph
- URL: http://arxiv.org/abs/2104.03057v1
- Date: Wed, 7 Apr 2021 11:13:35 GMT
- Title: Enhancing Scientific Papers Summarization with Citation Graph
- Authors: Chenxin An, Ming Zhong, Yiran Chen, Danqing Wang, Xipeng Qiu, Xuanjing
Huang
- Abstract summary: We redefine the task of scientific papers summarization by utilizing their citation graph.
We construct a novel scientific papers summarization dataset Semantic Scholar Network (SSN) which contains 141K research papers in different domains.
Our model can achieve competitive performance when compared with the pretrained models.
- Score: 78.65955304229863
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Previous work for text summarization in scientific domain mainly focused on
the content of the input document, but seldom considering its citation network.
However, scientific papers are full of uncommon domain-specific terms, making
it almost impossible for the model to understand its true meaning without the
help of the relevant research community. In this paper, we redefine the task of
scientific papers summarization by utilizing their citation graph and propose a
citation graph-based summarization model CGSum which can incorporate the
information of both the source paper and its references. In addition, we
construct a novel scientific papers summarization dataset Semantic Scholar
Network (SSN) which contains 141K research papers in different domains and 661K
citation relationships. The entire dataset constitutes a large connected
citation graph. Extensive experiments show that our model can achieve
competitive performance when compared with the pretrained models even with a
simple architecture. The results also indicates the citation graph is crucial
to better understand the content of papers and generate high-quality summaries.
Related papers
- Context-Enhanced Language Models for Generating Multi-Paper Citations [35.80247519023821]
We propose a method that leverages Large Language Models (LLMs) to generate multi-citation sentences.
Our approach involves a single source paper and a collection of target papers, culminating in a coherent paragraph containing multi-sentence citation text.
arXiv Detail & Related papers (2024-04-22T04:30:36Z) - Contrastive Hierarchical Discourse Graph for Scientific Document
Summarization [14.930704950433324]
CHANGES is a contrastive hierarchical graph neural network for extractive scientific paper summarization.
We also propose a graph contrastive learning module to learn global theme-aware sentence representations.
arXiv Detail & Related papers (2023-05-31T20:54:43Z) - H2CGL: Modeling Dynamics of Citation Network for Impact Prediction [13.00224680454585]
We construct hierarchical and heterogeneous graphs for target papers with an annual perspective.
The graphs can record the annual dynamics of target papers' scientific context information.
A novel graph neural network, Hierarchical and Heterogeneous Contrastive Graph Learning Model, is proposed.
arXiv Detail & Related papers (2023-04-16T13:04:32Z) - CiteBench: A benchmark for Scientific Citation Text Generation [69.37571393032026]
CiteBench is a benchmark for citation text generation.
We make the code for CiteBench publicly available at https://github.com/UKPLab/citebench.
arXiv Detail & Related papers (2022-12-19T16:10:56Z) - Scientific Paper Extractive Summarization Enhanced by Citation Graphs [50.19266650000948]
We focus on leveraging citation graphs to improve scientific paper extractive summarization under different settings.
Preliminary results demonstrate that citation graph is helpful even in a simple unsupervised framework.
Motivated by this, we propose a Graph-based Supervised Summarization model (GSS) to achieve more accurate results on the task when large-scale labeled data are available.
arXiv Detail & Related papers (2022-12-08T11:53:12Z) - Deep Graph Learning for Anomalous Citation Detection [55.81334139806342]
We propose a novel deep graph learning model, namely GLAD (Graph Learning for Anomaly Detection), to identify anomalies in citation networks.
Within the GLAD framework, we propose an algorithm called CPU (Citation PUrpose) to discover the purpose of citation based on citation texts.
arXiv Detail & Related papers (2022-02-23T09:05:28Z) - CitationIE: Leveraging the Citation Graph for Scientific Information
Extraction [89.33938657493765]
We use the citation graph of referential links between citing and cited papers.
We observe a sizable improvement in end-to-end information extraction over the state-of-the-art.
arXiv Detail & Related papers (2021-06-03T03:00:12Z) - What's New? Summarizing Contributions in Scientific Literature [85.95906677964815]
We introduce a new task of disentangled paper summarization, which seeks to generate separate summaries for the paper contributions and the context of the work.
We extend the S2ORC corpus of academic articles by adding disentangled "contribution" and "context" reference labels.
We propose a comprehensive automatic evaluation protocol which reports the relevance, novelty, and disentanglement of generated outputs.
arXiv Detail & Related papers (2020-11-06T02:23:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.