Survey on Multi-Document Summarization: Systematic Literature Review
- URL: http://arxiv.org/abs/2312.12915v1
- Date: Wed, 20 Dec 2023 10:51:35 GMT
- Title: Survey on Multi-Document Summarization: Systematic Literature Review
- Authors: Uswa Ihsan, Humaira Ashraf, NZ Jhanjhi
- Abstract summary: This study conducts a systematic literature review of existing methods for multi-document summarization.
The findings of the study show that more effective methods are still required for getting higher accuracy of these methods.
- Score: 1.1318175666743657
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this era of information technology, abundant information is available on
the internet in the form of web pages and documents on any given topic. Finding
the most relevant and informative content out of these huge number of
documents, without spending several hours of reading has become a very
challenging task. Various methods of multi-document summarization have been
developed to overcome this problem. The multi-document summarization methods
try to produce high-quality summaries of documents with low redundancy. This
study conducts a systematic literature review of existing methods for
multi-document summarization methods and provides an in-depth analysis of
performance achieved by these methods. The findings of the study show that more
effective methods are still required for getting higher accuracy of these
methods. The study also identifies some open challenges that can gain the
attention of future researchers of this domain.
Related papers
- Unified Multi-Modal Interleaved Document Representation for Information Retrieval [57.65409208879344]
We produce more comprehensive and nuanced document representations by holistically embedding documents interleaved with different modalities.
Specifically, we achieve this by leveraging the capability of recent vision-language models that enable the processing and integration of text, images, and tables into a unified format and representation.
arXiv Detail & Related papers (2024-10-03T17:49:09Z) - Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review [51.61531917413708]
Deep learning-based approaches for Key Information Extraction have been proposed under the umbrella term Document Understanding.
The goal of this systematic literature review is an in-depth analysis of existing approaches in this domain and the identification of opportunities for further research.
arXiv Detail & Related papers (2024-07-23T08:15:55Z) - Unsupervised Multi-document Summarization with Holistic Inference [41.58777650517525]
This paper proposes a new holistic framework for unsupervised multi-document extractive summarization.
Subset Representative Index (SRI) balances the importance and diversity of a subset of sentences from the source documents.
Our findings suggest that diversity is essential for improving multi-document summary performance.
arXiv Detail & Related papers (2023-09-08T02:56:30Z) - Generating a Structured Summary of Numerous Academic Papers: Dataset and
Method [20.90939310713561]
We propose BigSurvey, the first large-scale dataset for generating comprehensive summaries of numerous academic papers on each topic.
We collect target summaries from more than seven thousand survey papers and utilize their 430 thousand reference papers' abstracts as input documents.
To organize the diverse content from dozens of input documents, we propose a summarization method named category-based alignment and sparse transformer (CAST)
arXiv Detail & Related papers (2023-02-09T11:42:07Z) - Embedding Knowledge for Document Summarization: A Survey [66.76415502727802]
Previous works proved that knowledge-embedded document summarizers excel at generating superior digests.
We propose novel to recapitulate knowledge and knowledge embeddings under the document summarization view.
arXiv Detail & Related papers (2022-04-24T04:36:07Z) - Modeling Endorsement for Multi-Document Abstractive Summarization [10.166639983949887]
A crucial difference between single- and multi-document summarization is how salient content manifests itself in the document(s)
In this paper, we model the cross-document endorsement effect and its utilization in multiple document summarization.
Our method generates a synopsis from each document, which serves as an endorser to identify salient content from other documents.
arXiv Detail & Related papers (2021-10-15T03:55:42Z) - A Survey of Deep Learning Approaches for OCR and Document Understanding [68.65995739708525]
We review different techniques for document understanding for documents written in English.
We consolidate methodologies present in literature to act as a jumping-off point for researchers exploring this area.
arXiv Detail & Related papers (2020-11-27T03:05:59Z) - Multi-document Summarization via Deep Learning Techniques: A Survey [29.431160110691607]
We propose a novel taxonomy to summarize the design strategies of neural networks.
We highlight the differences between various objective functions that are rarely discussed in the existing literature.
arXiv Detail & Related papers (2020-11-10T00:35:46Z) - CAiRE-COVID: A Question Answering and Query-focused Multi-Document
Summarization System for COVID-19 Scholarly Information Management [48.251211691263514]
We present CAiRE-COVID, a real-time question answering (QA) and multi-document summarization system, which won one of the 10 tasks in the Kaggle COVID-19 Open Research dataset Challenge.
Our system aims to tackle the recent challenge of mining the numerous scientific articles being published on COVID-19 by answering high priority questions from the community.
arXiv Detail & Related papers (2020-05-04T15:07:27Z) - Explaining Relationships Between Scientific Documents [55.23390424044378]
We address the task of explaining relationships between two scientific documents using natural language text.
In this paper we establish a dataset of 622K examples from 154K documents.
arXiv Detail & Related papers (2020-02-02T03:54:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.