TL;DR Progress: Multi-faceted Literature Exploration in Text
Summarization
- URL: http://arxiv.org/abs/2402.06913v1
- Date: Sat, 10 Feb 2024 09:16:56 GMT
- Title: TL;DR Progress: Multi-faceted Literature Exploration in Text
Summarization
- Authors: Shahbaz Syed, Khalid Al-Khatib, Martin Potthast
- Abstract summary: This paper presents TL;DR Progress, a new tool for exploring the literature on neural text summarization.
It organizes 514papers based on a comprehensive annotation scheme for text summarization approaches.
- Score: 37.88261925867143
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents TL;DR Progress, a new tool for exploring the literature
on neural text summarization. It organizes 514~papers based on a comprehensive
annotation scheme for text summarization approaches and enables fine-grained,
faceted search. Each paper was manually annotated to capture aspects such as
evaluation metrics, quality dimensions, learning paradigms, challenges
addressed, datasets, and document domains. In addition, a succinct indicative
summary is provided for each paper, consisting of automatically extracted
contextual factors, issues, and proposed solutions. The tool is available
online at https://www.tldr-progress.de, a demo video at
https://youtu.be/uCVRGFvXUj8
Related papers
- Integrating Planning into Single-Turn Long-Form Text Generation [66.08871753377055]
We propose to use planning to generate long form content.
Our main novelty lies in a single auxiliary task that does not require multiple rounds of prompting or planning.
Our experiments demonstrate on two datasets from different domains, that LLMs fine-tuned with the auxiliary task generate higher quality documents.
arXiv Detail & Related papers (2024-10-08T17:02:40Z) - Automatic summarisation of Instagram social network posts Combining
semantic and statistical approaches [0.0]
A crawler has been developed to extract popular text posts from the Instagram social network with appropriate preprocessing.
Observations made on 820 popular text posts on the social network Instagram show the accuracy (80%) of the proposed system.
arXiv Detail & Related papers (2023-03-14T14:59:20Z) - TRIE++: Towards End-to-End Information Extraction from Visually Rich
Documents [51.744527199305445]
This paper proposes a unified end-to-end information extraction framework from visually rich documents.
Text reading and information extraction can reinforce each other via a well-designed multi-modal context block.
The framework can be trained in an end-to-end trainable manner, achieving global optimization.
arXiv Detail & Related papers (2022-07-14T08:52:07Z) - NEWTS: A Corpus for News Topic-Focused Summarization [9.872518517174498]
This paper introduces the first topical summarization corpus, based on the well-known CNN/Dailymail dataset.
We evaluate a range of existing techniques and analyze the effectiveness of different prompting methods.
arXiv Detail & Related papers (2022-05-31T10:01:38Z) - Summarization with Graphical Elements [55.5913491389047]
We propose a new task: summarization with graphical elements.
We collect a high quality human labeled dataset to support research into the task.
arXiv Detail & Related papers (2022-04-15T17:16:41Z) - Topic Modeling Based Extractive Text Summarization [0.0]
We propose a novel method to summarize a text document by clustering its contents based on latent topics.
We utilize the lesser used and challenging WikiHow dataset in our approach to text summarization.
arXiv Detail & Related papers (2021-06-29T12:28:19Z) - Abstractive Summarization of Spoken and Written Instructions with BERT [66.14755043607776]
We present the first application of the BERTSum model to conversational language.
We generate abstractive summaries of narrated instructional videos across a wide variety of topics.
We envision this integrated as a feature in intelligent virtual assistants, enabling them to summarize both written and spoken instructional content upon request.
arXiv Detail & Related papers (2020-08-21T20:59:34Z) - GLEAKE: Global and Local Embedding Automatic Keyphrase Extraction [1.0681288493631977]
We introduce Global and Local Embedding Automatic Keyphrase Extractor (GLEAKE) for the task of automatic keyphrase extraction.
GLEAKE uses single and multi-word embedding techniques to explore the syntactic and semantic aspects of the candidate phrases.
It refines the most significant phrases as a final set of keyphrases.
arXiv Detail & Related papers (2020-05-19T20:24:02Z) - From Standard Summarization to New Tasks and Beyond: Summarization with
Manifold Information [77.89755281215079]
Text summarization is the research area aiming at creating a short and condensed version of the original document.
In real-world applications, most of the data is not in a plain text format.
This paper focuses on the survey of these new summarization tasks and approaches in the real-world application.
arXiv Detail & Related papers (2020-05-10T14:59:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.