A Survey on Text Simplification
- URL: http://arxiv.org/abs/2008.08612v3
- Date: Fri, 20 May 2022 14:56:47 GMT
- Title: A Survey on Text Simplification
- Authors: Punardeep Sikka and Vijay Mago
- Abstract summary: Text Simplification (TS) aims to reduce the linguistic complexity of content to make it easier to understand.
This survey seeks to provide a comprehensive overview of TS, including a brief description of earlier approaches used.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Text Simplification (TS) aims to reduce the linguistic complexity of content
to make it easier to understand. Research in TS has been of keen interest,
especially as approaches to TS have shifted from manual, hand-crafted rules to
automated simplification. This survey seeks to provide a comprehensive overview
of TS, including a brief description of earlier approaches used, discussion of
various aspects of simplification (lexical, semantic and syntactic), and latest
techniques being utilized in the field. We note that the research in the field
has clearly shifted towards utilizing deep learning techniques to perform TS,
with a specific focus on developing solutions to combat the lack of data
available for simplification. We also include a discussion of datasets and
evaluations metrics commonly used, along with discussion of related fields
within Natural Language Processing (NLP), like semantic similarity.
Related papers
- H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables [56.73919743039263]
This paper introduces a novel algorithm that integrates both symbolic and semantic (textual) approaches in a two-stage process to address limitations.
Our experiments demonstrate that H-STAR significantly outperforms state-of-the-art methods across three question-answering (QA) and fact-verification datasets.
arXiv Detail & Related papers (2024-06-29T21:24:19Z) - Exploiting Adaptive Contextual Masking for Aspect-Based Sentiment
Analysis [0.6827423171182154]
Aspect-Based Sentiment Analysis (ABSA) is a fine-grained linguistics problem that entails the extraction of multifaceted aspects, opinions, and sentiments from the given text.
We present adaptive masking methods that remove irrelevant tokens based on context to assist in Aspect Term Extraction and Aspect Sentiment Classification subtasks of ABSA.
arXiv Detail & Related papers (2024-02-21T11:33:09Z) - Thread of Thought Unraveling Chaotic Contexts [133.24935874034782]
"Thread of Thought" (ThoT) strategy draws inspiration from human cognitive processes.
In experiments, ThoT significantly improves reasoning performance compared to other prompting techniques.
arXiv Detail & Related papers (2023-11-15T06:54:44Z) - Elaborative Simplification as Implicit Questions Under Discussion [51.17933943734872]
This paper proposes to view elaborative simplification through the lens of the Question Under Discussion (QUD) framework.
We show that explicitly modeling QUD provides essential understanding of elaborative simplification and how the elaborations connect with the rest of the discourse.
arXiv Detail & Related papers (2023-05-17T17:26:16Z) - Lexical Complexity Prediction: An Overview [13.224233182417636]
The occurrence of unknown words in texts significantly hinders reading comprehension.
computational modelling has been applied to identify complex words in texts and substitute them for simpler alternatives.
We present an overview of computational approaches to lexical complexity prediction focusing on the work carried out on English data.
arXiv Detail & Related papers (2023-03-08T19:35:08Z) - Cognitive Simplification Operations Improve Text Simplification [24.970301040693883]
We present a method for incorporating knowledge from the cognitive accessibility domain into a Text Simplification model.
We show that by adding this inductive bias to a TS-trained model, it is able to adapt better to Cognitive Simplification without ever seeing CS data.
arXiv Detail & Related papers (2022-11-16T10:51:03Z) - An Inclusive Notion of Text [69.36678873492373]
We argue that clarity on the notion of text is crucial for reproducible and generalizable NLP.
We introduce a two-tier taxonomy of linguistic and non-linguistic elements that are available in textual sources and can be used in NLP modeling.
arXiv Detail & Related papers (2022-11-10T14:26:43Z) - Revisiting the Roles of "Text" in Text Games [102.22750109468652]
This paper investigates the roles of text in the face of different reinforcement learning challenges.
We propose a simple scheme to extract relevant contextual information into an approximate state hash.
Such a lightweight plug-in achieves competitive performance with state-of-the-art text agents.
arXiv Detail & Related papers (2022-10-15T21:52:39Z) - Unsupervised Simplification of Legal Texts [0.0]
We introduce an unsupervised simplification method for legal texts (USLT)
USLT performs domain-specific TS by replacing complex words and splitting long sentences.
We demonstrate that USLT outperforms state-of-the-art domain-general TS methods in text simplicity while keeping the semantics intact.
arXiv Detail & Related papers (2022-09-01T15:58:12Z) - Towards Accurate Scene Text Recognition with Semantic Reasoning Networks [52.86058031919856]
We propose a novel end-to-end trainable framework named semantic reasoning network (SRN) for accurate scene text recognition.
GSRM is introduced to capture global semantic context through multi-way parallel transmission.
Results on 7 public benchmarks, including regular text, irregular text and non-Latin long text, verify the effectiveness and robustness of the proposed method.
arXiv Detail & Related papers (2020-03-27T09:19:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.