Related papers: Interpreting Themes from Educational Stories

Interpreting Themes from Educational Stories

URL: http://arxiv.org/abs/2404.05250v1
Date: Mon, 8 Apr 2024 07:26:27 GMT
Title: Interpreting Themes from Educational Stories
Authors: Yigeng Zhang, Fabio A. González, Thamar Solorio,
Abstract summary: We introduce the first dataset specifically designed for interpretive comprehension of educational narratives. The dataset spans a variety of genres and cultural origins and includes human-annotated theme keywords. We formulate NLP tasks under different abstractions of interpretive comprehension toward the main idea of a story.
Score: 9.608135094187912
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Reading comprehension continues to be a crucial research focus in the NLP community. Recent advances in Machine Reading Comprehension (MRC) have mostly centered on literal comprehension, referring to the surface-level understanding of content. In this work, we focus on the next level - interpretive comprehension, with a particular emphasis on inferring the themes of a narrative text. We introduce the first dataset specifically designed for interpretive comprehension of educational narratives, providing corresponding well-edited theme texts. The dataset spans a variety of genres and cultural origins and includes human-annotated theme keywords with varying levels of granularity. We further formulate NLP tasks under different abstractions of interpretive comprehension toward the main idea of a story. After conducting extensive experiments with state-of-the-art methods, we found the task to be both challenging and significant for NLP research. The dataset and source code have been made publicly available to the research community at https://github.com/RiTUAL-UH/EduStory.

Related papers

Reading Between the Lines: A dataset and a study on why some texts are tougher than others [0.20482269513546458]
Our research aims at better understanding what makes a text difficult to read for specific audiences with intellectual disabilities. We introduce a scheme for the annotation of difficulties based on empirical research in psychology. We fine-tuned four different pre-trained transformer models to perform the task of multiclass classification.
arXiv Detail & Related papers (2025-01-03T13:09:46Z)
The Text Classification Pipeline: Starting Shallow going Deeper [4.97309503788908]
The past decade has seen deep learning revolutionize text classification. English is the predominant language of focus, despite studies involving Arabic, Chinese, Hindi, and others. This work integrates traditional and contemporary text mining methodologies, fostering a holistic understanding of text classification.
arXiv Detail & Related papers (2024-12-30T23:01:19Z)
Recent Trends in Linear Text Segmentation: a Survey [10.740243165055743]
The field of Natural Language Processing has recently seen a lot of interest as a result of the surge of text, video, and audio available on the web. We provide an extensive overview of current advances in linear text segmentation, describing the state of the art in terms of resources and approaches for the task.
arXiv Detail & Related papers (2024-11-25T17:48:59Z)
Think from Words(TFW): Initiating Human-Like Cognition in Large Language Models Through Think from Words for Japanese Text-level Classification [0.0]
"Think from Words" (TFW) initiates the comprehension process at the word level and then extends it to encompass the entire text. "TFW with Extra word-level information" (TFW Extra) augmenting comprehension with additional word-level data. Our findings shed light on the impact of various word-level information types on LLMs' text comprehension.
arXiv Detail & Related papers (2023-12-06T12:34:46Z)
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review [2.4185510826808487]
Deep learning has revolutionized natural language processing (NLP) by enabling the development of models that can learn complex representations of language data. Deep learning models for NLP typically use large amounts of data to train deep neural networks, allowing them to learn the patterns and relationships in language data. Applying deep learning to text summarization refers to the use of deep neural networks to perform text summarization tasks.
arXiv Detail & Related papers (2023-10-13T21:24:37Z)
How learners produce data from text in classifying clickbait [0.0]
This study investigates how students reason with text data in scenarios designed to elicit certain aspects of the domain. Our goal was to shed light on students' understanding of text as data using a motivating task to classify headlines as "clickbait" or "news"
arXiv Detail & Related papers (2023-01-28T20:23:39Z)
An Inclusive Notion of Text [69.36678873492373]
We argue that clarity on the notion of text is crucial for reproducible and generalizable NLP. We introduce a two-tier taxonomy of linguistic and non-linguistic elements that are available in textual sources and can be used in NLP modeling.
arXiv Detail & Related papers (2022-11-10T14:26:43Z)
SCROLLS: Standardized CompaRison Over Long Language Sequences [62.574959194373264]
We introduce SCROLLS, a suite of tasks that require reasoning over long texts. SCROLLS contains summarization, question answering, and natural language inference tasks. We make all datasets available in a unified text-to-text format and host a live leaderboard to facilitate research on model architecture and pretraining methods.
arXiv Detail & Related papers (2022-01-10T18:47:15Z)
Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey [54.34370423151014]
This paper surveys the components of modeling approaches relaying task impacts across various generation tasks such as storytelling, summarization, translation etc. We present an abstraction of the imperative techniques with respect to learning paradigms, pretraining, modeling approaches, decoding and the key challenges outstanding in the field in each of them.
arXiv Detail & Related papers (2020-10-14T17:54:42Z)
Abstractive Summarization of Spoken and Written Instructions with BERT [66.14755043607776]
We present the first application of the BERTSum model to conversational language. We generate abstractive summaries of narrated instructional videos across a wide variety of topics. We envision this integrated as a feature in intelligent virtual assistants, enabling them to summarize both written and spoken instructional content upon request.
arXiv Detail & Related papers (2020-08-21T20:59:34Z)
The Shmoop Corpus: A Dataset of Stories with Loosely Aligned Summaries [72.48439126769627]
We introduce the Shmoop Corpus: a dataset of 231 stories paired with detailed multi-paragraph summaries for each individual chapter. From the corpus, we construct a set of common NLP tasks, including Cloze-form question answering and a simplified form of abstractive summarization. We believe that the unique structure of this corpus provides an important foothold towards making machine story comprehension more approachable.
arXiv Detail & Related papers (2019-12-30T21:03:59Z)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer [64.22926988297685]
Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP) In this paper, we explore the landscape of introducing transfer learning techniques for NLP by a unified framework that converts all text-based language problems into a text-to-text format.
arXiv Detail & Related papers (2019-10-23T17:37:36Z)

This list is automatically generated from the titles and abstracts of the papers in this site.