Predicting Text Readability from Scrolling Interactions
- URL: http://arxiv.org/abs/2105.06354v1
- Date: Thu, 13 May 2021 15:27:00 GMT
- Title: Predicting Text Readability from Scrolling Interactions
- Authors: Sian Gooding, Yevgeni Berzak, Tony Mak, Matt Sharifi
- Abstract summary: This paper investigates how scrolling behaviour relates to the readability of a text.
We make our dataset publicly available and show that there are statistically significant differences in the way readers interact with text depending on the text level.
- Score: 6.530293714772306
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Judging the readability of text has many important applications, for instance
when performing text simplification or when sourcing reading material for
language learners. In this paper, we present a 518 participant study which
investigates how scrolling behaviour relates to the readability of a text. We
make our dataset publicly available and show that (1) there are statistically
significant differences in the way readers interact with text depending on the
text level, (2) such measures can be used to predict the readability of text,
and (3) the background of a reader impacts their reading interactions and the
factors contributing to text difficulty.
Related papers
- Evaluating Text Classification Robustness to Part-of-Speech Adversarial Examples [0.6445605125467574]
Adversarial examples are inputs that are designed to trick the decision making process, and are intended to be imperceptible to humans.
For text-based classification systems, changes to the input, a string of text, are always perceptible.
To improve the quality of text-based adversarial examples, we need to know what elements of the input text are worth focusing on.
arXiv Detail & Related papers (2024-08-15T18:33:54Z) - Discovering influential text using convolutional neural networks [0.0]
We present a method for discovering clusters of similar text phrases that are predictive of human reactions to texts using convolutional neural networks.
We apply the method to two datasets. The first enables direct validation of the model's ability to detect phrases known to cause the outcome.
In both cases, the model learns a greater variety of text treatments compared to benchmark methods, and these text features quantitatively meet or exceed the ability of benchmark methods to predict the outcome.
arXiv Detail & Related papers (2024-06-14T14:41:44Z) - Generating Summaries with Controllable Readability Levels [67.34087272813821]
Several factors affect the readability level, such as the complexity of the text, its subject matter, and the reader's background knowledge.
Current text generation approaches lack refined control, resulting in texts that are not customized to readers' proficiency levels.
We develop three text generation techniques for controlling readability: instruction-based readability control, reinforcement learning to minimize the gap between requested and observed readability, and a decoding approach that uses look-ahead to estimate the readability of upcoming decoding steps.
arXiv Detail & Related papers (2023-10-16T17:46:26Z) - Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models [2.913033886371052]
We propose a novel readability-controlled text modification task.
The task requires the generation of 8 versions at various target readability levels for each input text.
We find greater drops in semantic and lexical similarity between the source and target texts with greater shifts in the readability.
arXiv Detail & Related papers (2023-09-22T00:47:18Z) - How learners produce data from text in classifying clickbait [0.0]
This study investigates how students reason with text data in scenarios designed to elicit certain aspects of the domain.
Our goal was to shed light on students' understanding of text as data using a motivating task to classify headlines as "clickbait" or "news"
arXiv Detail & Related papers (2023-01-28T20:23:39Z) - Textual Entailment Recognition with Semantic Features from Empirical
Text Representation [60.31047947815282]
A text entails a hypothesis if and only if the true value of the hypothesis follows the text.
In this paper, we propose a novel approach to identifying the textual entailment relationship between text and hypothesis.
We employ an element-wise Manhattan distance vector-based feature that can identify the semantic entailment relationship between the text-hypothesis pair.
arXiv Detail & Related papers (2022-10-18T10:03:51Z) - CORE-Text: Improving Scene Text Detection with Contrastive Relational
Reasoning [65.57338873921168]
Localizing text instances in natural scenes is regarded as a fundamental challenge in computer vision.
In this work, we quantitatively analyze the sub-text problem and present a simple yet effective design, COntrastive RElation (CORE) module.
We integrate the CORE module into a two-stage text detector of Mask R-CNN and devise our text detector CORE-Text.
arXiv Detail & Related papers (2021-12-14T16:22:25Z) - Readability Research: An Interdisciplinary Approach [62.03595526230364]
We aim to provide a firm foundation for readability research, a comprehensive framework for readability research.
Readability refers to aspects of visual information design which impact information flow from the page to the reader.
These aspects can be modified on-demand, instantly improving the ease with which a reader can process and derive meaning from text.
arXiv Detail & Related papers (2021-07-20T16:52:17Z) - Improving Machine Reading Comprehension with Contextualized Commonsense
Knowledge [62.46091695615262]
We aim to extract commonsense knowledge to improve machine reading comprehension.
We propose to represent relations implicitly by situating structured knowledge in a context.
We employ a teacher-student paradigm to inject multiple types of contextualized knowledge into a student machine reader.
arXiv Detail & Related papers (2020-09-12T17:20:01Z) - TRIE: End-to-End Text Reading and Information Extraction for Document
Understanding [56.1416883796342]
We propose a unified end-to-end text reading and information extraction network.
multimodal visual and textual features of text reading are fused for information extraction.
Our proposed method significantly outperforms the state-of-the-art methods in both efficiency and accuracy.
arXiv Detail & Related papers (2020-05-27T01:47:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.