Handwriting Quality Analysis using Online-Offline Models
- URL: http://arxiv.org/abs/2010.06693v1
- Date: Fri, 9 Oct 2020 14:33:56 GMT
- Title: Handwriting Quality Analysis using Online-Offline Models
- Authors: Yahia Hamdi, Hanen Akouaydi, Houcine Boubaker, Adel M. Alimi
- Abstract summary: This work is part of an innovative e-learning project allowing the development of an advanced digital educational tool.
It automatically detects mistakes, gives real-time on-line feedback for children's writing, and helps teachers comprehend and evaluate children's writing skills.
- Score: 4.61479186986544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work is part of an innovative e-learning project allowing the
development of an advanced digital educational tool that provides feedback
during the process of learning handwriting for young school children (three to
eight years old). In this paper, we describe a new method for children
handwriting quality analysis. It automatically detects mistakes, gives
real-time on-line feedback for children's writing, and helps teachers
comprehend and evaluate children's writing skills. The proposed method adjudges
five main criteria shape, direction, stroke order, position respect to the
reference lines, and kinematics of the trace. It analyzes the handwriting
quality and automatically gives feedback based on the combination of three
extracted models: Beta-Elliptic Model (BEM) using similarity detection (SD) and
dissimilarity distance (DD) measure, Fourier Descriptor Model (FDM), and
perceptive Convolutional Neural Network (CNN) with Support Vector Machine (SVM)
comparison engine. The originality of our work lies partly in the system
architecture which apprehends complementary dynamic, geometric, and visual
representation of the examined handwritten scripts and in the efficient
selected features adapted to various handwriting styles and multiple script
languages such as Arabic, Latin, digits, and symbol drawing. The application
offers two interactive interfaces respectively dedicated to learners,
educators, experts or teachers and allows them to adapt it easily to the
specificity of their disciples. The evaluation of our framework is enhanced by
a database collected in Tunisia primary school with 400 children. Experimental
results show the efficiency and robustness of our suggested framework that
helps teachers and children by offering positive feedback throughout the
handwriting learning process using tactile digital devices.
Related papers
- Towards Accessible Learning: Deep Learning-Based Potential Dysgraphia Detection and OCR for Potentially Dysgraphic Handwriting [1.9575346216959502]
Dysgraphia is a learning disorder that affects handwriting abilities.
Early detection and monitoring are crucial for providing timely support and interventions.
This study applies deep learning techniques to address the dual tasks of dysgraphia detection and optical character recognition.
arXiv Detail & Related papers (2024-11-18T13:28:26Z) - FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction [66.98008357232428]
We propose FineMatch, a new aspect-based fine-grained text and image matching benchmark.
FineMatch focuses on text and image mismatch detection and correction.
We show that models trained on FineMatch demonstrate enhanced proficiency in detecting fine-grained text and image mismatches.
arXiv Detail & Related papers (2024-04-23T03:42:14Z) - Representing Online Handwriting for Recognition in Large Vision-Language
Models [8.344510330567495]
We propose a novel tokenized representation of digital ink (online handwriting) that includes both a time-ordered sequence of strokes as text, and as image.
We show that this representation yields results comparable to or better than state-of-the-art online handwriting recognizers.
arXiv Detail & Related papers (2024-02-23T13:11:10Z) - Self-Supervised Representation Learning for Online Handwriting Text
Classification [0.8594140167290099]
We propose the novel Part of Stroke Masking (POSM) as a pretext task for pretraining models to extract informative representations from the online handwriting of individuals in English and Chinese languages.
To evaluate the quality of the extracted representations, we use both intrinsic and extrinsic evaluation methods.
The pretrained models are fine-tuned to achieve state-of-the-art results in tasks such as writer identification, gender classification, and handedness classification.
arXiv Detail & Related papers (2023-10-10T14:07:49Z) - OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models [122.27878464009181]
We conducted a comprehensive evaluation of Large Multimodal Models, such as GPT4V and Gemini, in various text-related visual tasks.
OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available.
arXiv Detail & Related papers (2023-05-13T11:28:37Z) - EmbedDistill: A Geometric Knowledge Distillation for Information
Retrieval [83.79667141681418]
Large neural models (such as Transformers) achieve state-of-the-art performance for information retrieval (IR)
We propose a novel distillation approach that leverages the relative geometry among queries and documents learned by the large teacher model.
We show that our approach successfully distills from both dual-encoder (DE) and cross-encoder (CE) teacher models to 1/10th size asymmetric students that can retain 95-97% of the teacher performance.
arXiv Detail & Related papers (2023-01-27T22:04:37Z) - PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage.
Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors.
We propose a contrastively trained model fit to learn textbfauthorship embeddings instead of semantics.
arXiv Detail & Related papers (2022-09-30T11:08:39Z) - SURDS: Self-Supervised Attention-guided Reconstruction and Dual Triplet
Loss for Writer Independent Offline Signature Verification [16.499360910037904]
Offline Signature Verification (OSV) is a fundamental biometric task across various forensic, commercial and legal applications.
We propose a two-stage deep learning framework that leverages self-supervised representation learning as well as metric learning for writer-independent OSV.
The proposed framework has been evaluated on two publicly available offline signature datasets and compared with various state-of-the-art methods.
arXiv Detail & Related papers (2022-01-25T07:26:55Z) - Continuous Offline Handwriting Recognition using Deep Learning Models [0.0]
Handwritten text recognition is an open problem of great interest in the area of automatic document image analysis.
We have proposed a new recognition model based on integrating two types of deep learning architectures: convolutional neural networks (CNN) and sequence-to-sequence (seq2seq)
The new proposed model provides competitive results with those obtained with other well-established methodologies.
arXiv Detail & Related papers (2021-12-26T07:31:03Z) - Digital Editions as Distant Supervision for Layout Analysis of Printed
Books [76.29918490722902]
We describe methods for exploiting this semantic markup as distant supervision for training and evaluating layout analysis models.
In experiments with several model architectures on the half-million pages of the Deutsches Textarchiv (DTA), we find a high correlation of these region-level evaluation methods with pixel-level and word-level metrics.
We discuss the possibilities for improving accuracy with self-training and the ability of models trained on the DTA to generalize to other historical printed books.
arXiv Detail & Related papers (2021-12-23T16:51:53Z) - Object Relational Graph with Teacher-Recommended Learning for Video
Captioning [92.48299156867664]
We propose a complete video captioning system including both a novel model and an effective training strategy.
Specifically, we propose an object relational graph (ORG) based encoder, which captures more detailed interaction features to enrich visual representation.
Meanwhile, we design a teacher-recommended learning (TRL) method to make full use of the successful external language model (ELM) to integrate the abundant linguistic knowledge into the caption model.
arXiv Detail & Related papers (2020-02-26T15:34:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.