Explainable AI in Handwriting Detection for Dyslexia Using Transfer Learning
- URL: http://arxiv.org/abs/2410.19821v1
- Date: Fri, 18 Oct 2024 11:14:54 GMT
- Title: Explainable AI in Handwriting Detection for Dyslexia Using Transfer Learning
- Authors: Mahmoud Robaa, Mazen Balat, Rewaa Awaad, Esraa Omar, Salah A. Aly,
- Abstract summary: We propose an explainable AI (XAI) framework for dyslexia detection through handwriting analysis.
Our approach surpasses state-of-the-art methods, achieving a test accuracy of 0.9958.
This framework not only improves diagnostic accuracy but also fosters trust and understanding among educators, clinicians, and parents.
- Score: 0.0
- License:
- Abstract: Dyslexia is one of the most common learning disorders, often characterized by distinct features in handwriting. Early detection is essential for effective intervention. In this paper, we propose an explainable AI (XAI) framework for dyslexia detection through handwriting analysis, utilizing transfer learning and transformer-based models. Our approach surpasses state-of-the-art methods, achieving a test accuracy of 0.9958, while ensuring model interpretability through Grad-CAM visualizations that highlight the critical handwriting features influencing model decisions. The main contributions of this work include the integration of XAI for enhanced interpretability, adaptation to diverse languages and writing systems, and demonstration of the method's global applicability. This framework not only improves diagnostic accuracy but also fosters trust and understanding among educators, clinicians, and parents, supporting earlier diagnoses and the development of personalized educational strategies.
Related papers
- Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease [52.46922921214341]
Alzheimer's disease (AD) has become one of the most significant health challenges in an aging society.
We devised an explainable and effective feature set that leverages the visual capabilities of a large language model (LLM) and the Term Frequency-Inverse Document Frequency (TF-IDF) model.
Our new features can be well explained and interpreted step by step which enhance the interpretability of automatic AD screening.
arXiv Detail & Related papers (2024-11-28T05:23:22Z) - Towards Accessible Learning: Deep Learning-Based Potential Dysgraphia Detection and OCR for Potentially Dysgraphic Handwriting [1.9575346216959502]
Dysgraphia is a learning disorder that affects handwriting abilities.
Early detection and monitoring are crucial for providing timely support and interventions.
This study applies deep learning techniques to address the dual tasks of dysgraphia detection and optical character recognition.
arXiv Detail & Related papers (2024-11-18T13:28:26Z) - Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech [13.700867213652648]
Speech impairments in Parkinson's disease (PD) provide significant early indicators for diagnosis.
Models for speech-based PD detection have shown strong performance, but their interpretability remains underexplored.
This study systematically evaluates several explainability methods to identify PD-specific speech features.
arXiv Detail & Related papers (2024-11-12T18:43:27Z) - Dynamically enhanced static handwriting representation for Parkinson's disease detection [8.26914435242875]
Handwriting plays a special role in the context of Parkinson's disease (PD) assessment.
In this paper, the discriminating power of "dynamically enhanced" static images of handwriting is investigated.
arXiv Detail & Related papers (2024-05-22T08:28:42Z) - Handwriting Anomalies and Learning Disabilities through Recurrent Neural Networks and Geometric Pattern Analysis [0.0]
This study utilizes advanced geometrical patterns and recurrent neural networks (RNN) to identify handwriting anomalies indicative of dyslexia and dysgraphia.
Handwriting is first standardized, followed by feature extraction that focuses on baseline deviations, letter connectivity, stroke thickness, and other anomalies.
Initial results demonstrate the ability of this RNN model to achieve state-of-art performance on combined dyslexia and dysgraphia detection.
arXiv Detail & Related papers (2024-05-12T10:10:13Z) - MLIP: Enhancing Medical Visual Representation with Divergence Encoder
and Knowledge-guided Contrastive Learning [48.97640824497327]
We propose a novel framework leveraging domain-specific medical knowledge as guiding signals to integrate language information into the visual domain through image-text contrastive learning.
Our model includes global contrastive learning with our designed divergence encoder, local token-knowledge-patch alignment contrastive learning, and knowledge-guided category-level contrastive learning with expert knowledge.
Notably, MLIP surpasses state-of-the-art methods even with limited annotated data, highlighting the potential of multimodal pre-training in advancing medical representation learning.
arXiv Detail & Related papers (2024-02-03T05:48:50Z) - Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics [9.168525887419388]
We evaluate six state-of-the-art ASR-based systems for automatically assessing Dutch oral reading accuracy using Kaldi and Whisper.
Results show our most successful system reached substantial agreement with human evaluations.
arXiv Detail & Related papers (2023-06-06T06:49:58Z) - Learning to Decompose Visual Features with Latent Textual Prompts [140.2117637223449]
We propose Decomposed Feature Prompting (DeFo) to improve vision-language models.
Our empirical study shows DeFo's significance in improving the vision-language models.
arXiv Detail & Related papers (2022-10-09T15:40:13Z) - Exploring linguistic feature and model combination for speech
recognition based automatic AD detection [61.91708957996086]
Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques.
Scarcity of specialist data leads to uncertainty in both model selection and feature learning when developing such systems.
This paper investigates the use of feature and model combination approaches to improve the robustness of domain fine-tuning of BERT and Roberta pre-trained text encoders.
arXiv Detail & Related papers (2022-06-28T05:09:01Z) - Lexically Aware Semi-Supervised Learning for OCR Post-Correction [90.54336622024299]
Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents.
Previous work has demonstrated the utility of neural post-correction methods on recognition of less-well-resourced languages.
We present a semi-supervised learning method that makes it possible to utilize raw images to improve performance.
arXiv Detail & Related papers (2021-11-04T04:39:02Z) - Morphologically Aware Word-Level Translation [82.59379608647147]
We propose a novel morphologically aware probability model for bilingual lexicon induction.
Our model exploits the basic linguistic intuition that the lexeme is the key lexical unit of meaning.
arXiv Detail & Related papers (2020-11-15T17:54:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.