A Classifier Using Global Character Level and Local Sub-unit Level
Features for Hindi Online Handwritten Character Recognition
- URL: http://arxiv.org/abs/2310.17138v1
- Date: Thu, 26 Oct 2023 04:20:39 GMT
- Title: A Classifier Using Global Character Level and Local Sub-unit Level
Features for Hindi Online Handwritten Character Recognition
- Authors: Anand Sharma (MIET, Meerut), A. G. Ramakrishnan (IISc, Bengaluru)
- Abstract summary: A classifier is developed that defines a joint distribution of global character features, number of sub-units and local sub-unit features to model Hindi online handwritten characters.
The developed classifier has the highest accuracy of 93.5% on the testing set compared to that of the classifiers trained on different features extracted from the same training set.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: A classifier is developed that defines a joint distribution of global
character features, number of sub-units and local sub-unit features to model
Hindi online handwritten characters. The classifier uses latent variables to
model the structure of sub-units. The classifier uses histograms of points,
orientations, and dynamics of orientations (HPOD) features to represent
characters at global character level and local sub-unit level and is
independent of character stroke order and stroke direction variations. The
parameters of the classifier is estimated using maximum likelihood method.
Different classifiers and features used in other studies are considered in this
study for classification performance comparison with the developed classifier.
The classifiers considered are Second Order Statistics (SOS), Sub-space (SS),
Fisher Discriminant (FD), Feedforward Neural Network (FFN) and Support Vector
Machines (SVM) and the features considered are Spatio Temporal (ST), Discrete
Fourier Transform (DFT), Discrete Cosine Transform (SCT), Discrete Wavelet
Transform (DWT), Spatial (SP) and Histograms of Oriented Gradients (HOG). Hindi
character datasets used for training and testing the developed classifier
consist of samples of handwritten characters from 96 different character
classes. There are 12832 samples with an average of 133 samples per character
class in the training set and 2821 samples with an average of 29 samples per
character class in the testing set. The developed classifier has the highest
accuracy of 93.5\% on the testing set compared to that of the classifiers
trained on different features extracted from the same training set and
evaluated on the same testing set considered in this study.
Related papers
- Downstream-Pretext Domain Knowledge Traceback for Active Learning [138.02530777915362]
We propose a downstream-pretext domain knowledge traceback (DOKT) method that traces the data interactions of downstream knowledge and pre-training guidance.
DOKT consists of a traceback diversity indicator and a domain-based uncertainty estimator.
Experiments conducted on ten datasets show that our model outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2024-07-20T01:34:13Z) - Novel Deep Neural Network Classifier Characterization Metrics with Applications to Dataless Evaluation [1.6574413179773757]
In this work, we evaluate a Deep Neural Network (DNN) classifier's training quality without any example dataset.
Our empirical study of the proposed method for ResNet18, trained with CAFIR10 and CAFIR100 datasets, confirms that data-less evaluation of DNN classifiers is indeed possible.
arXiv Detail & Related papers (2024-07-17T20:40:46Z) - Structural analysis of Hindi online handwritten characters for character
recognition [0.0]
Direction properties of online strokes are used to analyze them in terms of homogeneous regions or sub-strokes.
These properties along with some geometrics are used to extract sub-units from Hindi online handwritten characters.
A method is developed to extract point stroke, clockwise curve stroke, counter-clockwise curve stroke and loop stroke segments as sub-units.
arXiv Detail & Related papers (2023-10-12T11:14:27Z) - Histograms of Points, Orientations, and Dynamics of Orientations
Features for Hindi Online Handwritten Character Recognition [0.0]
A set of features independent of character stroke direction and order variations is proposed for online handwritten character recognition.
The proposed features have better character discnative capability than the other features considered for comparison.
arXiv Detail & Related papers (2023-09-05T09:11:18Z) - Enhancing Pashto Text Classification using Language Processing
Techniques for Single And Multi-Label Analysis [0.0]
This study aims to establish an automated classification system for Pashto text.
The study achieved an average testing accuracy rate of 94%.
The use of pre-trained language representation models, such as DistilBERT, showed promising results.
arXiv Detail & Related papers (2023-05-04T23:11:31Z) - Explaining Cross-Domain Recognition with Interpretable Deep Classifier [100.63114424262234]
Interpretable Deep (IDC) learns the nearest source samples of a target sample as evidence upon which the classifier makes the decision.
Our IDC leads to a more explainable model with almost no accuracy degradation and effectively calibrates classification for optimum reject options.
arXiv Detail & Related papers (2022-11-15T15:58:56Z) - Detecting Handwritten Mathematical Terms with Sensor Based Data [71.84852429039881]
We propose a solution to the UbiComp 2021 Challenge by Stabilo in which handwritten mathematical terms are supposed to be automatically classified.
The input data set contains data of different writers, with label strings constructed from a total of 15 different possible characters.
arXiv Detail & Related papers (2021-09-12T19:33:34Z) - No Fear of Heterogeneity: Classifier Calibration for Federated Learning
with Non-IID Data [78.69828864672978]
A central challenge in training classification models in the real-world federated system is learning with non-IID data.
We propose a novel and simple algorithm called Virtual Representations (CCVR), which adjusts the classifier using virtual representations sampled from an approximated ssian mixture model.
Experimental results demonstrate that CCVR state-of-the-art performance on popular federated learning benchmarks including CIFAR-10, CIFAR-100, and CINIC-10.
arXiv Detail & Related papers (2021-06-09T12:02:29Z) - Latent Embedding Feedback and Discriminative Features for Zero-Shot
Classification [139.44681304276]
zero-shot learning aims to classify unseen categories for which no data is available during training.
Generative Adrial Networks synthesize unseen class features by leveraging class-specific semantic embeddings.
We propose to enforce semantic consistency at all stages of zero-shot learning: training, feature synthesis and classification.
arXiv Detail & Related papers (2020-03-17T17:34:16Z) - Learning Class Regularized Features for Action Recognition [68.90994813947405]
We introduce a novel method named Class Regularization that performs class-based regularization of layer activations.
We show that using Class Regularization blocks in state-of-the-art CNN architectures for action recognition leads to systematic improvement gains of 1.8%, 1.2% and 1.4% on the Kinetics, UCF-101 and HMDB-51 datasets, respectively.
arXiv Detail & Related papers (2020-02-07T07:27:49Z) - Text Complexity Classification Based on Linguistic Information:
Application to Intelligent Tutoring of ESL [0.0]
The goal of this work is to build a classifier that can identify text complexity within the context of teaching reading to English as a Second Language ( ESL) learners.
Using a corpus of 6171 texts, which had already been classified into three different levels of difficulty by ESL experts, different experiments were conducted with five machine learning algorithms.
The results showed that the adopted linguistic features provide a good overall classification performance.
arXiv Detail & Related papers (2020-01-07T02:42:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.