A Classical Approach to Handcrafted Feature Extraction Techniques for
Bangla Handwritten Digit Recognition
- URL: http://arxiv.org/abs/2201.10102v1
- Date: Tue, 25 Jan 2022 05:27:57 GMT
- Title: A Classical Approach to Handcrafted Feature Extraction Techniques for
Bangla Handwritten Digit Recognition
- Authors: Md. Ferdous Wahid, Md. Fahim Shahriar, Md. Shohanur Islam Sobuj
- Abstract summary: We benchmarked four rigorous classifiers to recognize Bangla Handwritten Digit.
The recognition accuracy of the HOG+SVM method on the NumtaDB, CMARTdb, Ekush and BDRW datasets reached 93.32%, 98.08%, 95.68% and 89.68%, respectively.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Bangla Handwritten Digit recognition is a significant step forward in the
development of Bangla OCR. However, intricate shape, structural likeness and
distinctive composition style of Bangla digits makes it relatively challenging
to distinguish. Thus, in this paper, we benchmarked four rigorous classifiers
to recognize Bangla Handwritten Digit: K-Nearest Neighbor (KNN), Support Vector
Machine (SVM), Random Forest (RF), and Gradient-Boosted Decision Trees (GBDT)
based on three handcrafted feature extraction techniques: Histogram of Oriented
Gradients (HOG), Local Binary Pattern (LBP), and Gabor filter on four publicly
available Bangla handwriting digits datasets: NumtaDB, CMARTdb, Ekush and BDRW.
Here, handcrafted feature extraction methods are used to extract features from
the dataset image, which are then utilized to train machine learning
classifiers to identify Bangla handwritten digits. We further fine-tuned the
hyperparameters of the classification algorithms in order to acquire the finest
Bangla handwritten digits recognition performance from these algorithms, and
among all the models we employed, the HOG features combined with SVM model
(HOG+SVM) attained the best performance metrics across all datasets. The
recognition accuracy of the HOG+SVM method on the NumtaDB, CMARTdb, Ekush and
BDRW datasets reached 93.32%, 98.08%, 95.68% and 89.68%, respectively as well
as we compared the model performance with recent state-of-art methods.
Related papers
- BanglaNet: Bangla Handwritten Character Recognition using Ensembling of
Convolutional Neural Network [0.0]
This paper presents a classification model based on the ensembling of several Convolutional Neural Networks (CNN)
Three different models based on the idea of state-of-the-art CNN models like Inception, ResNet, and DenseNet have been trained with both augmented and non-augmented inputs.
Rigorous experimentation on three benchmark Bangla handwritten characters datasets, namely, CMATERdb, BanglaLekha-Isolated, and Ekush has exhibited significant recognition accuracies.
arXiv Detail & Related papers (2024-01-16T01:08:19Z) - Text-to-3D with Classifier Score Distillation [80.14832887529259]
Classifier-free guidance is considered an auxiliary trick rather than the most essential.
We name this method Score Distillation (CSD), which can be interpreted as using an implicit classification model for generation.
We validate the effectiveness of CSD across a variety of text-to-3D tasks including shape generation, texture synthesis, and shape editing.
arXiv Detail & Related papers (2023-10-30T10:25:40Z) - BN-DRISHTI: Bangla Document Recognition through Instance-level
Segmentation of Handwritten Text Images [0.0]
This paper introduces a deep learning-based object detection framework (YOLO) with Hough and Affine transformation for skew correction.
We present an extended version of the BN-HTRd dataset comprising 786 full-page handwritten Bangla document images.
Evaluation on the test portion of our dataset resulted in an F-score of 99.97% for line and 98% for word segmentation.
arXiv Detail & Related papers (2023-05-31T04:08:57Z) - Advancing 3D finger knuckle recognition via deep feature learning [51.871256510747465]
Contactless 3D finger knuckle patterns have emerged as an effective biometric identifier due to its discriminativeness, visibility from a distance, and convenience.
Recent research has developed a deep feature collaboration network which simultaneously incorporates intermediate features from deep neural networks with multiple scales.
This paper advances this approach by investigating the possibility of learning a discriminative feature vector with the least possible dimension for representing 3D finger knuckle images.
arXiv Detail & Related papers (2023-01-07T20:55:16Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Gait Recognition in the Wild: A Large-scale Benchmark and NAS-based
Baseline [95.88825497452716]
Gait benchmarks empower the research community to train and evaluate high-performance gait recognition systems.
GREW is the first large-scale dataset for gait recognition in the wild.
SPOSGait is the first NAS-based gait recognition model.
arXiv Detail & Related papers (2022-05-05T14:57:39Z) - Lexically Aware Semi-Supervised Learning for OCR Post-Correction [90.54336622024299]
Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents.
Previous work has demonstrated the utility of neural post-correction methods on recognition of less-well-resourced languages.
We present a semi-supervised learning method that makes it possible to utilize raw images to improve performance.
arXiv Detail & Related papers (2021-11-04T04:39:02Z) - Deep Hough Voting for Robust Global Registration [52.40611370293272]
We present an efficient framework for pairwise registration of real-world 3D scans, leveraging Hough voting in the 6D transformation parameter space.
Our method outperforms state-of-the-art methods on 3DMatch and 3DLoMatch benchmarks while achieving comparable performance on KITTI odometry dataset.
arXiv Detail & Related papers (2021-09-09T14:38:06Z) - End-to-End Approach for Recognition of Historical Digit Strings [2.0754848504005583]
We propose an end-to-end segmentation-free deep learning approach to handle challenging ancient handwriting style of dates present in the ARDIS dataset (4-digits long strings)
We show that with slight modifications in the VGG-16 deep model, the framework can achieve a recognition rate of 93.2%, resulting in a feasible solution free of methods, segmentation, and fusion methods.
arXiv Detail & Related papers (2021-04-28T09:39:29Z) - A Skip-connected Multi-column Network for Isolated Handwritten Bangla
Character and Digit recognition [12.551285203114723]
We have proposed a non-explicit feature extraction method using a multi-scale multi-column skip convolutional neural network.
Our method is evaluated on four publicly available datasets of isolated handwritten Bangla characters and digits.
arXiv Detail & Related papers (2020-04-27T13:18:58Z) - Sparse Concept Coded Tetrolet Transform for Unconstrained Odia Character
Recognition [0.0]
We propose a new image representation approach for unconstrained alphanumeric characters using sparse concept coded Tetrolets.
The proposed OCR system is shown to perform better than other sparse based techniques such as PCA, SparsePCA and Slantlet.
arXiv Detail & Related papers (2020-04-03T13:20:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.