BanglaNet: Bangla Handwritten Character Recognition using Ensembling of
Convolutional Neural Network
- URL: http://arxiv.org/abs/2401.08035v2
- Date: Sun, 4 Feb 2024 17:39:07 GMT
- Title: BanglaNet: Bangla Handwritten Character Recognition using Ensembling of
Convolutional Neural Network
- Authors: Chandrika Saha, Md Mostafijur Rahman
- Abstract summary: This paper presents a classification model based on the ensembling of several Convolutional Neural Networks (CNN)
Three different models based on the idea of state-of-the-art CNN models like Inception, ResNet, and DenseNet have been trained with both augmented and non-augmented inputs.
Rigorous experimentation on three benchmark Bangla handwritten characters datasets, namely, CMATERdb, BanglaLekha-Isolated, and Ekush has exhibited significant recognition accuracies.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Handwritten character recognition is a crucial task because of its abundant
applications. The recognition task of Bangla handwritten characters is
especially challenging because of the cursive nature of Bangla characters and
the presence of compound characters with more than one way of writing. In this
paper, a classification model based on the ensembling of several Convolutional
Neural Networks (CNN), namely, BanglaNet is proposed to classify Bangla basic
characters, compound characters, numerals, and modifiers. Three different
models based on the idea of state-of-the-art CNN models like Inception, ResNet,
and DenseNet have been trained with both augmented and non-augmented inputs.
Finally, all these models are averaged or ensembled to get the finishing model.
Rigorous experimentation on three benchmark Bangla handwritten characters
datasets, namely, CMATERdb, BanglaLekha-Isolated, and Ekush has exhibited
significant recognition accuracies compared to some recent CNN-based research.
The top-1 recognition accuracies obtained are 98.40%, 97.65%, and 97.32%, and
the top-3 accuracies are 99.79%, 99.74%, and 99.56% for CMATERdb,
BanglaLekha-Isolated, and Ekush datasets respectively.
Related papers
- Understanding writing style in social media with a supervised
contrastively pre-trained transformer [57.48690310135374]
Online Social Networks serve as fertile ground for harmful behavior, ranging from hate speech to the dissemination of disinformation.
We introduce the Style Transformer for Authorship Representations (STAR), trained on a large corpus derived from public sources of 4.5 x 106 authored texts.
Using a support base of 8 documents of 512 tokens, we can discern authors from sets of up to 1616 authors with at least 80% accuracy.
arXiv Detail & Related papers (2023-10-17T09:01:17Z) - Sampling and Ranking for Digital Ink Generation on a tight computational
budget [69.15275423815461]
We study ways to maximize the quality of the output of a trained digital ink generative model.
We use and compare the effect of multiple sampling and ranking techniques, in the first ablation study of its kind in the digital ink domain.
arXiv Detail & Related papers (2023-06-02T09:55:15Z) - Efficient approach of using CNN based pretrained model in Bangla
handwritten digit recognition [0.0]
Handwritten digit recognition is essential for numerous applications in various industries.
Due to the complexity of Bengali writing in terms of variety in shape, size, and writing style, researchers did not get better accuracy usingSupervised machine learning algorithms to date.
We propose a novel CNN-based pre-trained handwritten digit recognition model which includes Resnet-50, Inception-v3, and EfficientNetB0 on NumtaDB dataset of 17 thousand instances with 10 classes.
arXiv Detail & Related papers (2022-09-19T15:58:53Z) - Writer Recognition Using Off-line Handwritten Single Block Characters [59.17685450892182]
We use personal identity numbers consisting of the six digits of the date of birth, DoB.
We evaluate two recognition approaches, one based on handcrafted features that compute directional measurements, and another based on deep features from a ResNet50 model.
Results show the presence of identity-related information in a piece of handwritten information as small as six digits with the DoB.
arXiv Detail & Related papers (2022-01-25T23:04:10Z) - A Classical Approach to Handcrafted Feature Extraction Techniques for
Bangla Handwritten Digit Recognition [0.0]
We benchmarked four rigorous classifiers to recognize Bangla Handwritten Digit.
The recognition accuracy of the HOG+SVM method on the NumtaDB, CMARTdb, Ekush and BDRW datasets reached 93.32%, 98.08%, 95.68% and 89.68%, respectively.
arXiv Detail & Related papers (2022-01-25T05:27:57Z) - Bengali Handwritten Grapheme Classification: Deep Learning Approach [0.0]
We participate in a Kaggle competition citek_link where the challenge is to classify three constituent elements of a Bengali grapheme in the image.
We explore the performances of some existing neural network models such as Multi-Layer Perceptron (MLP) and state of the art ResNet50.
We propose our own convolution neural network (CNN) model for Bengali grapheme classification with validation root accuracy 95.32%, vowel accuracy 98.61%, and consonant accuracy 98.76%.
arXiv Detail & Related papers (2021-11-16T06:14:59Z) - Sentiment analysis in tweets: an assessment study from classical to
modern text representation models [59.107260266206445]
Short texts published on Twitter have earned significant attention as a rich source of information.
Their inherent characteristics, such as the informal, and noisy linguistic style, remain challenging to many natural language processing (NLP) tasks.
This study fulfils an assessment of existing language models in distinguishing the sentiment expressed in tweets by using a rich collection of 22 datasets.
arXiv Detail & Related papers (2021-05-29T21:05:28Z) - Skeleton Based Sign Language Recognition Using Whole-body Keypoints [71.97020373520922]
Sign language is used by deaf or speech impaired people to communicate.
Skeleton-based recognition is becoming popular that it can be further ensembled with RGB-D based method to achieve state-of-the-art performance.
Inspired by the recent development of whole-body pose estimation citejin 2020whole, we propose recognizing sign language based on the whole-body key points and features.
arXiv Detail & Related papers (2021-03-16T03:38:17Z) - Bangla Handwritten Digit Recognition and Generation [0.0]
A Semi-Supervised Generative Adversarial Network or SGAN has been applied to generate Bangla handwritten numerals.
In this paper, an architecture has been implemented which achieved the validation accuracy of 99.44% on BHAND dataset.
arXiv Detail & Related papers (2021-03-14T12:11:21Z) - Read Like Humans: Autonomous, Bidirectional and Iterative Language
Modeling for Scene Text Recognition [80.446770909975]
Linguistic knowledge is of great benefit to scene text recognition.
How to effectively model linguistic rules in end-to-end deep networks remains a research challenge.
We propose an autonomous, bidirectional and iterative ABINet for scene text recognition.
arXiv Detail & Related papers (2021-03-11T06:47:45Z) - MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten
Compound Characters [0.0]
MatrriVasha is the project which can recognize Bangla, handwritten several compound characters.
The proposed dataset is so far the most extensive dataset for Bangla compound characters.
arXiv Detail & Related papers (2020-04-29T06:38:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.