Efficient approach of using CNN based pretrained model in Bangla
handwritten digit recognition
- URL: http://arxiv.org/abs/2209.13005v1
- Date: Mon, 19 Sep 2022 15:58:53 GMT
- Title: Efficient approach of using CNN based pretrained model in Bangla
handwritten digit recognition
- Authors: Muntarin Islam, Shabbir Ahmed Shuvo, Musarrat Saberin Nipun, Rejwan
Bin Sulaiman, Jannatul Nayeem, Zubaer Haque, Md Mostak Shaikh, Md Sakib Ullah
Sourav
- Abstract summary: Handwritten digit recognition is essential for numerous applications in various industries.
Due to the complexity of Bengali writing in terms of variety in shape, size, and writing style, researchers did not get better accuracy usingSupervised machine learning algorithms to date.
We propose a novel CNN-based pre-trained handwritten digit recognition model which includes Resnet-50, Inception-v3, and EfficientNetB0 on NumtaDB dataset of 17 thousand instances with 10 classes.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Due to digitalization in everyday life, the need for automatically
recognizing handwritten digits is increasing. Handwritten digit recognition is
essential for numerous applications in various industries. Bengali ranks the
fifth largest language in the world with 265 million speakers (Native and
non-native combined) and 4 percent of the world population speaks Bengali. Due
to the complexity of Bengali writing in terms of variety in shape, size, and
writing style, researchers did not get better accuracy using Supervised machine
learning algorithms to date. Moreover, fewer studies have been done on Bangla
handwritten digit recognition (BHwDR). In this paper, we proposed a novel
CNN-based pre-trained handwritten digit recognition model which includes
Resnet-50, Inception-v3, and EfficientNetB0 on NumtaDB dataset of 17 thousand
instances with 10 classes.. The Result outperformed the performance of other
models to date with 97% accuracy in the 10-digit classes. Furthermore, we have
evaluated the result or our model with other research studies while suggesting
future study
Related papers
- Multichannel Attention Networks with Ensembled Transfer Learning to Recognize Bangla Handwritten Charecter [1.5236380958983642]
The study employed a convolutional neural network (CNN) with ensemble transfer learning and a multichannel attention network.
We evaluated the proposed model using the CAMTERdb 3.1.2 data set and achieved 92% accuracy for the raw dataset and 98.00% for the preprocessed dataset.
arXiv Detail & Related papers (2024-08-20T15:51:01Z) - DocReLM: Mastering Document Retrieval with Language Model [49.847369507694154]
We demonstrate that by utilizing large language models, a document retrieval system can achieve advanced semantic understanding capabilities.
Our approach involves training the retriever and reranker using domain-specific data generated by large language models.
We use a test set annotated by academic researchers in the fields of quantum physics and computer vision to evaluate our system's performance.
arXiv Detail & Related papers (2024-05-19T06:30:22Z) - Sampling and Ranking for Digital Ink Generation on a tight computational
budget [69.15275423815461]
We study ways to maximize the quality of the output of a trained digital ink generative model.
We use and compare the effect of multiple sampling and ranking techniques, in the first ablation study of its kind in the digital ink domain.
arXiv Detail & Related papers (2023-06-02T09:55:15Z) - Bengali Handwritten Digit Recognition using CNN with Explainable AI [0.5156484100374058]
We have used various machine learning algorithms and CNN to recognize handwritten Bengali digits.
Grad-CAM was used as an XAI method on our CNN model, which gave us insights into the model.
arXiv Detail & Related papers (2022-12-23T04:40:20Z) - Writer Recognition Using Off-line Handwritten Single Block Characters [59.17685450892182]
We use personal identity numbers consisting of the six digits of the date of birth, DoB.
We evaluate two recognition approaches, one based on handcrafted features that compute directional measurements, and another based on deep features from a ResNet50 model.
Results show the presence of identity-related information in a piece of handwritten information as small as six digits with the DoB.
arXiv Detail & Related papers (2022-01-25T23:04:10Z) - Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot
Sentiment Classification [78.120927891455]
State-of-the-art brain-to-text systems have achieved great success in decoding language directly from brain signals using neural networks.
In this paper, we extend the problem to open vocabulary Electroencephalography(EEG)-To-Text Sequence-To-Sequence decoding and zero-shot sentence sentiment classification on natural reading tasks.
Our model achieves a 40.1% BLEU-1 score on EEG-To-Text decoding and a 55.6% F1 score on zero-shot EEG-based ternary sentiment classification, which significantly outperforms supervised baselines.
arXiv Detail & Related papers (2021-12-05T21:57:22Z) - Bengali Handwritten Grapheme Classification: Deep Learning Approach [0.0]
We participate in a Kaggle competition citek_link where the challenge is to classify three constituent elements of a Bengali grapheme in the image.
We explore the performances of some existing neural network models such as Multi-Layer Perceptron (MLP) and state of the art ResNet50.
We propose our own convolution neural network (CNN) model for Bengali grapheme classification with validation root accuracy 95.32%, vowel accuracy 98.61%, and consonant accuracy 98.76%.
arXiv Detail & Related papers (2021-11-16T06:14:59Z) - Lexically Aware Semi-Supervised Learning for OCR Post-Correction [90.54336622024299]
Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents.
Previous work has demonstrated the utility of neural post-correction methods on recognition of less-well-resourced languages.
We present a semi-supervised learning method that makes it possible to utilize raw images to improve performance.
arXiv Detail & Related papers (2021-11-04T04:39:02Z) - Bangla Handwritten Digit Recognition and Generation [0.0]
A Semi-Supervised Generative Adversarial Network or SGAN has been applied to generate Bangla handwritten numerals.
In this paper, an architecture has been implemented which achieved the validation accuracy of 99.44% on BHAND dataset.
arXiv Detail & Related papers (2021-03-14T12:11:21Z) - AKHCRNet: Bengali Handwritten Character Recognition Using Deep Learning [0.228438857884398]
I propose a state of the art deep neural architectural solution for handwritten character recognition for Bengali alphabets, compound characters, and numerical digits.
I propose an HCR network that is trained from the scratch on Bengali characters without the "Ensemble Learning" that can outperform previous architectures.
arXiv Detail & Related papers (2020-08-29T15:22:00Z) - SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological
Inflection [81.85463892070085]
The SIGMORPHON 2020 task on morphological reinflection aims to investigate systems' ability to generalize across typologically distinct languages.
Systems were developed using data from 45 languages and just 5 language families, fine-tuned with data from an additional 45 languages and 10 language families (13 in total), and evaluated on all 90 languages.
arXiv Detail & Related papers (2020-06-20T13:24:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.