Bengali Handwritten Grapheme Classification: Deep Learning Approach
- URL: http://arxiv.org/abs/2111.08249v1
- Date: Tue, 16 Nov 2021 06:14:59 GMT
- Title: Bengali Handwritten Grapheme Classification: Deep Learning Approach
- Authors: Tarun Roy, Hasib Hasan, Kowsar Hossain, Masuma Akter Rumi
- Abstract summary: We participate in a Kaggle competition citek_link where the challenge is to classify three constituent elements of a Bengali grapheme in the image.
We explore the performances of some existing neural network models such as Multi-Layer Perceptron (MLP) and state of the art ResNet50.
We propose our own convolution neural network (CNN) model for Bengali grapheme classification with validation root accuracy 95.32%, vowel accuracy 98.61%, and consonant accuracy 98.76%.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Despite being one of the most spoken languages in the world ($6^{th}$ based
on population), research regarding Bengali handwritten grapheme (smallest
functional unit of a writing system) classification has not been explored
widely compared to other prominent languages. Moreover, the large number of
combinations of graphemes in the Bengali language makes this classification
task very challenging. With an effort to contribute to this research problem,
we participate in a Kaggle competition \cite{kaggle_link} where the challenge
is to separately classify three constituent elements of a Bengali grapheme in
the image: grapheme root, vowel diacritics, and consonant diacritics. We
explore the performances of some existing neural network models such as
Multi-Layer Perceptron (MLP) and state of the art ResNet50. To further improve
the performance we propose our own convolution neural network (CNN) model for
Bengali grapheme classification with validation root accuracy 95.32\%, vowel
accuracy 98.61\%, and consonant accuracy 98.76\%. We also explore Region
Proposal Network (RPN) using VGGNet with a limited setting that can be a
potential future direction to improve the performance.
Related papers
- Multichannel Attention Networks with Ensembled Transfer Learning to Recognize Bangla Handwritten Charecter [1.5236380958983642]
The study employed a convolutional neural network (CNN) with ensemble transfer learning and a multichannel attention network.
We evaluated the proposed model using the CAMTERdb 3.1.2 data set and achieved 92% accuracy for the raw dataset and 98.00% for the preprocessed dataset.
arXiv Detail & Related papers (2024-08-20T15:51:01Z) - FewShotTextGCN: K-hop neighborhood regularization for few-shot learning
on graphs [25.88292419660444]
FewShotTextGCN is designed to effectively utilize the properties of word-document graphs for improved learning in low-resource settings.
We introduce K-hop Neighbourhood Regularization, a regularizer for heterogeneous graphs, and show that it stabilizes and improves learning when only a few training samples are available.
arXiv Detail & Related papers (2023-01-25T09:30:32Z) - ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for
Scene Text Spotting [121.11880210592497]
We argue that the limited capacity of language models comes from 1) implicit language modeling; 2) unidirectional feature representation; and 3) language model with noise input.
We propose an autonomous, bidirectional and iterative ABINet++ for scene text spotting.
arXiv Detail & Related papers (2022-11-19T03:50:33Z) - Efficient approach of using CNN based pretrained model in Bangla
handwritten digit recognition [0.0]
Handwritten digit recognition is essential for numerous applications in various industries.
Due to the complexity of Bengali writing in terms of variety in shape, size, and writing style, researchers did not get better accuracy usingSupervised machine learning algorithms to date.
We propose a novel CNN-based pre-trained handwritten digit recognition model which includes Resnet-50, Inception-v3, and EfficientNetB0 on NumtaDB dataset of 17 thousand instances with 10 classes.
arXiv Detail & Related papers (2022-09-19T15:58:53Z) - Harnessing Cross-lingual Features to Improve Cognate Detection for
Low-resource Languages [50.82410844837726]
We demonstrate the use of cross-lingual word embeddings for detecting cognates among fourteen Indian languages.
We evaluate our methods to detect cognates on a challenging dataset of twelve Indian languages.
We observe an improvement of up to 18% points, in terms of F-score, for cognate detection.
arXiv Detail & Related papers (2021-12-16T11:17:58Z) - Towards Zero-shot Language Modeling [90.80124496312274]
We construct a neural model that is inductively biased towards learning human languages.
We infer this distribution from a sample of typologically diverse training languages.
We harness additional language-specific side information as distant supervision for held-out languages.
arXiv Detail & Related papers (2021-08-06T23:49:18Z) - Graph Neural Networks for Natural Language Processing: A Survey [64.36633422999905]
We present a comprehensive overview onGraph Neural Networks (GNNs) for Natural Language Processing.
We propose a new taxonomy of GNNs for NLP, which organizes existing research of GNNs for NLP along three axes: graph construction,graph representation learning, and graph based encoder-decoder models.
arXiv Detail & Related papers (2021-06-10T23:59:26Z) - Graph Convolutional Network for Swahili News Classification [78.6363825307044]
This work empirically demonstrates the ability of Text Graph Convolutional Network (Text GCN) to outperform traditional natural language processing benchmarks for the task of semi-supervised Swahili news classification.
arXiv Detail & Related papers (2021-03-16T21:03:47Z) - Read Like Humans: Autonomous, Bidirectional and Iterative Language
Modeling for Scene Text Recognition [80.446770909975]
Linguistic knowledge is of great benefit to scene text recognition.
How to effectively model linguistic rules in end-to-end deep networks remains a research challenge.
We propose an autonomous, bidirectional and iterative ABINet for scene text recognition.
arXiv Detail & Related papers (2021-03-11T06:47:45Z) - An Attention Ensemble Approach for Efficient Text Classification of
Indian Languages [0.0]
This paper focuses on the coarse-grained technical domain identification of short text documents in Marathi, a Devanagari script-based Indian language.
A hybrid CNN-BiLSTM attention ensemble model is proposed that competently combines the intermediate sentence representations generated by the convolutional neural network and the bidirectional long short-term memory, leading to efficient text classification.
Experimental results show that the proposed model outperforms various baseline machine learning and deep learning models in the given task, giving the best validation accuracy of 89.57% and f1-score of 0.8875.
arXiv Detail & Related papers (2021-02-20T07:31:38Z) - A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes [1.009810782568186]
We propose a labeling scheme that makes segmentation in-side alpha-syllabary words linear.
The dataset contains 411k curated samples of 1295 unique commonly used Bengali graphemes.
The dataset is open-sourced as a part of a public Handwritten Grapheme Classification Challenge on Kaggle.
arXiv Detail & Related papers (2020-10-01T01:51:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.