Urdu Handwritten Text Recognition Using ResNet18
- URL: http://arxiv.org/abs/2103.05105v1
- Date: Fri, 19 Feb 2021 17:55:57 GMT
- Title: Urdu Handwritten Text Recognition Using ResNet18
- Authors: Muhammad Kashif
- Abstract summary: We propose a ResNet18 model for handwritten text recognition using Urdu Nastaliq Handwritten dataset (UNHD) which contains 3,12000 words written by 500 candidates.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Handwritten text recognition is an active research area in the field of deep
learning and artificial intelligence to convert handwritten text into
machine-understandable. A lot of work has been done for other languages,
especially for English, but work for the Urdu language is very minimal due to
the cursive nature of Urdu characters. The need for Urdu HCR systems is
increasing because of the advancement of technology. In this paper, we propose
a ResNet18 model for handwritten text recognition using Urdu Nastaliq
Handwritten Dataset (UNHD) which contains 3,12000 words written by 500
candidates.
Related papers
- BeHGAN: Bengali Handwritten Word Generation from Plain Text Using Generative Adversarial Networks [0.2446672595462589]
We develop and use a self-collected dataset of Bengali handwriting samples.<n>The dataset includes contributions from approximately five hundred individuals across different ages and genders.<n>Our approach demonstrates the ability to produce diverse handwritten outputs from input plain text.
arXiv Detail & Related papers (2025-12-25T14:38:12Z) - Devanagari Handwritten Character Recognition using Convolutional Neural Network [0.0]
The Devanagari script is one of the oldest language scripts in India that does not have proper digitization tools.<n>We present a technique to recognize handwritten Devanagari characters using two deep convolutional neural network layers.<n>This approach achieves 96.36% accuracy in testing and 99.55% in training time.
arXiv Detail & Related papers (2025-07-14T15:38:42Z) - Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering [50.52792174648067]
This initiative seeks to bridge the gap between textual and visual comprehension.
We propose a new multi-task Urdu scene text dataset comprising over 1000 natural scene images.
We provide fine-grained annotations for text instances, addressing the limitations of previous datasets.
arXiv Detail & Related papers (2024-05-21T06:48:26Z) - The First Swahili Language Scene Text Detection and Recognition Dataset [55.83178123785643]
There is a significant gap in low-resource languages, especially the Swahili Language.
Swahili is widely spoken in East African countries but is still an under-explored language in scene text recognition.
We propose a comprehensive dataset of Swahili scene text images and evaluate the dataset on different scene text detection and recognition models.
arXiv Detail & Related papers (2024-05-19T03:55:02Z) - NusaWrites: Constructing High-Quality Corpora for Underrepresented and
Extremely Low-Resource Languages [54.808217147579036]
We conduct a case study on Indonesian local languages.
We compare the effectiveness of online scraping, human translation, and paragraph writing by native speakers in constructing datasets.
Our findings demonstrate that datasets generated through paragraph writing by native speakers exhibit superior quality in terms of lexical diversity and cultural content.
arXiv Detail & Related papers (2023-09-19T14:42:33Z) - ASR2K: Speech Recognition for Around 2000 Languages without Audio [100.41158814934802]
We present a speech recognition pipeline that does not require any audio for the target language.
Our pipeline consists of three components: acoustic, pronunciation, and language models.
We build speech recognition for 1909 languages by combining it with Crubadan: a large endangered languages n-gram database.
arXiv Detail & Related papers (2022-09-06T22:48:29Z) - Transformer based Urdu Handwritten Text Optical Character Reader [0.0]
Urdu language script is very difficult because of its cursive nature and change of shape of characters based on it's relative position.
A need arises to propose a model which can understand complex features and generalize it for every kind of handwriting style.
In this work, we propose a transformer based Urdu Handwritten text extraction model.
arXiv Detail & Related papers (2022-06-09T15:43:35Z) - CALText: Contextual Attention Localization for Offline Handwritten Text [1.066048003460524]
We present an attention based encoder-decoder model that learns to read Urdu in context.
A novel localization penalty is introduced to encourage the model to attend only one location at a time when recognizing the next character.
We evaluate the model on both Urdu and Arabic datasets and show that contextual attention localization outperforms both simple attention and multi-directional LSTM models.
arXiv Detail & Related papers (2021-11-06T19:54:21Z) - KOHTD: Kazakh Offline Handwritten Text Dataset [0.0]
We propose an extensive Kazakh offline Handwritten Text dataset (KOHTD)
KOHTD has 3000 handwritten exam papers and more than 140335 segmented images and there are approximately 922010 symbols.
We used a variety of popular text recognition methods for word and line recognition in our studies, including CTC-based and attention-based methods.
arXiv Detail & Related papers (2021-09-22T16:19:38Z) - SmartPatch: Improving Handwritten Word Imitation with Patch
Discriminators [67.54204685189255]
We propose SmartPatch, a new technique increasing the performance of current state-of-the-art methods.
We combine the well-known patch loss with information gathered from the parallel trained handwritten text recognition system.
This leads to a more enhanced local discriminator and results in more realistic and higher-quality generated handwritten words.
arXiv Detail & Related papers (2021-05-21T18:34:21Z) - Co-occurrences using Fasttext embeddings for word similarity tasks in
Urdu [0.0]
This paper builds a corpus for Urdu by scraping and integrating data from various sources.
We modify fasttext embeddings and N-Grams models to enable training them on our built corpus.
We have used these trained embeddings for a word similarity task and compared the results with existing techniques.
arXiv Detail & Related papers (2021-02-22T12:56:26Z) - A Survey of Knowledge-Enhanced Text Generation [81.24633231919137]
The goal of text generation is to make machines express in human language.
Various neural encoder-decoder models have been proposed to achieve the goal by learning to map input text to output text.
To address this issue, researchers have considered incorporating various forms of knowledge beyond the input text into the generation models.
arXiv Detail & Related papers (2020-10-09T06:46:46Z) - Large Scale Font Independent Urdu Text Recognition System [1.5229257192293197]
There exists no automated system that can reliably recognize printed Urdu text in images and videos across different fonts.
We have developed Qaida, a large scale data set with 256 fonts, and a complete Urdu lexicon.
We have also developed a Convolutional Neural Network (CNN) based classification model which can recognize Urdu ligatures with 84.2% accuracy.
arXiv Detail & Related papers (2020-05-14T06:57:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.