KOHTD: Kazakh Offline Handwritten Text Dataset
- URL: http://arxiv.org/abs/2110.04075v1
- Date: Wed, 22 Sep 2021 16:19:38 GMT
- Title: KOHTD: Kazakh Offline Handwritten Text Dataset
- Authors: Nazgul Toiganbayeva, Mahmoud Kasem, Galymzhan Abdimanap, Kairat
Bostanbekov, Abdelrahman Abdallah, Anel Alimova, Daniyar Nurseitov
- Abstract summary: We propose an extensive Kazakh offline Handwritten Text dataset (KOHTD)
KOHTD has 3000 handwritten exam papers and more than 140335 segmented images and there are approximately 922010 symbols.
We used a variety of popular text recognition methods for word and line recognition in our studies, including CTC-based and attention-based methods.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Despite the transition to digital information exchange, many documents, such
as invoices, taxes, memos and questionnaires, historical data, and answers to
exam questions, still require handwritten inputs. In this regard, there is a
need to implement Handwritten Text Recognition (HTR) which is an automatic way
to decrypt records using a computer. Handwriting recognition is challenging
because of the virtually infinite number of ways a person can write the same
message. For this proposal we introduce Kazakh handwritten text recognition
research, a comprehensive dataset of Kazakh handwritten texts is necessary.
This is particularly true given the lack of a dataset for handwritten Kazakh
text. In this paper, we proposed our extensive Kazakh offline Handwritten Text
dataset (KOHTD), which has 3000 handwritten exam papers and more than 140335
segmented images and there are approximately 922010 symbols. It can serve
researchers in the field of handwriting recognition tasks by using deep and
machine learning. We used a variety of popular text recognition methods for
word and line recognition in our studies, including CTC-based and
attention-based methods. The findings demonstrate KOHTD's diversity. Also, we
proposed a Genetic Algorithm (GA) for line and word segmentation based on
random enumeration of a parameter. The dataset and GA code are available at
https://github.com/abdoelsayed2016/KOHTD.
Related papers
- Bukva: Russian Sign Language Alphabet [75.42794328290088]
This paper investigates the recognition of the Russian fingerspelling alphabet, also known as the Russian Sign Language (RSL) dactyl.
Dactyl is a component of sign languages where distinct hand movements represent individual letters of a written language.
We provide Bukva, the first full-fledged open-source video dataset for RSL dactyl recognition.
arXiv Detail & Related papers (2024-10-11T09:59:48Z) - Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition [5.28595286827031]
The Manuscripts of Handwritten Arabic(Muharaf) dataset is a machine learning dataset consisting of more than 1,600 historic handwritten page images.
This dataset was compiled to advance the state of the art in handwritten text recognition.
arXiv Detail & Related papers (2024-06-13T23:40:34Z) - Khayyam Offline Persian Handwriting Dataset [0.0]
We present the Khayyam dataset as another large unconstrained handwriting dataset for elements (words, sentences, letters, digits) of the Persian language.
Khayyam's dataset contains 44000 words, 60000 letters, and 6000 digits.
To show the applicability of the dataset, machine learning algorithms are trained on the digits, letters, and word data and results are reported.
arXiv Detail & Related papers (2024-06-03T06:17:21Z) - Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering [50.52792174648067]
This initiative seeks to bridge the gap between textual and visual comprehension.
We propose a new multi-task Urdu scene text dataset comprising over 1000 natural scene images.
We provide fine-grained annotations for text instances, addressing the limitations of previous datasets.
arXiv Detail & Related papers (2024-05-21T06:48:26Z) - Handwritten and Printed Text Segmentation: A Signature Case Study [0.0]
We develop novel approaches to address the challenges of handwritten and printed text segmentation.
Our objective is to recover text from different classes in their entirety, especially enhancing the segmentation performance on overlapping sections.
Our best configuration outperforms prior work on two different datasets by 17.9% and 7.3% on IoU scores.
arXiv Detail & Related papers (2023-07-15T21:49:22Z) - UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for
Evaluating Unconstrained Handwriting Image Recognition in Vietnamese [2.8360662552057323]
In Vietnamese, besides the modern Latin characters, there are accent and letter marks together with characters that draw confusion to state-of-the-art handwriting recognition methods.
As a low-resource language, there are not many datasets for researching handwriting recognition in Vietnamese.
Recent works evaluated offline handwriting recognition methods in Vietnamese using images from an online handwriting dataset constructed by connecting pen stroke coordinates without further processing.
This paper proposes the Transferring method to construct a handwriting image dataset that associates crucial natural attributes required for offline handwriting images.
arXiv Detail & Related papers (2022-11-10T08:23:54Z) - Kurdish Handwritten Character Recognition using Deep Learning Techniques [26.23274417985375]
This paper attempts to design and develop a model that can recognize handwritten characters for Kurdish alphabets using deep learning techniques.
A comprehensive dataset was created for handwritten Kurdish characters, which contains more than 40 thousand images.
The tested results reported a 96% accuracy rate, and training accuracy reported a 97% accuracy rate.
arXiv Detail & Related papers (2022-10-18T16:48:28Z) - Comprehensive Benchmark Datasets for Amharic Scene Text Detection and
Recognition [56.048783994698425]
Ethiopic/Amharic script is one of the oldest African writing systems, which serves at least 23 languages in East Africa.
The Amharic writing system, Abugida, has 282 syllables, 15 punctuation marks, and 20 numerals.
We presented the first comprehensive public datasets named HUST-ART, HUST-AST, ABE, and Tana for Amharic script detection and recognition in the natural scene.
arXiv Detail & Related papers (2022-03-23T03:19:35Z) - Letter-level Online Writer Identification [86.13203975836556]
We focus on a novel problem, letter-level online writer-id, which requires only a few trajectories of written letters as identification cues.
A main challenge is that a person often writes a letter in different styles from time to time.
We refer to this problem as the variance of online writing styles (Var-O-Styles)
arXiv Detail & Related papers (2021-12-06T07:21:53Z) - SmartPatch: Improving Handwritten Word Imitation with Patch
Discriminators [67.54204685189255]
We propose SmartPatch, a new technique increasing the performance of current state-of-the-art methods.
We combine the well-known patch loss with information gathered from the parallel trained handwritten text recognition system.
This leads to a more enhanced local discriminator and results in more realistic and higher-quality generated handwritten words.
arXiv Detail & Related papers (2021-05-21T18:34:21Z) - TextScanner: Reading Characters in Order for Robust Scene Text
Recognition [60.04267660533966]
TextScanner is an alternative approach for scene text recognition.
It generates pixel-wise, multi-channel segmentation maps for character class, position and order.
It also adopts RNN for context modeling and performs paralleled prediction for character position and class.
arXiv Detail & Related papers (2019-12-28T07:52:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.