BN-HTRd: A Benchmark Dataset for Document Level Offline Bangla
Handwritten Text Recognition (HTR) and Line Segmentation
- URL: http://arxiv.org/abs/2206.08977v1
- Date: Sun, 29 May 2022 22:56:26 GMT
- Title: BN-HTRd: A Benchmark Dataset for Document Level Offline Bangla
Handwritten Text Recognition (HTR) and Line Segmentation
- Authors: Md. Ataur Rahman, Nazifa Tabassum, Mitu Paul, Riya Pal, Mohammad
Khairul Islam
- Abstract summary: We introduce a new dataset for offline Handwritten Text Recognition (HTR) from images of Bangla scripts comprising words, lines, and document-level annotations.
The BN-HTRd dataset is based on the BBC Bangla News corpus, meant to act as ground truth texts.
Our dataset includes 788 images of handwritten pages produced by approximately 150 different writers.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce a new dataset for offline Handwritten Text Recognition (HTR)
from images of Bangla scripts comprising words, lines, and document-level
annotations. The BN-HTRd dataset is based on the BBC Bangla News corpus, meant
to act as ground truth texts. These texts were subsequently used to generate
the annotations that were filled out by people with their handwriting. Our
dataset includes 788 images of handwritten pages produced by approximately 150
different writers. It can be adopted as a basis for various handwriting
classification tasks such as end-to-end document recognition, word-spotting,
word or line segmentation, and so on. We also propose a scheme to segment
Bangla handwritten document images into corresponding lines in an unsupervised
manner. Our line segmentation approach takes care of the variability involved
in different writing styles, accurately segmenting complex handwritten text
lines of curvilinear nature. Along with a bunch of pre-processing and
morphological operations, both Hough line and circle transforms were employed
to distinguish different linear components. In order to arrange those
components into their corresponding lines, we followed an unsupervised
clustering approach. The average success rate of our segmentation technique is
81.57% in terms of FM metrics (similar to F-measure) with a mean Average
Precision (mAP) of 0.547.
Related papers
- Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis [52.01356859448068]
HTS can recognize text in an image and identify its 4-level hierarchical structure: characters, words, lines, and paragraphs.
HTS achieves state-of-the-art results on multiple word-level text spotting benchmark datasets as well as geometric layout analysis tasks.
arXiv Detail & Related papers (2023-10-25T22:23:54Z) - Copy Is All You Need [66.00852205068327]
We formulate text generation as progressively copying text segments from an existing text collection.
Our approach achieves better generation quality according to both automatic and human evaluations.
Our approach attains additional performance gains by simply scaling up to larger text collections.
arXiv Detail & Related papers (2023-07-13T05:03:26Z) - BN-DRISHTI: Bangla Document Recognition through Instance-level
Segmentation of Handwritten Text Images [0.0]
This paper introduces a deep learning-based object detection framework (YOLO) with Hough and Affine transformation for skew correction.
We present an extended version of the BN-HTRd dataset comprising 786 full-page handwritten Bangla document images.
Evaluation on the test portion of our dataset resulted in an F-score of 99.97% for line and 98% for word segmentation.
arXiv Detail & Related papers (2023-05-31T04:08:57Z) - Letter-level Online Writer Identification [86.13203975836556]
We focus on a novel problem, letter-level online writer-id, which requires only a few trajectories of written letters as identification cues.
A main challenge is that a person often writes a letter in different styles from time to time.
We refer to this problem as the variance of online writing styles (Var-O-Styles)
arXiv Detail & Related papers (2021-12-06T07:21:53Z) - Line Segmentation from Unconstrained Handwritten Text Images using
Adaptive Approach [10.436029791699777]
Line segmentation from handwritten text images is a challenging task due to diversity and unknown variations.
An adaptive approach is used for the line segmentation from handwritten text images merging the alignment of connected component coordinates and text height.
The proposed scheme is tested on two different type of datasets; document pages having base lines and plain pages.
arXiv Detail & Related papers (2021-04-18T08:52:52Z) - SOLD2: Self-supervised Occlusion-aware Line Description and Detection [95.8719432775724]
We introduce the first joint detection and description of line segments in a single deep network.
Our method does not require any annotated line labels and can therefore generalize to any dataset.
We evaluate our approach against previous line detection and description methods on several multi-view datasets.
arXiv Detail & Related papers (2021-04-07T19:27:17Z) - Text Line Segmentation for Challenging Handwritten Document Images Using
Fully Convolutional Network [0.0]
This paper presents a method for text line segmentation of challenging historical manuscript images.
We rely on line masks that connect the components on the same text line.
FCN has been successfully used for text line segmentation of regular handwritten document images.
arXiv Detail & Related papers (2021-01-20T19:51:26Z) - VML-MOC: Segmenting a multiply oriented and curved handwritten text
lines dataset [0.8399688944263843]
We evaluate a multi-oriented Gaussian based method to segment these handwritten text lines that are skewed or curved in any orientation.
The results are compared with the results of a single-oriented Gaussian based text line segmentation method.
arXiv Detail & Related papers (2021-01-19T10:10:45Z) - Unsupervised deep learning for text line segmentation [0.0]
A common method is to train a deep learning network for embedding the document image into an image of blob lines that are tracing the text lines.
This paper presents an unsupervised embedding of document image patches without a need for annotations.
We show that the outliers do not harm the convergence and the network learns to discriminate the text lines from the spaces between text lines.
arXiv Detail & Related papers (2020-03-19T08:57:53Z) - Learning to Select Bi-Aspect Information for Document-Scale Text Content
Manipulation [50.01708049531156]
We focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer.
In detail, the input is a set of structured records and a reference text for describing another recordset.
The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference.
arXiv Detail & Related papers (2020-02-24T12:52:10Z) - TextScanner: Reading Characters in Order for Robust Scene Text
Recognition [60.04267660533966]
TextScanner is an alternative approach for scene text recognition.
It generates pixel-wise, multi-channel segmentation maps for character class, position and order.
It also adopts RNN for context modeling and performs paralleled prediction for character position and class.
arXiv Detail & Related papers (2019-12-28T07:52:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.