Text line extraction using fully convolutional network and energy
minimization
- URL: http://arxiv.org/abs/2101.07370v1
- Date: Mon, 18 Jan 2021 23:23:03 GMT
- Title: Text line extraction using fully convolutional network and energy
minimization
- Authors: Berat Kurar Barakat, Ahmad Droby, Reem Alaasam, Boraq Madi, Irina
Rabaev, Jihad El-Sana
- Abstract summary: This paper proposes to use a fully convolutional network for text line detection and energy minimization.
We evaluate the proposed method on VML-AHTE, VML-MOC, and Diva-HisDB datasets.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Text lines are important parts of handwritten document images and easier to
analyze by further applications. Despite recent progress in text line
detection, text line extraction from a handwritten document remains an unsolved
task. This paper proposes to use a fully convolutional network for text line
detection and energy minimization for text line extraction. Detected text lines
are represented by blob lines that strike through the text lines. These blob
lines assist an energy function for text line extraction. The detection stage
can locate arbitrarily oriented text lines. Furthermore, the extraction stage
is capable of finding out the pixels of text lines with various heights and
interline proximity independent of their orientations. Besides, it can finely
split the touching and overlapping text lines without an orientation
assumption. We evaluate the proposed method on VML-AHTE, VML-MOC, and
Diva-HisDB datasets. The VML-AHTE dataset contains overlapping, touching and
close text lines with rich diacritics. The VML-MOC dataset is very challenging
by its multiply oriented and skewed text lines. The Diva-HisDB dataset exhibits
distinct text line heights and touching text lines. The results demonstrate the
effectiveness of the method despite various types of challenges, yet using the
same parameters in all the experiments.
Related papers
- Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text [61.22649031769564]
We propose a novel framework, paraphrased text span detection (PTD)
PTD aims to identify paraphrased text spans within a text.
We construct a dedicated dataset, PASTED, for paraphrased text span detection.
arXiv Detail & Related papers (2024-05-21T11:22:27Z) - Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using
Diffusion Models [63.99110667987318]
We present DiffText, a pipeline that seamlessly blends foreground text with the background's intrinsic features.
With fewer text instances, our produced text images consistently surpass other synthetic data in aiding text detectors.
arXiv Detail & Related papers (2023-11-28T06:51:28Z) - TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision [61.186488081379]
We propose TextFormer, a query-based end-to-end text spotter with Transformer architecture.
TextFormer builds upon an image encoder and a text decoder to learn a joint semantic understanding for multi-task modeling.
It allows for mutual training and optimization of classification, segmentation, and recognition branches, resulting in deeper feature sharing.
arXiv Detail & Related papers (2023-06-06T03:37:41Z) - Contextual Text Block Detection towards Scene Text Understanding [85.40898487745272]
This paper presents contextual text detection, a new setup that detects contextual text blocks (CTBs) for better understanding of texts in scenes.
We formulate the new setup by a dual detection task which first detects integral text units and then groups them into a CTB.
To this end, we design a novel scene text clustering technique that treats integral text units as tokens and groups them (belonging to the same CTB) into an ordered token sequence.
arXiv Detail & Related papers (2022-07-26T14:59:25Z) - LineCounter: Learning Handwritten Text Line Segmentation by Counting [37.06878615666929]
Handwritten Text Line (HTLS) is a low-level but important task for document processing.
We propose a novel Line Counting formulation for HTLS -- that involves counting the number of text lines from the top at every pixel location.
This formulation helps learn an end-to-end HTLS solution that directly predicts per-pixel line number for a given document image.
arXiv Detail & Related papers (2021-05-24T14:42:54Z) - Unsupervised learning of text line segmentation by differentiating
coarse patterns [0.0]
We present an unsupervised deep learning method that embeds document image patches to a compact Euclidean space where distances correspond to a coarse text line pattern similarity.
Text line segmentation can be easily implemented using standard techniques with the embedded feature vectors.
We evaluate the method qualitatively and quantitatively on several variants of text line segmentation datasets to demonstrate its effectivity.
arXiv Detail & Related papers (2021-05-19T21:21:30Z) - Text Line Segmentation for Challenging Handwritten Document Images Using
Fully Convolutional Network [0.0]
This paper presents a method for text line segmentation of challenging historical manuscript images.
We rely on line masks that connect the components on the same text line.
FCN has been successfully used for text line segmentation of regular handwritten document images.
arXiv Detail & Related papers (2021-01-20T19:51:26Z) - BOTD: Bold Outline Text Detector [85.33700624095181]
We propose a new one-stage text detector, termed as Bold Outline Text Detector (BOTD)
BOTD is able to process the arbitrary-shaped text with low model complexity.
Experimental results on three real-world benchmarks show the state-of-the-art performance of BOTD.
arXiv Detail & Related papers (2020-11-30T11:54:14Z) - Rethinking Text Segmentation: A Novel Dataset and A Text-Specific
Refinement Approach [34.63444886780274]
Text segmentation is a prerequisite in real-world text-related tasks.
We introduce Text Refinement Network (TexRNet), a novel text segmentation approach.
TexRNet consistently improves text segmentation performance by nearly 2% compared to other state-of-the-art segmentation methods.
arXiv Detail & Related papers (2020-11-27T22:50:09Z) - Unsupervised deep learning for text line segmentation [0.0]
A common method is to train a deep learning network for embedding the document image into an image of blob lines that are tracing the text lines.
This paper presents an unsupervised embedding of document image patches without a need for annotations.
We show that the outliers do not harm the convergence and the network learns to discriminate the text lines from the spaces between text lines.
arXiv Detail & Related papers (2020-03-19T08:57:53Z) - Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting [49.768327669098674]
We propose an end-to-end trainable text spotting approach named Text Perceptron.
It first employs an efficient segmentation-based text detector that learns the latent text reading order and boundary information.
Then a novel Shape Transform Module (abbr. STM) is designed to transform the detected feature regions into regular morphologies.
arXiv Detail & Related papers (2020-02-17T08:07:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.