Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection
- URL: http://arxiv.org/abs/2107.12664v2
- Date: Wed, 28 Jul 2021 09:19:10 GMT
- Title: Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection
- Authors: Shi-Xue Zhang, Xiaobin Zhu, Chun Yang, Hongfa Wang, Xu-Cheng Yin
- Abstract summary: We propose a novel adaptive boundary proposal network for arbitrary shape text detection.
Our method can learn to directly produce accurate boundary for arbitrary shape text without any post-processing.
- Score: 18.491440228386313
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Arbitrary shape text detection is a challenging task due to the high
complexity and variety of scene texts. In this work, we propose a novel
adaptive boundary proposal network for arbitrary shape text detection, which
can learn to directly produce accurate boundary for arbitrary shape text
without any post-processing. Our method mainly consists of a boundary proposal
model and an innovative adaptive boundary deformation model. The boundary
proposal model constructed by multi-layer dilated convolutions is adopted to
produce prior information (including classification map, distance field, and
direction field) and coarse boundary proposals. The adaptive boundary
deformation model is an encoder-decoder network, in which the encoder mainly
consists of a Graph Convolutional Network (GCN) and a Recurrent Neural Network
(RNN). It aims to perform boundary deformation in an iterative way for
obtaining text instance shape guided by prior information from the boundary
proposal model. In this way, our method can directly and efficiently generate
accurate text boundaries without complex post-processing. Extensive experiments
on publicly available datasets demonstrate the state-of-the-art performance of
our method.
Related papers
- BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text
Detection [19.574306663095243]
We propose a novel arbitrary shape scene text detector through boundary points dynamic optimization(BPDO)
Model is designed with a text aware module (TAM) and a boundary point dynamic optimization module (DOM)
Experiments on CTW-1500, Total-Text, and MSRA-TD500 datasets show that the model proposed in this paper achieves a performance better than or comparable to the state-of-the-art algorithm.
arXiv Detail & Related papers (2024-01-18T14:13:46Z) - RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline
Model and DoF-based Curriculum Learning [62.86400614141706]
We propose a new learning model, i.e., Rectangling Rectification Network (RecRecNet)
Our model can flexibly warp the source structure to the target domain and achieves an end-to-end unsupervised deformation.
Experiments show the superiority of our solution over the compared methods on both quantitative and qualitative evaluations.
arXiv Detail & Related papers (2023-01-04T15:12:57Z) - Reduced Representation of Deformation Fields for Effective Non-rigid
Shape Matching [26.77241999731105]
We present a novel approach for computing correspondences between non-rigid objects by exploiting a reduced representation of deformation fields.
By letting the network learn deformation parameters at a sparse set of positions in space (nodes), we reconstruct the continuous deformation field in a closed-form with guaranteed smoothness.
Our model has high expressive power and is able to capture complex deformations.
arXiv Detail & Related papers (2022-11-26T16:11:17Z) - SpaText: Spatio-Textual Representation for Controllable Image Generation [61.89548017729586]
SpaText is a new method for text-to-image generation using open-vocabulary scene control.
In addition to a global text prompt that describes the entire scene, the user provides a segmentation map.
We show its effectiveness on two state-of-the-art diffusion models: pixel-based and latent-conditional-based.
arXiv Detail & Related papers (2022-11-25T18:59:10Z) - Arbitrary Shape Text Detection via Boundary Transformer [18.229219867056347]
We present a unified coarse-to-fine framework via boundary learning for arbitrary shape text detection.
We explicitly model the text boundary via an innovative iterative boundary transformer in a coarse-to-fine manner.
Our method can directly gain accurate text boundaries and abandon complex post-processing to improve efficiency.
arXiv Detail & Related papers (2022-05-11T07:59:13Z) - Real-Time Scene Text Detection with Differentiable Binarization and
Adaptive Scale Fusion [62.269219152425556]
segmentation-based scene text detection methods have drawn extensive attention in the scene text detection field.
We propose a Differentiable Binarization (DB) module that integrates the binarization process into a segmentation network.
An efficient Adaptive Scale Fusion (ASF) module is proposed to improve the scale robustness by fusing features of different scales adaptively.
arXiv Detail & Related papers (2022-02-21T15:30:14Z) - Deep Magnification-Flexible Upsampling over 3D Point Clouds [103.09504572409449]
We propose a novel end-to-end learning-based framework to generate dense point clouds.
We first formulate the problem explicitly, which boils down to determining the weights and high-order approximation errors.
Then, we design a lightweight neural network to adaptively learn unified and sorted weights as well as the high-order refinements.
arXiv Detail & Related papers (2020-11-25T14:00:18Z) - Neural Subdivision [58.97214948753937]
This paper introduces Neural Subdivision, a novel framework for data-driven coarseto-fine geometry modeling.
We optimize for the same set of network weights across all local mesh patches, thus providing an architecture that is not constrained to a specific input mesh, fixed genus, or category.
We demonstrate that even when trained on a single high-resolution mesh our method generates reasonable subdivisions for novel shapes.
arXiv Detail & Related papers (2020-05-04T20:03:21Z) - ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene
Text Detection [147.10751375922035]
We propose the ContourNet, which effectively handles false positives and large scale variance of scene texts.
Our method effectively suppresses these false positives by only outputting predictions with high response value in both directions.
arXiv Detail & Related papers (2020-04-10T08:15:23Z) - Deep Relational Reasoning Graph Network for Arbitrary Shape Text
Detection [20.244378408779554]
We propose a novel unified relational reasoning graph network for arbitrary shape text detection.
An innovative local graph bridges a text proposal model via CNN and a deep relational reasoning network via Graph Convolutional Network (GCN)
Experiments on public available datasets demonstrate the state-of-the-art performance of our method.
arXiv Detail & Related papers (2020-03-17T01:50:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.