ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
- URL: http://arxiv.org/abs/2002.10200v2
- Date: Tue, 25 Feb 2020 08:02:28 GMT
- Title: ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
- Authors: Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei
Wang
- Abstract summary: We propose the Adaptive Bezier-Curve Network (ABCNet) for scene text detection and recognition.
For the first time, we adaptively fit arbitrarily-shaped text by a parameterized Bezier curve.
Compared with standard bounding box detection, our Bezier curve detection introduces negligible overhead, resulting in superiority of our method in both efficiency and accuracy.
- Score: 108.07304516679103
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Scene text detection and recognition has received increasing research
attention. Existing methods can be roughly categorized into two groups:
character-based and segmentation-based. These methods either are costly for
character annotation or need to maintain a complex pipeline, which is often not
suitable for real-time applications. Here we address the problem by proposing
the Adaptive Bezier-Curve Network (ABCNet). Our contributions are three-fold:
1) For the first time, we adaptively fit arbitrarily-shaped text by a
parameterized Bezier curve. 2) We design a novel BezierAlign layer for
extracting accurate convolution features of a text instance with arbitrary
shapes, significantly improving the precision compared with previous methods.
3) Compared with standard bounding box detection, our Bezier curve detection
introduces negligible computation overhead, resulting in superiority of our
method in both efficiency and accuracy. Experiments on arbitrarily-shaped
benchmark datasets, namely Total-Text and CTW1500, demonstrate that ABCNet
achieves state-of-the-art accuracy, meanwhile significantly improving the
speed. In particular, on Total-Text, our realtime version is over 10 times
faster than recent state-of-the-art methods with a competitive recognition
accuracy. Code is available at https://tinyurl.com/AdelaiDet
Related papers
- LRANet: Towards Accurate and Efficient Scene Text Detection with
Low-Rank Approximation Network [63.554061288184165]
We propose a novel parameterized text shape method based on low-rank approximation.
By exploring the shape correlation among different text contours, our method achieves consistency, compactness, simplicity, and robustness in shape representation.
We implement an accurate and efficient arbitrary-shaped text detector named LRANet.
arXiv Detail & Related papers (2023-06-27T02:03:46Z) - DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in
Transformer [94.35116535588332]
Transformer-based methods, which predict polygon points or Bezier curve control points to localize texts, are quite popular in scene text detection.
However, the used point label form implies the reading order of humans, which affects the robustness of Transformer model.
We propose DPText-DETR, which directly uses point coordinates as queries and dynamically updates them between decoder layers.
arXiv Detail & Related papers (2022-07-10T15:45:16Z) - CentripetalText: An Efficient Text Instance Representation for Scene
Text Detection [19.69057252363207]
We propose an efficient text instance representation named CentripetalText (CT)
CT decomposes text instances into the combination of text kernels and centripetal shifts.
For the task of scene text detection, our approach achieves superior or competitive performance compared to other existing methods.
arXiv Detail & Related papers (2021-07-13T09:34:18Z) - RSCA: Real-time Segmentation-based Context-Aware Scene Text Detection [14.125634725954848]
We propose RSCA: a Real-time-based Context-Aware model for arbitrary-shaped scene text detection.
Based on these strategies, RSCA achieves state-of-the-art performance in both speed and accuracy, without complex label assignments or repeated feature aggregations.
arXiv Detail & Related papers (2021-05-26T18:43:17Z) - ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text
Spotting [108.93803186429017]
End-to-end text-spotting aims to integrate detection and recognition in a unified framework.
Here, we tackle end-to-end text spotting by presenting Adaptive Bezier Curve Network v2 (ABCNet v2)
Our main contributions are four-fold: 1) For the first time, we adaptively fit arbitrarily-shaped text by a parameterized Bezier curve, which, compared with segmentation-based methods, can not only provide structured output but also controllable representation.
Comprehensive experiments conducted on various bilingual (English and Chinese) benchmark datasets demonstrate that ABCNet v2 can achieve state-of-the
arXiv Detail & Related papers (2021-05-08T07:46:55Z) - PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering
Network [54.03560668182197]
We propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time.
With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations.
Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed.
arXiv Detail & Related papers (2021-04-12T13:27:34Z) - ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene
Text Detection [147.10751375922035]
We propose the ContourNet, which effectively handles false positives and large scale variance of scene texts.
Our method effectively suppresses these false positives by only outputting predictions with high response value in both directions.
arXiv Detail & Related papers (2020-04-10T08:15:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.