Scene Text Detection with Selected Anchor
- URL: http://arxiv.org/abs/2008.08523v1
- Date: Wed, 19 Aug 2020 16:03:13 GMT
- Title: Scene Text Detection with Selected Anchor
- Authors: Anna Zhu, Hang Du, Shengwu Xiong
- Abstract summary: Object proposal technique with dense anchoring scheme for scene text detection was applied frequently to achieve high recall.
We propose an anchor selection-based region proposal network (AS-RPN) using effective selected anchors instead of dense anchors.
- Score: 16.27975694546667
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object proposal technique with dense anchoring scheme for scene text
detection were applied frequently to achieve high recall. It results in the
significant improvement in accuracy but waste of computational searching,
regression and classification. In this paper, we propose an anchor
selection-based region proposal network (AS-RPN) using effective selected
anchors instead of dense anchors to extract text proposals. The center, scales,
aspect ratios and orientations of anchors are learnable instead of fixing,
which leads to high recall and greatly reduced numbers of anchors. By replacing
the anchor-based RPN in Faster RCNN, the AS-RPN-based Faster RCNN can achieve
comparable performance with previous state-of-the-art text detecting approaches
on standard benchmarks, including COCO-Text, ICDAR2013, ICDAR2015 and
MSRA-TD500 when using single-scale and single model (ResNet50) testing only.
Related papers
- Efficient k-NN Search with Cross-Encoders using Adaptive Multi-Round CUR
Decomposition [77.4863142882136]
Cross-encoder models are prohibitively expensive for direct k-nearest neighbor (k-NN) search.
We propose ADACUR, a method that adaptively, iteratively, and efficiently minimizes the approximation error for the practically important top-k neighbors.
arXiv Detail & Related papers (2023-05-04T17:01:17Z) - Group R-CNN for Weakly Semi-supervised Object Detection with Points [18.720915213798623]
We propose an effective point-to-box regressor: Group R-CNN.
Group R-CNN first uses instance-level proposal grouping to generate a group of proposals for each point annotation.
We show that Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images.
arXiv Detail & Related papers (2022-05-12T07:17:54Z) - Local Citation Recommendation with Hierarchical-Attention Text Encoder
and SciBERT-based Reranking [6.456347800676685]
BM25 has been found to be a tough-to-beat approach to prefetching.
In this paper, we explore prefetching with nearest neighbor search among text embeddings constructed by a hierarchical attention network.
When coupled with a SciBERT reranker fine-tuned on local citation recommendation tasks, our hierarchical Attention encoder (HAtten) achieves high prefetch recall for a given number of candidates to be reranked.
arXiv Detail & Related papers (2021-12-02T13:20:26Z) - On Addressing Practical Challenges for RNN-Transduce [72.72132048437751]
We adapt a well-trained RNN-T model to a new domain without collecting the audio data.
We obtain word-level confidence scores by utilizing several types of features calculated during decoding.
The proposed time stamping method can get less than 50ms word timing difference on average.
arXiv Detail & Related papers (2021-04-27T23:31:43Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - Collaborative Training between Region Proposal Localization and
Classification for Domain Adaptive Object Detection [121.28769542994664]
Domain adaptation for object detection tries to adapt the detector from labeled datasets to unlabeled ones for better performance.
In this paper, we are the first to reveal that the region proposal network (RPN) and region proposal classifier(RPC) demonstrate significantly different transferability when facing large domain gap.
arXiv Detail & Related papers (2020-09-17T07:39:52Z) - Semi-Anchored Detector for One-Stage Object Detection [24.42034747738174]
A standard one-stage detector is comprised of two tasks: classification and regression.
With ResNet-101 as the backbone, the proposed semi-anchored detector achieves 43.6% mAP on COCO data set.
arXiv Detail & Related papers (2020-09-10T16:57:09Z) - Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text
Spotting [71.6244869235243]
Most arbitrary-shape scene text spotters use region proposal networks (RPN) to produce proposals.
Our Mask TextSpotter v3 can handle text instances of extreme aspect ratios or irregular shapes, and its recognition accuracy won't be affected by nearby text or background noise.
arXiv Detail & Related papers (2020-07-18T17:25:50Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.