RRPN++: Guidance Towards More Accurate Scene Text Detection
- URL: http://arxiv.org/abs/2009.13118v1
- Date: Mon, 28 Sep 2020 08:00:35 GMT
- Title: RRPN++: Guidance Towards More Accurate Scene Text Detection
- Authors: Jianqi Ma
- Abstract summary: We propose RRPN++ to exploit the potential of RRPN-based model by several improvements.
Based on RRPN, we propose the Anchor-free Pyramid Proposal Networks (APPN) to generate first-stage proposals.
In our second stage, both the detection branch and the recognition branch are incorporated to perform multi-task learning.
- Score: 0.30458514384586394
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: RRPN is among the outstanding scene text detection approaches, but the
manually-designed anchor and coarse proposal refinement make the performance
still far from perfection. In this paper, we propose RRPN++ to exploit the
potential of RRPN-based model by several improvements. Based on RRPN, we
propose the Anchor-free Pyramid Proposal Networks (APPN) to generate
first-stage proposals, which adopts the anchor-free design to reduce proposal
number and accelerate the inference speed. In our second stage, both the
detection branch and the recognition branch are incorporated to perform
multi-task learning. In inference stage, the detection branch outputs the
proposal refinement and the recognition branch predicts the transcript of the
refined text region. Further, the recognition branch also helps rescore the
proposals and eliminate the false positive proposals by the jointing filtering
strategy. With these enhancements, we boost the detection results by $6\%$ of
F-measure in ICDAR2015 compared to RRPN. Experiments conducted on other
benchmarks also illustrate the superior performance and efficiency of our
model.
Related papers
- PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection [26.843891792018447]
We present PETDet (Proposal Enhancement for Two-stage fine-grained object detection) to better handle the sub-tasks in two-stage FGOD methods.
An anchor-free Quality Oriented Proposal Network (QOPN) is proposed with dynamic label assignment and attention-based decomposition.
A novel Adaptive Recognition Loss (ARL) offers guidance for the R-CNN head to focus on high-quality proposals.
arXiv Detail & Related papers (2023-12-16T18:04:56Z) - Rank-DETR for High Quality Object Detection [52.82810762221516]
A highly performant object detector requires accurate ranking for the bounding box predictions.
In this work, we introduce a simple and highly performant DETR-based object detector by proposing a series of rank-oriented designs.
arXiv Detail & Related papers (2023-10-13T04:48:32Z) - Label-Efficient Object Detection via Region Proposal Network
Pre-Training [58.50615557874024]
We propose a simple pretext task that provides an effective pre-training for the region proposal network (RPN)
In comparison with multi-stage detectors without RPN pre-training, our approach is able to consistently improve downstream task performance.
arXiv Detail & Related papers (2022-11-16T16:28:18Z) - A New High-Performance Approach to Approximate Pattern-Matching for
Plagiarism Detection in Blockchain-Based Non-Fungible Tokens (NFTs) [0.0]
We present a fast and innovative approach to performing approximate pattern-matching for plagiarism detection using an NDFA-based approach.
We outline the advantages of our approach in the context of blockchain-based non-fungible tokens (NFTs)
arXiv Detail & Related papers (2022-05-28T17:53:20Z) - Adaptive Proposal Generation Network for Temporal Sentence Localization
in Videos [58.83440885457272]
We address the problem of temporal sentence localization in videos (TSLV)
Traditional methods follow a top-down framework which localizes the target segment with pre-defined segment proposals.
We propose an Adaptive Proposal Generation Network (APGN) to maintain the segment-level interaction while speeding up the efficiency.
arXiv Detail & Related papers (2021-09-14T02:02:36Z) - Augmenting Proposals by the Detector Itself [16.059480514515506]
In this paper, we design a novel training method named APDI, which means augmenting proposals by the detector itself and can generate proposals with higher quality.
Experiments on COCO dataset show that our method brings at least 2.7 AP improvements on Faster R-CNN with various backbones.
It can cooperate with advanced RPNs, such as GA-RPN and Cascade RPN, to obtain extra gains.
arXiv Detail & Related papers (2021-01-28T02:48:00Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - PPGN: Phrase-Guided Proposal Generation Network For Referring Expression
Comprehension [31.39505099600821]
We propose a novel phrase-guided proposal generation network ( PPGN)
The main implementation principle of PPGN is refining visual features with text and generate proposals through regression.
Experiments show that our method is effective and achieve SOTA performance in benchmark datasets.
arXiv Detail & Related papers (2020-12-20T11:21:06Z) - CRACT: Cascaded Regression-Align-Classification for Robust Visual
Tracking [97.84109669027225]
We introduce an improved proposal refinement module, Cascaded Regression-Align- Classification (CRAC)
CRAC yields new state-of-the-art performances on many benchmarks.
In experiments on seven benchmarks including OTB-2015, UAV123, NfS, VOT-2018, TrackingNet, GOT-10k and LaSOT, our CRACT exhibits very promising results in comparison with state-of-the-art competitors.
arXiv Detail & Related papers (2020-11-25T02:18:33Z) - Collaborative Training between Region Proposal Localization and
Classification for Domain Adaptive Object Detection [121.28769542994664]
Domain adaptation for object detection tries to adapt the detector from labeled datasets to unlabeled ones for better performance.
In this paper, we are the first to reveal that the region proposal network (RPN) and region proposal classifier(RPC) demonstrate significantly different transferability when facing large domain gap.
arXiv Detail & Related papers (2020-09-17T07:39:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.