RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object
Detection
- URL: http://arxiv.org/abs/2110.12130v1
- Date: Sat, 23 Oct 2021 04:00:25 GMT
- Title: RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object
Detection
- Authors: Zhuofan Zong, Qianggang Cao, Biao Leng
- Abstract summary: We propose RCNet, which consists of Reverse Feature Pyramid (RevFP) and Cross-scale Shift Network (CSN)
RevFP utilizes local bidirectional feature fusion to simplify the bidirectional pyramid inference pipeline.
CSN directly propagates representations to both adjacent and non-adjacent levels to enable multi-scale features more correlative.
- Score: 10.847953426161924
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Feature pyramid networks (FPN) are widely exploited for multi-scale feature
fusion in existing advanced object detection frameworks. Numerous previous
works have developed various structures for bidirectional feature fusion, all
of which are shown to improve the detection performance effectively. We observe
that these complicated network structures require feature pyramids to be
stacked in a fixed order, which introduces longer pipelines and reduces the
inference speed. Moreover, semantics from non-adjacent levels are diluted in
the feature pyramid since only features at adjacent pyramid levels are merged
by the local fusion operation in a sequence manner. To address these issues, we
propose a novel architecture named RCNet, which consists of Reverse Feature
Pyramid (RevFP) and Cross-scale Shift Network (CSN). RevFP utilizes local
bidirectional feature fusion to simplify the bidirectional pyramid inference
pipeline. CSN directly propagates representations to both adjacent and
non-adjacent levels to enable multi-scale features more correlative. Extensive
experiments on the MS COCO dataset demonstrate RCNet can consistently bring
significant improvements over both one-stage and two-stage detectors with
subtle extra computational overhead. In particular, RetinaNet is boosted to
40.2 AP, which is 3.7 points higher than baseline, by replacing FPN with our
proposed model. On COCO test-dev, RCNet can achieve very competitive
performance with a single-model single-scale 50.5 AP. Codes will be made
available.
Related papers
- Trident Pyramid Networks: The importance of processing at the feature
pyramid level for better object detection [50.008529403150206]
We present a new core architecture called Trident Pyramid Network (TPN)
TPN allows for a deeper design and for a better balance between communication-based processing and self-processing.
We show consistent improvements when using our TPN core on the object detection benchmark, outperforming the popular BiFPN baseline by 1.5 AP.
arXiv Detail & Related papers (2021-10-08T09:59:59Z) - A^2-FPN: Attention Aggregation based Feature Pyramid Network for
Instance Segmentation [68.10621089649486]
We propose Attention Aggregation based Feature Pyramid Network (A2-FPN) to improve multi-scale feature learning.
A2-FPN achieves an improvement of 2.0% and 1.4% mask AP when integrated into the strong baselines such as Cascade Mask R-CNN and Hybrid Task Cascade.
arXiv Detail & Related papers (2021-05-07T11:51:08Z) - OPANAS: One-Shot Path Aggregation Network Architecture Search for Object
Detection [82.04372532783931]
Recently, neural architecture search (NAS) has been exploited to design feature pyramid networks (FPNs)
We propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.
arXiv Detail & Related papers (2021-03-08T01:48:53Z) - Implicit Feature Pyramid Network for Object Detection [22.530998243247154]
We present an implicit feature pyramid network (i-FPN) for object detection.
We propose to use an implicit function, recently introduced in deep equilibrium model (DEQ) to model the transformation of FPN.
Experimental results on MS dataset show that i-FPN can significantly boost detection performance compared to baseline detectors.
arXiv Detail & Related papers (2020-12-25T11:30:27Z) - Dual Refinement Feature Pyramid Networks for Object Detection [2.88935873409577]
FPN is a common component used in object detectors, it supplements multi-scale information by adjacent level features and summation.
In this paper, we analyze the design defects from pixel level and feature map level.
We design a novel parameter-free feature pyramid networks named Dual Refinement Feature Pyramid Networks for the problems.
arXiv Detail & Related papers (2020-12-03T07:17:03Z) - Parallel Residual Bi-Fusion Feature Pyramid Network for Accurate
Single-Shot Object Detection [22.817918566911203]
This paper proposes the Parallel Residual Bi-Fusion Feature Pyramid Network (PRB-FPN) for fast and accurate single-shot object detection.
The proposed network achieves state-of-the-art performance on the UAVDT17 and MS COCO datasets.
arXiv Detail & Related papers (2020-12-03T06:51:20Z) - Suppress and Balance: A Simple Gated Network for Salient Object
Detection [89.88222217065858]
We propose a simple gated network (GateNet) to solve both issues at once.
With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder.
In addition, we adopt the atrous spatial pyramid pooling based on the proposed "Fold" operation (Fold-ASPP) to accurately localize salient objects of various scales.
arXiv Detail & Related papers (2020-07-16T02:00:53Z) - A novel Region of Interest Extraction Layer for Instance Segmentation [3.5493798890908104]
This paper is motivated by the need to overcome the limitations of existing RoI extractors.
The proposed layer (called Generic RoI Extractor - GRoIE) introduces non-local building blocks and attention mechanisms to boost the performance.
GRoIE can be integrated seamlessly with every two-stage architecture for both object detection and instance segmentation tasks.
arXiv Detail & Related papers (2020-04-28T17:07:32Z) - Feature Pyramid Grids [140.11116687047058]
We present Feature Pyramid Grids (FPG), a deep multi-pathway feature pyramid.
FPG can improve single-pathway feature pyramid networks by significantly increasing its performance at similar computation cost.
arXiv Detail & Related papers (2020-04-07T17:59:52Z) - Cross-layer Feature Pyramid Network for Salient Object Detection [102.20031050972429]
We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
arXiv Detail & Related papers (2020-02-25T14:06:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.