Implicit Feature Pyramid Network for Object Detection
- URL: http://arxiv.org/abs/2012.13563v1
- Date: Fri, 25 Dec 2020 11:30:27 GMT
- Title: Implicit Feature Pyramid Network for Object Detection
- Authors: Tiancai Wang, Xiangyu Zhang, Jian Sun
- Abstract summary: We present an implicit feature pyramid network (i-FPN) for object detection.
We propose to use an implicit function, recently introduced in deep equilibrium model (DEQ) to model the transformation of FPN.
Experimental results on MS dataset show that i-FPN can significantly boost detection performance compared to baseline detectors.
- Score: 22.530998243247154
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present an implicit feature pyramid network (i-FPN) for
object detection. Existing FPNs stack several cross-scale blocks to obtain
large receptive field. We propose to use an implicit function, recently
introduced in deep equilibrium model (DEQ), to model the transformation of FPN.
We develop a residual-like iteration to updates the hidden states efficiently.
Experimental results on MS COCO dataset show that i-FPN can significantly boost
detection performance compared to baseline detectors with ResNet-50-FPN: +3.4,
+3.2, +3.5, +4.2, +3.2 mAP on RetinaNet, Faster-RCNN, FCOS, ATSS and
AutoAssign, respectively.
Related papers
- SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud
Representation [65.4396959244269]
The paper tackles the challenge by designing a general framework to construct 3D learning architectures.
The proposed approach can be applied to general backbones like PointNet and DGCNN.
Experiments on ModelNet40, ShapeNet, and the real-world dataset ScanObjectNN, demonstrated that the method achieves a great trade-off between efficiency, rotation, and accuracy.
arXiv Detail & Related papers (2022-09-13T12:12:19Z) - ssFPN: Scale Sequence (S^2) Feature Based-Feature Pyramid Network for
Object Detection [4.844193288417161]
We propose a new scale sequence (S2) feature extraction of Feature Pyramid Network (FPN) to strengthen feature information of small objects.
We demonstrate the proposed S2 feature can improve the performance of both one-stage and two-stage detectors on MS COCO dataset.
arXiv Detail & Related papers (2022-08-24T13:29:12Z) - RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object
Detection [10.847953426161924]
We propose RCNet, which consists of Reverse Feature Pyramid (RevFP) and Cross-scale Shift Network (CSN)
RevFP utilizes local bidirectional feature fusion to simplify the bidirectional pyramid inference pipeline.
CSN directly propagates representations to both adjacent and non-adjacent levels to enable multi-scale features more correlative.
arXiv Detail & Related papers (2021-10-23T04:00:25Z) - Trident Pyramid Networks: The importance of processing at the feature
pyramid level for better object detection [50.008529403150206]
We present a new core architecture called Trident Pyramid Network (TPN)
TPN allows for a deeper design and for a better balance between communication-based processing and self-processing.
We show consistent improvements when using our TPN core on the object detection benchmark, outperforming the popular BiFPN baseline by 1.5 AP.
arXiv Detail & Related papers (2021-10-08T09:59:59Z) - A^2-FPN: Attention Aggregation based Feature Pyramid Network for
Instance Segmentation [68.10621089649486]
We propose Attention Aggregation based Feature Pyramid Network (A2-FPN) to improve multi-scale feature learning.
A2-FPN achieves an improvement of 2.0% and 1.4% mask AP when integrated into the strong baselines such as Cascade Mask R-CNN and Hybrid Task Cascade.
arXiv Detail & Related papers (2021-05-07T11:51:08Z) - OPANAS: One-Shot Path Aggregation Network Architecture Search for Object
Detection [82.04372532783931]
Recently, neural architecture search (NAS) has been exploited to design feature pyramid networks (FPNs)
We propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.
arXiv Detail & Related papers (2021-03-08T01:48:53Z) - End-to-End Object Detection with Fully Convolutional Network [71.56728221604158]
We introduce a Prediction-aware One-To-One (POTO) label assignment for classification to enable end-to-end detection.
A simple 3D Max Filtering (3DMF) is proposed to utilize the multi-scale features and improve the discriminability of convolutions in the local region.
Our end-to-end framework achieves competitive performance against many state-of-the-art detectors with NMS on COCO and CrowdHuman datasets.
arXiv Detail & Related papers (2020-12-07T09:14:55Z) - Dual Refinement Feature Pyramid Networks for Object Detection [2.88935873409577]
FPN is a common component used in object detectors, it supplements multi-scale information by adjacent level features and summation.
In this paper, we analyze the design defects from pixel level and feature map level.
We design a novel parameter-free feature pyramid networks named Dual Refinement Feature Pyramid Networks for the problems.
arXiv Detail & Related papers (2020-12-03T07:17:03Z) - Dynamic Feature Pyramid Networks for Object Detection [40.24111664691307]
We introduce an inception FPN in which each layer contains convolution filters with different kernel sizes to enlarge the receptive field.
We propose a new dynamic FPN (DyFPN) which consists of multiple branches with different computational costs.
Experiments conducted on benchmarks demonstrate that the proposed DyFPN significantly improves performance with the optimal allocation of computation resources.
arXiv Detail & Related papers (2020-12-01T19:03:55Z) - Regularized Densely-connected Pyramid Network for Salient Instance
Segmentation [73.17802158095813]
We propose a new pipeline for end-to-end salient instance segmentation (SIS)
To better use the rich feature hierarchies in deep networks, we propose the regularized dense connections.
A novel multi-level RoIAlign based decoder is introduced to adaptively aggregate multi-level features for better mask predictions.
arXiv Detail & Related papers (2020-08-28T00:13:30Z) - ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid
Networks for Accurate Dense Pixel Matching [10.303618438296981]
Feature Pyramid Networks (FPN) have proven to be a suitable feature extractor for CNN-based dense matching tasks.
We present ResFPN -- a multi-resolution feature pyramid network with multiple residual skip connections.
In our ablation study, we demonstrate the effectiveness of our novel architecture with clearly higher accuracy than FPN.
arXiv Detail & Related papers (2020-06-22T13:31:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.