Selective Multi-Scale Learning for Object Detection
- URL: http://arxiv.org/abs/2206.08206v1
- Date: Thu, 16 Jun 2022 14:23:50 GMT
- Title: Selective Multi-Scale Learning for Object Detection
- Authors: Junliang Chen, Weizeng Lu, Linlin Shen
- Abstract summary: RetinaNet combined with SMSL obtains 1.8% improvement in AP (from 39.1% to 40.9%) on COCO dataset.
When integrated with SMSL, two-stage detectors can get around 1.0% improvement in AP.
- Score: 35.08958597150306
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Pyramidal networks are standard methods for multi-scale object detection.
Current researches on feature pyramid networks usually adopt layer connections
to collect features from certain levels of the feature hierarchy, and do not
consider the significant differences among them. We propose a better
architecture of feature pyramid networks, named selective multi-scale learning
(SMSL), to address this issue. SMSL is efficient and general, which can be
integrated in both single-stage and two-stage detectors to boost detection
performance, with nearly no extra inference cost. RetinaNet combined with SMSL
obtains 1.8\% improvement in AP (from 39.1\% to 40.9\%) on COCO dataset. When
integrated with SMSL, two-stage detectors can get around 1.0\% improvement in
AP.
Related papers
- Class Anchor Margin Loss for Content-Based Image Retrieval [97.81742911657497]
We propose a novel repeller-attractor loss that falls in the metric learning paradigm, yet directly optimize for the L2 metric without the need of generating pairs.
We evaluate the proposed objective in the context of few-shot and full-set training on the CBIR task, by using both convolutional and transformer architectures.
arXiv Detail & Related papers (2023-06-01T12:53:10Z) - Extending One-Stage Detection with Open-World Proposals [8.492340530784697]
We show that fully convolutional one-stage detection network FCOS can increase OWP performance by as much as 6% in recall on novel classes.
While two-stage methods worsen by 6% in recall on novel classes, we show that FCOS only drops 2% when jointly optimizing for OWP and classification.
arXiv Detail & Related papers (2022-01-07T02:29:09Z) - Disentangle Your Dense Object Detector [82.22771433419727]
Deep learning-based dense object detectors have achieved great success in the past few years and have been applied to numerous multimedia applications such as video understanding.
However, the current training pipeline for dense detectors is compromised to lots of conjunctions that may not hold.
We propose Disentangled Dense Object Detector (DDOD), in which simple and effective disentanglement mechanisms are designed and integrated into the current state-of-the-art detectors.
arXiv Detail & Related papers (2021-07-07T00:52:16Z) - Decoupled and Memory-Reinforced Networks: Towards Effective Feature
Learning for One-Step Person Search [65.51181219410763]
One-step methods have been developed to handle pedestrian detection and identification sub-tasks using a single network.
There are two major challenges in the current one-step approaches.
We propose a decoupled and memory-reinforced network (DMRNet) to overcome these problems.
arXiv Detail & Related papers (2021-02-22T06:19:45Z) - Learning to Generate Content-Aware Dynamic Detectors [62.74209921174237]
We introduce a newpective of designing efficient detectors, which is automatically generating sample-adaptive model architecture.
We introduce a course-to-fine strat-egy tailored for object detection to guide the learning of dynamic routing.
Experiments on MS-COCO dataset demonstrate that CADDet achieves 1.8 higher mAP with 10% fewer FLOPs compared with vanilla routing.
arXiv Detail & Related papers (2020-12-08T08:05:20Z) - Dual Refinement Feature Pyramid Networks for Object Detection [2.88935873409577]
FPN is a common component used in object detectors, it supplements multi-scale information by adjacent level features and summation.
In this paper, we analyze the design defects from pixel level and feature map level.
We design a novel parameter-free feature pyramid networks named Dual Refinement Feature Pyramid Networks for the problems.
arXiv Detail & Related papers (2020-12-03T07:17:03Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z) - A novel Region of Interest Extraction Layer for Instance Segmentation [3.5493798890908104]
This paper is motivated by the need to overcome the limitations of existing RoI extractors.
The proposed layer (called Generic RoI Extractor - GRoIE) introduces non-local building blocks and attention mechanisms to boost the performance.
GRoIE can be integrated seamlessly with every two-stage architecture for both object detection and instance segmentation tasks.
arXiv Detail & Related papers (2020-04-28T17:07:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.