Toward Minimal Misalignment at Minimal Cost in One-Stage and Anchor-Free
Object Detection
- URL: http://arxiv.org/abs/2112.08902v1
- Date: Thu, 16 Dec 2021 14:22:13 GMT
- Title: Toward Minimal Misalignment at Minimal Cost in One-Stage and Anchor-Free
Object Detection
- Authors: Shuaizheng Hao, Hongzhe Liu, Ningwei Wang and Cheng Xu
- Abstract summary: classification and regression branches have different sensibility to the features from the same scale level and the same spatial location.
We propose a point-based prediction method, which is based on the assumption that the high classification confidence point has the high regression quality, leads to the misalignment problem.
We aim to resolve the phenomenon at minimal cost: a minor adjustment of the head network and a new label assignment method replacing the rigid one.
- Score: 6.486325109549893
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Common object detection models consist of classification and regression
branches, due to different task drivers, these two branches have different
sensibility to the features from the same scale level and the same spatial
location. The point-based prediction method, which is based on the assumption
that the high classification confidence point has the high regression quality,
leads to the misalignment problem. Our analysis shows, the problem is further
composed of scale misalignment and spatial misalignment specifically. We aim to
resolve the phenomenon at minimal cost: a minor adjustment of the head network
and a new label assignment method replacing the rigid one. Our experiments show
that, compared to the baseline FCOS, a one-stage and anchor-free object
detection model, our model consistently get around 3 AP improvement with
different backbones, demonstrating both simplicity and efficiency of our
method.
Related papers
- OCMG-Net: Neural Oriented Normal Refinement for Unstructured Point Clouds [18.234146052486054]
We present a robust refinement method for estimating oriented normals from unstructured point clouds.
Our framework incorporates sign orientation and data augmentation in the feature space to refine the initial oriented normals.
To address the issue of noise-caused direction inconsistency existing in previous approaches, we introduce a new metric called the Chamfer Normal Distance.
arXiv Detail & Related papers (2024-09-02T09:30:02Z) - Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark [101.23684938489413]
Anomaly detection (AD) is often focused on detecting anomalies for industrial quality inspection and medical lesion examination.
This work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field.
Inspired by the metrics in the segmentation field, we propose several more practical threshold-dependent AD-specific metrics.
arXiv Detail & Related papers (2024-04-16T17:38:26Z) - Match and Locate: low-frequency monocular odometry based on deep feature
matching [0.65268245109828]
We introduce a novel approach for the robotic odometry which only requires a single camera.
The approach is based on matching image features between the consecutive frames of the video stream using deep feature matching models.
We evaluate the performance of the approach in the AISG-SLA Visual Localisation Challenge and find that while being computationally efficient and easy to implement our method shows competitive results.
arXiv Detail & Related papers (2023-11-16T17:32:58Z) - Vanishing Point Estimation in Uncalibrated Images with Prior Gravity
Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame.
We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers.
We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - The KFIoU Loss for Rotated Object Detection [115.334070064346]
In this paper, we argue that one effective alternative is to devise an approximate loss who can achieve trend-level alignment with SkewIoU loss.
Specifically, we model the objects as Gaussian distribution and adopt Kalman filter to inherently mimic the mechanism of SkewIoU.
The resulting new loss called KFIoU is easier to implement and works better compared with exact SkewIoU.
arXiv Detail & Related papers (2022-01-29T10:54:57Z) - Layer-wise Customized Weak Segmentation Block and AIoU Loss for Accurate
Object Detection [7.3179451998143605]
We propose a scale-customized weak segmentation (SCWS) block at the pixel level for scale customized object feature learning in each layer.
By integrating the SCWS blocks into the single-shot detector, a scale-aware object detector (SCOD) is constructed to detect objects of different sizes naturally and accurately.
arXiv Detail & Related papers (2021-08-25T02:42:28Z) - Instance-Level Relative Saliency Ranking with Graph Reasoning [126.09138829920627]
We present a novel unified model to segment salient instances and infer relative saliency rank order.
A novel loss function is also proposed to effectively train the saliency ranking branch.
experimental results demonstrate that our proposed model is more effective than previous methods.
arXiv Detail & Related papers (2021-07-08T13:10:42Z) - Modulating Localization and Classification for Harmonized Object
Detection [40.82723262074911]
We propose a mutual learning framework to modulate the two tasks.
In particular, the two tasks are forced to learn from each other with a novel mutual labeling strategy.
We achieve a significant performance gain over the baseline detectors on the COCO dataset.
arXiv Detail & Related papers (2021-03-16T10:36:02Z) - Novelty Detection via Robust Variational Autoencoding [13.664682865991255]
We propose a new method for novelty detection that can tolerate high corruption of the training points.
Our method trains a robust variational autoencoder (VAE), which aims to generate a model for the uncorrupted training points.
arXiv Detail & Related papers (2020-06-09T22:34:27Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.