Solving Missing-Annotation Object Detection with Background
Recalibration Loss
- URL: http://arxiv.org/abs/2002.05274v2
- Date: Mon, 3 Aug 2020 19:21:26 GMT
- Title: Solving Missing-Annotation Object Detection with Background
Recalibration Loss
- Authors: Han Zhang, Fangyi Chen, Zhiqiang Shen, Qiqi Hao, Chenchen Zhu, Marios
Savvides
- Abstract summary: This paper focuses on a novel and challenging detection scenario: A majority of true objects/instances is unlabeled in the datasets.
Previous art has proposed to use soft sampling to re-weight the gradients of RoIs based on the overlaps with positive instances, while their method is mainly based on the two-stage detector.
In this paper, we introduce a superior solution called Background Recalibration Loss (BRL) that can automatically re-calibrate the loss signals according to the pre-defined IoU threshold and input image.
- Score: 49.42997894751021
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper focuses on a novel and challenging detection scenario: A majority
of true objects/instances is unlabeled in the datasets, so these
missing-labeled areas will be regarded as the background during training.
Previous art on this problem has proposed to use soft sampling to re-weight the
gradients of RoIs based on the overlaps with positive instances, while their
method is mainly based on the two-stage detector (i.e. Faster RCNN) which is
more robust and friendly for the missing label scenario. In this paper, we
introduce a superior solution called Background Recalibration Loss (BRL) that
can automatically re-calibrate the loss signals according to the pre-defined
IoU threshold and input image. Our design is built on the one-stage detector
which is faster and lighter. Inspired by the Focal Loss formulation, we make
several significant modifications to fit on the missing-annotation
circumstance. We conduct extensive experiments on the curated PASCAL VOC and MS
COCO datasets. The results demonstrate that our proposed method outperforms the
baseline and other state-of-the-arts by a large margin. Code available:
https://github.com/Dwrety/mmdetection-selective-iou.
Related papers
- MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection [36.478530086163744]
We propose a novel Mutually optimizing pre-training framework for remote sensing object Detection, dubbed as MutDet.
MutDet fuses the object embeddings and detector features bidirectionally in the last encoder layer, enhancing their information interaction.
Experiments on various settings show new state-of-the-art transfer performance.
arXiv Detail & Related papers (2024-07-13T15:28:15Z) - Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels [17.211935951030114]
We propose a robust oriented remote sensing object detection method through dynamic loss decay (DLD) mechanism.
Our solution has won the 2st place in the "fine-grained object detection based on sub-meter remote sensing imagery" track with noisy labels of 2023 National Big Data and Computing Intelligence Challenge.
arXiv Detail & Related papers (2024-05-15T01:29:28Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Rethinking the Localization in Weakly Supervised Object Localization [51.29084037301646]
Weakly supervised object localization (WSOL) is one of the most popular and challenging tasks in computer vision.
Recent dividing WSOL into two parts (class-agnostic object localization and object classification) has become the state-of-the-art pipeline for this task.
We propose to replace SCR with a binary-class detector (BCD) for localizing multiple objects, where the detector is trained by discriminating the foreground and background.
arXiv Detail & Related papers (2023-08-11T14:38:51Z) - Boosting Few-shot Fine-grained Recognition with Background Suppression
and Foreground Alignment [53.401889855278704]
Few-shot fine-grained recognition (FS-FGR) aims to recognize novel fine-grained categories with the help of limited available samples.
We propose a two-stage background suppression and foreground alignment framework, which is composed of a background activation suppression (BAS) module, a foreground object alignment (FOA) module, and a local to local (L2L) similarity metric.
Experiments conducted on multiple popular fine-grained benchmarks demonstrate that our method outperforms the existing state-of-the-art by a large margin.
arXiv Detail & Related papers (2022-10-04T07:54:40Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - Salvage Reusable Samples from Noisy Data for Robust Learning [70.48919625304]
We propose a reusable sample selection and correction approach, termed as CRSSC, for coping with label noise in training deep FG models with web images.
Our key idea is to additionally identify and correct reusable samples, and then leverage them together with clean examples to update the networks.
arXiv Detail & Related papers (2020-08-06T02:07:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.