EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement
- URL: http://arxiv.org/abs/2002.07421v1
- Date: Tue, 18 Feb 2020 08:04:58 GMT
- Title: EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement
- Authors: Linpu Fang, Hang Xu, Zhili Liu, Sarah Parisot, Zhenguo Li
- Abstract summary: We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
- Score: 53.69674636044927
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detectors trained on fully-annotated data currently yield state of the
art performance but require expensive manual annotations. On the other hand,
weakly-supervised detectors have much lower performance and cannot be used
reliably in a realistic setting. In this paper, we study the hybrid-supervised
object detection problem, aiming to train a high quality detector with only a
limited amount of fullyannotated data and fully exploiting cheap data with
imagelevel labels. State of the art methods typically propose an iterative
approach, alternating between generating pseudo-labels and updating a detector.
This paradigm requires careful manual hyper-parameter tuning for mining good
pseudo labels at each round and is quite time-consuming. To address these
issues, we present EHSOD, an end-to-end hybrid-supervised object detection
system which can be trained in one shot on both fully and weakly-annotated
data. Specifically, based on a two-stage detector, we proposed two modules to
fully utilize the information from both kinds of labels: 1) CAMRPN module aims
at finding foreground proposals guided by a class activation heat-map; 2)
hybrid-supervised cascade module further refines the bounding-box position and
classification with the help of an auxiliary head compatible with image-level
data. Extensive experiments demonstrate the effectiveness of the proposed
method and it achieves comparable results on multiple object detection
benchmarks with only 30% fully-annotated data, e.g. 37.5% mAP on COCO. We will
release the code and the trained models.
Related papers
- Semi-supervised Open-World Object Detection [74.95267079505145]
We introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD)
We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting.
Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2024-02-25T07:12:51Z) - Towards Few-Annotation Learning for Object Detection: Are
Transformer-based Models More Efficient ? [11.416621957617334]
In this paper, we propose a semi-supervised method tailored for the current state-of-the-art object detector Deformable DETR.
We evaluate our method on the semi-supervised object detection benchmarks COCO and Pascal VOC, and it outperforms previous methods, especially when annotations are scarce.
arXiv Detail & Related papers (2023-10-30T18:51:25Z) - KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection [48.66703222700795]
We resort to a novel kernel strategy to identify the most informative point clouds to acquire labels.
To accommodate both one-stage (i.e., SECOND) and two-stage detectors, we incorporate the classification entropy tangent and well trade-off between detection performance and the total number of bounding boxes selected for annotation.
Our results show that approximately 44% box-level annotation costs and 26% computational time are reduced compared to the state-of-the-art method.
arXiv Detail & Related papers (2023-07-16T04:27:03Z) - Semi-Supervised and Long-Tailed Object Detection with CascadeMatch [91.86787064083012]
We propose a novel pseudo-labeling-based detector called CascadeMatch.
Our detector features a cascade network architecture, which has multi-stage detection heads with progressive confidence thresholds.
We show that CascadeMatch surpasses existing state-of-the-art semi-supervised approaches in handling long-tailed object detection.
arXiv Detail & Related papers (2023-05-24T07:09:25Z) - Augment and Criticize: Exploring Informative Samples for Semi-Supervised
Monocular 3D Object Detection [64.65563422852568]
We improve the challenging monocular 3D object detection problem with a general semi-supervised framework.
We introduce a novel, simple, yet effective Augment and Criticize' framework that explores abundant informative samples from unlabeled data.
The two new detectors, dubbed 3DSeMo_DLE and 3DSeMo_FLEX, achieve state-of-the-art results with remarkable improvements for over 3.5% AP_3D/BEV (Easy) on KITTI.
arXiv Detail & Related papers (2023-03-20T16:28:15Z) - Weakly-Supervised Salient Object Detection Using Point Supervison [17.88596733603456]
Current state-of-the-art saliency detection models rely heavily on large datasets of accurate pixel-wise annotations.
We propose a novel weakly-supervised salient object detection method using point supervision.
Our method outperforms the previous state-of-the-art methods trained with the stronger supervision.
arXiv Detail & Related papers (2022-03-22T12:16:05Z) - WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection [75.80075054706079]
We propose a weakly- and semi-supervised object detection framework (WSSOD)
An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images.
The proposed framework demonstrates remarkable performance on PASCAL-VOC and MSCOCO benchmark, achieving a high performance comparable to those obtained in fully-supervised settings.
arXiv Detail & Related papers (2021-05-21T11:58:50Z) - Self-Supervised Person Detection in 2D Range Data using a Calibrated
Camera [83.31666463259849]
We propose a method to automatically generate training labels (called pseudo-labels) for 2D LiDAR-based person detectors.
We show that self-supervised detectors, trained or fine-tuned with pseudo-labels, outperform detectors trained using manual annotations.
Our method is an effective way to improve person detectors during deployment without any additional labeling effort.
arXiv Detail & Related papers (2020-12-16T12:10:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.