USD: Unknown Sensitive Detector Empowered by Decoupled Objectness and
Segment Anything Model
- URL: http://arxiv.org/abs/2306.02275v1
- Date: Sun, 4 Jun 2023 06:42:09 GMT
- Title: USD: Unknown Sensitive Detector Empowered by Decoupled Objectness and
Segment Anything Model
- Authors: Yulin He, Wei Chen, Yusong Tan, Siqi Wang
- Abstract summary: Open World Object Detection (OWOD) is a novel and challenging computer vision task.
We propose a simple yet effective learning strategy, namely Decoupled Objectness Learning (DOL), which divides the learning of these two boundaries into decoder layers.
We also introduce an Auxiliary Supervision Framework (ASF) that uses a pseudo-labeling and a soft-weighting strategies to alleviate the negative impact of noise.
- Score: 14.080744645704751
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Open World Object Detection (OWOD) is a novel and challenging computer vision
task that enables object detection with the ability to detect unknown objects.
Existing methods typically estimate the object likelihood with an additional
objectness branch, but ignore the conflict in learning objectness and
classification boundaries, which oppose each other on the semantic manifold and
training objective. To address this issue, we propose a simple yet effective
learning strategy, namely Decoupled Objectness Learning (DOL), which divides
the learning of these two boundaries into suitable decoder layers. Moreover,
detecting unknown objects comprehensively requires a large amount of
annotations, but labeling all unknown objects is both difficult and expensive.
Therefore, we propose to take advantage of the recent Large Vision Model (LVM),
specifically the Segment Anything Model (SAM), to enhance the detection of
unknown objects. Nevertheless, the output results of SAM contain noise,
including backgrounds and fragments, so we introduce an Auxiliary Supervision
Framework (ASF) that uses a pseudo-labeling and a soft-weighting strategies to
alleviate the negative impact of noise. Extensive experiments on popular
benchmarks, including Pascal VOC and MS COCO, demonstrate the effectiveness of
our approach. Our proposed Unknown Sensitive Detector (USD) outperforms the
recent state-of-the-art methods in terms of Unknown Recall, achieving
significant improvements of 14.3\%, 15.5\%, and 8.9\% on the M-OWODB, and
27.1\%, 29.1\%, and 25.1\% on the S-OWODB.
Related papers
- Open-World Object Detection with Instance Representation Learning [1.8749305679160366]
We propose a method to train an object detector that can both detect novel objects and extract semantically rich features in open-world conditions.
Our method learns a robust and generalizable feature space, outperforming other OWOD-based feature extraction methods.
arXiv Detail & Related papers (2024-09-24T13:13:34Z) - SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.
Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation.
Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z) - Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images [11.217630579076237]
Few-shot object detection (FSOD) has garnered significant research attention in the field of remote sensing.
We propose a novel FSOD method for remote sensing images called Few-shot Oriented object detection with Memorable Contrastive learning (FOMC)
Specifically, we employ oriented bounding boxes instead of traditional horizontal bounding boxes to learn a better feature representation for arbitrary-oriented aerial objects.
arXiv Detail & Related papers (2024-03-20T08:15:18Z) - Semi-supervised Open-World Object Detection [74.95267079505145]
We introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD)
We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting.
Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2024-02-25T07:12:51Z) - Unsupervised Recognition of Unknown Objects for Open-World Object
Detection [28.787586991713535]
Open-World Object Detection (OWOD) extends object detection problem to a realistic and dynamic scenario.
Current OWOD models, such as ORE and OW-DETR, focus on pseudo-labeling regions with high objectness scores as unknowns.
This paper proposes a novel approach that learns an unsupervised discriminative model to recognize true unknown objects.
arXiv Detail & Related papers (2023-08-31T08:17:29Z) - Identification of Novel Classes for Improving Few-Shot Object Detection [12.013345715187285]
Few-shot object detection (FSOD) methods offer a remedy by realizing robust object detection using only a few training samples per class.
We develop a semi-supervised algorithm to detect and then utilize unlabeled novel objects as positive samples during training to improve FSOD performance.
Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods.
arXiv Detail & Related papers (2023-03-18T14:12:52Z) - Open World DETR: Transformer based Open World Object Detection [60.64535309016623]
We propose a two-stage training approach named Open World DETR for open world object detection based on Deformable DETR.
We fine-tune the class-specific components of the model with a multi-view self-labeling strategy and a consistency constraint.
Our proposed method outperforms other state-of-the-art open world object detection methods by a large margin.
arXiv Detail & Related papers (2022-12-06T13:39:30Z) - Robust Region Feature Synthesizer for Zero-Shot Object Detection [87.79902339984142]
We build a novel zero-shot object detection framework that contains an Intra-class Semantic Diverging component and an Inter-class Structure Preserving component.
It is the first study to carry out zero-shot object detection in remote sensing imagery.
arXiv Detail & Related papers (2022-01-01T03:09:15Z) - OW-DETR: Open-world Detection Transformer [90.56239673123804]
We introduce a novel end-to-end transformer-based framework, OW-DETR, for open-world object detection.
OW-DETR comprises three dedicated components namely, attention-driven pseudo-labeling, novelty classification and objectness scoring.
Our model outperforms the recently introduced OWOD approach, ORE, with absolute gains ranging from 1.8% to 3.3% in terms of unknown recall.
arXiv Detail & Related papers (2021-12-02T18:58:30Z) - Cascade Attentive Dropout for Weakly Supervised Object Detection [7.697578661762592]
Weakly supervised object detection (WSOD) aims to classify and locate objects with only image-level supervision.
Many WSOD approaches adopt multiple instance learning as the initial model, which is prone to converge to the most discriminative object regions.
We propose a novel cascade attentive dropout strategy to alleviate the part domination problem, together with an improved global context module.
arXiv Detail & Related papers (2020-11-20T08:08:13Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.