Towards Generalized Few-Shot Open-Set Object Detection
- URL: http://arxiv.org/abs/2210.15996v3
- Date: Thu, 22 Feb 2024 01:58:15 GMT
- Title: Towards Generalized Few-Shot Open-Set Object Detection
- Authors: Binyi Su, Hua Zhang, Jingzhi Li, Zhong Zhou
- Abstract summary: Open-set object detection (OSOD) aims to detect the known categories and reject unknown objects in a dynamic world.
In this paper, we seek a solution for the generalized few-shot open-set object detection (G-FOOD)
We propose a new G-FOOD algorithm to tackle this issue, named underlineFew-shunderlineOt underlineOpen-set underlineDetector (FOOD)
- Score: 13.671120689841736
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Open-set object detection (OSOD) aims to detect the known categories and
reject unknown objects in a dynamic world, which has achieved significant
attention. However, previous approaches only consider this problem in
data-abundant conditions, while neglecting the few-shot scenes. In this paper,
we seek a solution for the generalized few-shot open-set object detection
(G-FOOD), which aims to avoid detecting unknown classes as known classes with a
high confidence score while maintaining the performance of few-shot detection.
The main challenge for this task is that few training samples induce the model
to overfit on the known classes, resulting in a poor open-set performance. We
propose a new G-FOOD algorithm to tackle this issue, named
\underline{F}ew-sh\underline{O}t \underline{O}pen-set \underline{D}etector
(FOOD), which contains a novel class weight sparsification classifier (CWSC)
and a novel unknown decoupling learner (UDL). To prevent over-fitting, CWSC
randomly sparses parts of the normalized weights for the logit prediction of
all classes, and then decreases the co-adaptability between the class and its
neighbors. Alongside, UDL decouples training the unknown class and enables the
model to form a compact unknown decision boundary. Thus, the unknown objects
can be identified with a confidence probability without any threshold,
prototype, or generation. We compare our method with several state-of-the-art
OSOD methods in few-shot scenes and observe that our method improves the
F-score of unknown classes by 4.80\%-9.08\% across all shots in VOC-COCO
dataset settings \footnote[1]{The source code is available at
\url{https://github.com/binyisu/food}}.
Related papers
- Semi-supervised Open-World Object Detection [74.95267079505145]
We introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD)
We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting.
Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2024-02-25T07:12:51Z) - Activate and Reject: Towards Safe Domain Generalization under Category
Shift [71.95548187205736]
We study a practical problem of Domain Generalization under Category Shift (DGCS)
It aims to simultaneously detect unknown-class samples and classify known-class samples in the target domains.
Compared to prior DG works, we face two new challenges: 1) how to learn the concept of unknown'' during training with only source known-class samples, and 2) how to adapt the source-trained model to unseen environments.
arXiv Detail & Related papers (2023-10-07T07:53:12Z) - USD: Unknown Sensitive Detector Empowered by Decoupled Objectness and
Segment Anything Model [14.080744645704751]
Open World Object Detection (OWOD) is a novel and challenging computer vision task.
We propose a simple yet effective learning strategy, namely Decoupled Objectness Learning (DOL), which divides the learning of these two boundaries into decoder layers.
We also introduce an Auxiliary Supervision Framework (ASF) that uses a pseudo-labeling and a soft-weighting strategies to alleviate the negative impact of noise.
arXiv Detail & Related papers (2023-06-04T06:42:09Z) - Open World DETR: Transformer based Open World Object Detection [60.64535309016623]
We propose a two-stage training approach named Open World DETR for open world object detection based on Deformable DETR.
We fine-tune the class-specific components of the model with a multi-view self-labeling strategy and a consistency constraint.
Our proposed method outperforms other state-of-the-art open world object detection methods by a large margin.
arXiv Detail & Related papers (2022-12-06T13:39:30Z) - Detecting the unknown in Object Detection [20.84221126313118]
We propose a novel training strategy, called UNKAD, able to predict unknown objects without requiring any annotation.
UNKAD first identifies and pseudo-labels unknown objects and then uses the pseudo-annotations to train an additional unknown class.
While UNKAD can directly detect unknown objects, we further combine it with previous unknown detection techniques, showing that it improves their performance at no costs.
arXiv Detail & Related papers (2022-08-24T16:27:38Z) - Expanding Low-Density Latent Regions for Open-Set Object Detection [32.29988419525467]
We propose to identify unknown objects by separating high/low-density regions in the latent space.
We present a novel Open-set Detector (OpenDet) with expanded low-density regions.
arXiv Detail & Related papers (2022-03-28T17:11:09Z) - Few-Shot Open-Set Recognition using Meta-Learning [72.15940446408824]
The problem of open-set recognition is considered.
A new oPen sEt mEta LEaRning (PEELER) algorithm is introduced.
arXiv Detail & Related papers (2020-05-27T23:49:26Z) - Any-Shot Object Detection [81.88153407655334]
'Any-shot detection' is where totally unseen and few-shot categories can simultaneously co-occur during inference.
We propose a unified any-shot detection model, that can concurrently learn to detect both zero-shot and few-shot object classes.
Our framework can also be used solely for Zero-shot detection and Few-shot detection tasks.
arXiv Detail & Related papers (2020-03-16T03:43:15Z) - Frustratingly Simple Few-Shot Object Detection [98.42824677627581]
We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task.
Such a simple approach outperforms the meta-learning methods by roughly 220 points on current benchmarks.
arXiv Detail & Related papers (2020-03-16T00:29:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.