Universal-Prototype Augmentation for Few-Shot Object Detection
- URL: http://arxiv.org/abs/2103.01077v1
- Date: Mon, 1 Mar 2021 15:35:36 GMT
- Title: Universal-Prototype Augmentation for Few-Shot Object Detection
- Authors: Aming Wu, Yahong Han, Linchao Zhu, Yi Yang, Cheng Deng
- Abstract summary: Few-shot object detection (FSOD) aims to strengthen the performance of novel object detection with few labeled samples.
To alleviate the constraint of few samples, enhancing the generalization ability of learned features for novel objects plays a key role.
We propose a new prototype, namely universal prototype, that is learned from all object categories.
- Score: 128.4592084104352
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Few-shot object detection (FSOD) aims to strengthen the performance of novel
object detection with few labeled samples. To alleviate the constraint of few
samples, enhancing the generalization ability of learned features for novel
objects plays a key role. Thus, the feature learning process of FSOD should
focus more on intrinsical object characteristics, which are invariant under
different visual changes and therefore are helpful for feature generalization.
Unlike previous attempts of the meta-learning paradigm, in this paper, we
explore how to smooth object features with intrinsical characteristics that are
universal across different object categories. We propose a new prototype,
namely universal prototype, that is learned from all object categories. Besides
the advantage of characterizing invariant characteristics, the universal
prototypes alleviate the impact of unbalanced object categories. After
augmenting object features with the universal prototypes, we impose a
consistency loss to maximize the agreement between the augmented features and
the original one, which is beneficial for learning invariant object
characteristics. Thus, we develop a new framework of few-shot object detection
with universal prototypes (${FSOD}^{up}$) that owns the merit of feature
generalization towards novel objects. Experimental results on PASCAL VOC and MS
COCO demonstrate the effectiveness of ${FSOD}^{up}$. Particularly, for the
1-shot case of VOC Split2, ${FSOD}^{up}$ outperforms the baseline by 6.8\% in
terms of mAP. Moreover, we further verify ${FSOD}^{up}$ on a long-tail
detection dataset, i.e., LVIS. And employing ${FSOD}^{up}$ outperforms the
state-of-the-art method.
Related papers
- Negative Prototypes Guided Contrastive Learning for WSOD [8.102080369924911]
Weakly Supervised Object Detection (WSOD) with only image-level annotation has recently attracted wide attention.
We propose the Negative Prototypes Guided Contrastive learning architecture.
Our proposed method achieves the state-of-the-art performance.
arXiv Detail & Related papers (2024-06-04T08:16:26Z) - Fine-Grained Prototypes Distillation for Few-Shot Object Detection [8.795211323408513]
Few-shot object detection (FSOD) aims at extending a generic detector for novel object detection with only a few training examples.
In general, methods based on meta-learning employ an additional support branch to encode novel examples into class prototypes.
New methods are required to capture the distinctive local context for more robust novel object detection.
arXiv Detail & Related papers (2024-01-15T12:12:48Z) - Reflection Invariance Learning for Few-shot Semantic Segmentation [53.20466630330429]
Few-shot semantic segmentation (FSS) aims to segment objects of unseen classes in query images with only a few annotated support images.
This paper proposes a fresh few-shot segmentation framework to mine the reflection invariance in a multi-view matching manner.
Experiments on both PASCAL-$5textiti$ and COCO-$20textiti$ datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-06-01T15:14:58Z) - Robust Region Feature Synthesizer for Zero-Shot Object Detection [87.79902339984142]
We build a novel zero-shot object detection framework that contains an Intra-class Semantic Diverging component and an Inter-class Structure Preserving component.
It is the first study to carry out zero-shot object detection in remote sensing imagery.
arXiv Detail & Related papers (2022-01-01T03:09:15Z) - Dual Prototypical Contrastive Learning for Few-shot Semantic
Segmentation [55.339405417090084]
We propose a dual prototypical contrastive learning approach tailored to the few-shot semantic segmentation (FSS) task.
The main idea is to encourage the prototypes more discriminative by increasing inter-class distance while reducing intra-class distance in prototype feature space.
We demonstrate that the proposed dual contrastive learning approach outperforms state-of-the-art FSS methods on PASCAL-5i and COCO-20i datasets.
arXiv Detail & Related papers (2021-11-09T08:14:50Z) - Plug and Play, Model-Based Reinforcement Learning [60.813074750879615]
We introduce an object-based representation that allows zero-shot integration of new objects from known object classes.
This is achieved by representing the global transition dynamics as a union of local transition functions.
Experiments show that our representation can achieve sample-efficiency in a variety of set-ups.
arXiv Detail & Related papers (2021-08-20T01:20:15Z) - FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding [14.896822373116729]
We present Few-Shot object detection via Contrastive proposals (FSCE)
FSCE is a simple yet effective approach to learning contrastive-aware object encodings that facilitate the classification of detected objects.
Our design outperforms current state-of-the-art works in any shot and all data, with up to +8.8% on standard benchmark PASCAL VOC and +2.7% on challenging benchmark.
arXiv Detail & Related papers (2021-03-10T09:15:05Z) - Fine-Grained Dynamic Head for Object Detection [68.70628757217939]
We propose a fine-grained dynamic head to conditionally select a pixel-level combination of FPN features from different scales for each instance.
Experiments demonstrate the effectiveness and efficiency of the proposed method on several state-of-the-art detection benchmarks.
arXiv Detail & Related papers (2020-12-07T08:16:32Z) - Towards Better Object Detection in Scale Variation with Adaptive Feature
Selection [3.5352273012717044]
We propose a novel adaptive feature selection module (AFSM) to automatically learn the way to fuse multi-level representations in the channel dimension.
It significantly improves the performance of the detectors that have a feature pyramid structure.
A class-aware sampling mechanism (CASM) is proposed to tackle the class imbalance problem.
arXiv Detail & Related papers (2020-12-06T13:41:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.