Meta-DETR: Few-Shot Object Detection via Unified Image-Level
Meta-Learning
- URL: http://arxiv.org/abs/2103.11731v2
- Date: Thu, 25 Mar 2021 04:54:43 GMT
- Title: Meta-DETR: Few-Shot Object Detection via Unified Image-Level
Meta-Learning
- Authors: Gongjie Zhang, Zhipeng Luo, Kaiwen Cui, Shijian Lu
- Abstract summary: Few-shot object detection aims at detecting novel objects with only a few annotated examples.
This paper presents a novel meta-detector framework, namely Meta-DETR, which eliminates region-wise prediction.
It instead meta-learns object localization and classification at image level in a unified and complementary manner.
- Score: 39.50529982746885
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Few-shot object detection aims at detecting novel objects with only a few
annotated examples. Prior works have proved meta-learning a promising solution,
and most of them essentially address detection by meta-learning over regions
for their classification and location fine-tuning. However, these methods
substantially rely on initially well-located region proposals, which are
usually hard to obtain under the few-shot settings. This paper presents a novel
meta-detector framework, namely Meta-DETR, which eliminates region-wise
prediction and instead meta-learns object localization and classification at
image level in a unified and complementary manner. Specifically, it first
encodes both support and query images into category-specific features and then
feeds them into a category-agnostic decoder to directly generate predictions
for specific categories. To facilitate meta-learning with deep networks, we
design a simple but effective Semantic Alignment Mechanism (SAM), which aligns
high-level and low-level feature semantics to improve the generalization of
meta-learned representations. Experiments over multiple few-shot object
detection benchmarks show that Meta-DETR outperforms state-of-the-art methods
by large margins.
Related papers
- Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification [1.6385815610837167]
Recent CLIP-based methods have shown promising zero-shot and few-shot performance on image classification tasks.
We propose the Meta-Feature Adaption method (MF-Adapter) that combines the complementary strengths of both LRs and high-level semantic representations.
Our proposed method is superior to the state-of-the-art CLIP downstream few-shot classification methods, even showing stronger performance on a set of challenging visual classification tasks.
arXiv Detail & Related papers (2024-07-08T06:18:04Z) - Meta-ZSDETR: Zero-shot DETR with Meta-learning [29.58827207505671]
We present the first method that combines DETR and meta-learning to perform zero-shot object detection, named Meta-ZSDETR.
The model is optimized with meta-contrastive learning, which contains a regression head to generate the coordinates of class-specific boxes.
Experimental results show that our method outperforms the existing ZSD methods by a large margin.
arXiv Detail & Related papers (2023-08-18T13:17:07Z) - CLIP the Gap: A Single Domain Generalization Approach for Object
Detection [60.20931827772482]
Single Domain Generalization tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain.
We propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts.
We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss.
arXiv Detail & Related papers (2023-01-13T12:01:18Z) - Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation
Exploitation [100.87407396364137]
We design Meta-DETR, which (i) is the first image-level few-shot detector, and (ii) introduces a novel inter-class correlational meta-learning strategy.
Experiments over multiple few-shot object detection benchmarks show that the proposed Meta-DETR outperforms state-of-the-art methods by large margins.
arXiv Detail & Related papers (2022-07-30T13:46:07Z) - Incremental-DETR: Incremental Few-Shot Object Detection via
Self-Supervised Learning [60.64535309016623]
We propose the Incremental-DETR that does incremental few-shot object detection via fine-tuning and self-supervised learning on the DETR object detector.
To alleviate severe over-fitting with few novel class data, we first fine-tune the class-specific components of DETR with self-supervision.
We further introduce a incremental few-shot fine-tuning strategy with knowledge distillation on the class-specific components of DETR to encourage the network in detecting novel classes without catastrophic forgetting.
arXiv Detail & Related papers (2022-05-09T05:08:08Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with
Attentive Feature Alignment [33.446875089255876]
Few-shot object detection (FSOD) aims to detect objects using only few examples.
We propose a meta-learning based few-shot object detection method by transferring meta-knowledge learned from data-abundant base classes to data-scarce novel classes.
arXiv Detail & Related papers (2021-04-15T19:01:27Z) - CAFENet: Class-Agnostic Few-Shot Edge Detection Network [19.01453512012934]
We tackle a novel few-shot learning challenge, which we call few-shot semantic edge detection.
We also present a Class-Agnostic Few-shot Edge detection Network (CAFENet) based on meta-learning strategy.
arXiv Detail & Related papers (2020-03-18T14:18:59Z) - Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning [79.25478727351604]
We explore a simple process: meta-learning over a whole-classification pre-trained model on its evaluation metric.
We observe this simple method achieves competitive performance to state-of-the-art methods on standard benchmarks.
arXiv Detail & Related papers (2020-03-09T20:06:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.