Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with
Attentive Feature Alignment
- URL: http://arxiv.org/abs/2104.07719v1
- Date: Thu, 15 Apr 2021 19:01:27 GMT
- Title: Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with
Attentive Feature Alignment
- Authors: Guangxing Han, Shiyuan Huang, Jiawei Ma, Yicheng He, Shih-Fu Chang
- Abstract summary: Few-shot object detection (FSOD) aims to detect objects using only few examples.
We propose a meta-learning based few-shot object detection method by transferring meta-knowledge learned from data-abundant base classes to data-scarce novel classes.
- Score: 33.446875089255876
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Few-shot object detection (FSOD) aims to detect objects using only few
examples. It's critically needed for many practical applications but so far
remains challenging. We propose a meta-learning based few-shot object detection
method by transferring meta-knowledge learned from data-abundant base classes
to data-scarce novel classes. Our method incorporates a coarse-to-fine approach
into the proposal based object detection framework and integrates prototype
based classifiers into both the proposal generation and classification stages.
To improve proposal generation for few-shot novel classes, we propose to learn
a lightweight matching network to measure the similarity between each spatial
position in the query image feature map and spatially-pooled class features,
instead of the traditional object/nonobject classifier, thus generating
category-specific proposals and improving proposal recall for novel classes. To
address the spatial misalignment between generated proposals and few-shot class
examples, we propose a novel attentive feature alignment method, thus improving
the performance of few-shot object detection. Meanwhile we jointly learn a
Faster R-CNN detection head for base classes. Extensive experiments conducted
on multiple FSOD benchmarks show our proposed approach achieves state of the
art results under (incremental) few-shot learning settings.
Related papers
- Frustratingly Simple but Effective Zero-shot Detection and Segmentation:
Analysis and a Strong Baseline [45.03824571286718]
Methods for object detection and segmentation often require abundant instance-level annotations for training, which are time-consuming and expensive to collect.
To address this, the task of zero-shot object detection (or segmentation) aims at learning effective methods for identifying and localizing object instances for the categories that have no supervision available.
In this work, we extensively study these design choices, and carefully construct a simple yet extremely effective zero-shot recognition method.
arXiv Detail & Related papers (2023-02-14T20:00:30Z) - Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation
Exploitation [100.87407396364137]
We design Meta-DETR, which (i) is the first image-level few-shot detector, and (ii) introduces a novel inter-class correlational meta-learning strategy.
Experiments over multiple few-shot object detection benchmarks show that the proposed Meta-DETR outperforms state-of-the-art methods by large margins.
arXiv Detail & Related papers (2022-07-30T13:46:07Z) - Incremental-DETR: Incremental Few-Shot Object Detection via
Self-Supervised Learning [60.64535309016623]
We propose the Incremental-DETR that does incremental few-shot object detection via fine-tuning and self-supervised learning on the DETR object detector.
To alleviate severe over-fitting with few novel class data, we first fine-tune the class-specific components of DETR with self-supervision.
We further introduce a incremental few-shot fine-tuning strategy with knowledge distillation on the class-specific components of DETR to encourage the network in detecting novel classes without catastrophic forgetting.
arXiv Detail & Related papers (2022-05-09T05:08:08Z) - Query Adaptive Few-Shot Object Detection with Heterogeneous Graph
Convolutional Networks [33.446875089255876]
Few-shot object detection (FSOD) aims to detect never-seen objects using few examples.
We propose a novel FSOD model using heterogeneous graph convolutional networks.
arXiv Detail & Related papers (2021-12-17T22:08:15Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - Discovery-and-Selection: Towards Optimal Multiple Instance Learning for
Weakly Supervised Object Detection [86.86602297364826]
We propose a discoveryand-selection approach fused with multiple instance learning (DS-MIL)
Our proposed DS-MIL approach can consistently improve the baselines, reporting state-of-the-art performance.
arXiv Detail & Related papers (2021-10-18T07:06:57Z) - Experience feedback using Representation Learning for Few-Shot Object
Detection on Aerial Images [2.8560476609689185]
The performance of our method is assessed on DOTA, a large-scale remote sensing images dataset.
It highlights in particular some intrinsic weaknesses for the few-shot object detection task.
arXiv Detail & Related papers (2021-09-27T13:04:53Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.