Few-Shot Object Detection via Variational Feature Aggregation
- URL: http://arxiv.org/abs/2301.13411v1
- Date: Tue, 31 Jan 2023 04:58:21 GMT
- Title: Few-Shot Object Detection via Variational Feature Aggregation
- Authors: Jiaming Han, Yuqiang Ren, Jian Ding, Ke Yan, Gui-Song Xia
- Abstract summary: We propose a meta-learning framework with two novel feature aggregation schemes.
We first present a Class-Agnostic Aggregation (CAA) method, where the query and support features can be aggregated regardless of their categories.
We then propose a Variational Feature Aggregation (VFA) method, which encodes support examples into class-level support features.
- Score: 32.34871873486389
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: As few-shot object detectors are often trained with abundant base samples and
fine-tuned on few-shot novel examples,the learned models are usually biased to
base classes and sensitive to the variance of novel examples. To address this
issue, we propose a meta-learning framework with two novel feature aggregation
schemes. More precisely, we first present a Class-Agnostic Aggregation (CAA)
method, where the query and support features can be aggregated regardless of
their categories. The interactions between different classes encourage
class-agnostic representations and reduce confusion between base and novel
classes. Based on the CAA, we then propose a Variational Feature Aggregation
(VFA) method, which encodes support examples into class-level support features
for robust feature aggregation. We use a variational autoencoder to estimate
class distributions and sample variational features from distributions that are
more robust to the variance of support examples. Besides, we decouple
classification and regression tasks so that VFA is performed on the
classification branch without affecting object localization. Extensive
experiments on PASCAL VOC and COCO demonstrate that our method significantly
outperforms a strong baseline (up to 16\%) and previous state-of-the-art
methods (4\% in average). Code will be available at:
\url{https://github.com/csuhan/VFA}
Related papers
- Semantic Enhanced Few-shot Object Detection [37.715912401900745]
We propose a fine-tuning based FSOD framework that utilizes semantic embeddings for better detection.
Our method allows each novel class to construct a compact feature space without being confused with similar base classes.
arXiv Detail & Related papers (2024-06-19T12:40:55Z) - Fine-Grained Prototypes Distillation for Few-Shot Object Detection [8.795211323408513]
Few-shot object detection (FSOD) aims at extending a generic detector for novel object detection with only a few training examples.
In general, methods based on meta-learning employ an additional support branch to encode novel examples into class prototypes.
New methods are required to capture the distinctive local context for more robust novel object detection.
arXiv Detail & Related papers (2024-01-15T12:12:48Z) - Intra-class Adaptive Augmentation with Neighbor Correction for Deep
Metric Learning [99.14132861655223]
We propose a novel intra-class adaptive augmentation (IAA) framework for deep metric learning.
We reasonably estimate intra-class variations for every class and generate adaptive synthetic samples to support hard samples mining.
Our method significantly improves and outperforms the state-of-the-art methods on retrieval performances by 3%-6%.
arXiv Detail & Related papers (2022-11-29T14:52:38Z) - PatchMix Augmentation to Identify Causal Features in Few-shot Learning [55.64873998196191]
Few-shot learning aims to transfer knowledge learned from base with sufficient categories labelled data to novel categories with scarce known information.
We propose a novel data augmentation strategy dubbed as PatchMix that can break this spurious dependency.
We show that such an augmentation mechanism, different from existing ones, is able to identify the causal features.
arXiv Detail & Related papers (2022-11-29T08:41:29Z) - Class-Incremental Learning with Strong Pre-trained Models [97.84755144148535]
Class-incremental learning (CIL) has been widely studied under the setting of starting from a small number of classes (base classes)
We explore an understudied real-world setting of CIL that starts with a strong model pre-trained on a large number of base classes.
Our proposed method is robust and generalizes to all analyzed CIL settings.
arXiv Detail & Related papers (2022-04-07T17:58:07Z) - CAD: Co-Adapting Discriminative Features for Improved Few-Shot
Classification [11.894289991529496]
Few-shot classification is a challenging problem that aims to learn a model that can adapt to unseen classes given a few labeled samples.
Recent approaches pre-train a feature extractor, and then fine-tune for episodic meta-learning.
We propose a strategy to cross-attend and re-weight discriminative features for few-shot classification.
arXiv Detail & Related papers (2022-03-25T06:14:51Z) - Exploring Category-correlated Feature for Few-shot Image Classification [27.13708881431794]
We present a simple yet effective feature rectification method by exploring the category correlation between novel and base classes as the prior knowledge.
The proposed approach consistently obtains considerable performance gains on three widely used benchmarks.
arXiv Detail & Related papers (2021-12-14T08:25:24Z) - Few-Shot Object Detection via Association and DIscrimination [83.8472428718097]
Few-shot object detection via Association and DIscrimination builds up a discriminative feature space for each novel class with two integral steps.
Experiments on Pascal VOC and MS-COCO datasets demonstrate FADI achieves new SOTA performance, significantly improving the baseline in any shot/split by +18.7.
arXiv Detail & Related papers (2021-11-23T05:04:06Z) - Revisiting Deep Local Descriptor for Improved Few-Shot Classification [56.74552164206737]
We show how one can improve the quality of embeddings by leveraging textbfDense textbfClassification and textbfAttentive textbfPooling.
We suggest to pool feature maps by applying attentive pooling instead of the widely used global average pooling (GAP) to prepare embeddings for few-shot classification.
arXiv Detail & Related papers (2021-03-30T00:48:28Z) - UniT: Unified Knowledge Transfer for Any-shot Object Detection and
Segmentation [52.487469544343305]
Methods for object detection and segmentation rely on large scale instance-level annotations for training.
We propose an intuitive and unified semi-supervised model that is applicable to a range of supervision.
arXiv Detail & Related papers (2020-06-12T22:45:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.