CFA: Constraint-based Finetuning Approach for Generalized Few-Shot
Object Detection
- URL: http://arxiv.org/abs/2204.05220v1
- Date: Mon, 11 Apr 2022 16:04:54 GMT
- Title: CFA: Constraint-based Finetuning Approach for Generalized Few-Shot
Object Detection
- Authors: Karim Guirguis, Ahmed Hendawy, George Eskandar, Mohamed Abdelsamad,
Matthias Kayser, Juergen Beyerer
- Abstract summary: Generalized few-shot object detection (G-FSOD) aims to tackle FSOD without forgetting previously seen base classes.
CFA adapts a continual learning method, namely Average Gradient Episodic Memory (A-GEM) to G-FSOD.
Our method outperforms current FSOD and G-FSOD approaches on the novel task with minor degeneration on the base task.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Few-shot object detection (FSOD) seeks to detect novel categories with
limited data by leveraging prior knowledge from abundant base data. Generalized
few-shot object detection (G-FSOD) aims to tackle FSOD without forgetting
previously seen base classes and, thus, accounts for a more realistic scenario,
where both classes are encountered during test time. While current FSOD methods
suffer from catastrophic forgetting, G-FSOD addresses this limitation yet
exhibits a performance drop on novel tasks compared to the state-of-the-art
FSOD. In this work, we propose a constraint-based finetuning approach (CFA) to
alleviate catastrophic forgetting, while achieving competitive results on the
novel task without increasing the model capacity. CFA adapts a continual
learning method, namely Average Gradient Episodic Memory (A-GEM) to G-FSOD.
Specifically, more constraints on the gradient search strategy are imposed from
which a new gradient update rule is derived, allowing for better knowledge
exchange between base and novel classes. To evaluate our method, we conduct
extensive experiments on MS-COCO and PASCAL-VOC datasets. Our method
outperforms current FSOD and G-FSOD approaches on the novel task with minor
degeneration on the base task. Moreover, CFA is orthogonal to FSOD approaches
and operates as a plug-and-play module without increasing the model capacity or
inference time.
Related papers
- High-Performance Few-Shot Segmentation with Foundation Models: An Empirical Study [64.06777376676513]
We develop a few-shot segmentation (FSS) framework based on foundation models.
To be specific, we propose a simple approach to extract implicit knowledge from foundation models to construct coarse correspondence.
Experiments on two widely used datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2024-09-10T08:04:11Z) - SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection [2.0755366440393743]
Confusion and forgetting of object classes have been challenges of prime interest in Few-Shot Object Detection (FSOD)
We introduce a novel Submodular Mutual Information Learning framework which adopts mutual information functions.
Our proposed approach generalizes to several existing approaches in FSOD, agnostic of the backbone architecture.
arXiv Detail & Related papers (2024-07-02T20:53:43Z) - DiffClass: Diffusion-Based Class Incremental Learning [30.514281721324853]
Class Incremental Learning (CIL) is challenging due to catastrophic forgetting.
Recent exemplar-free CIL methods attempt to mitigate catastrophic forgetting by synthesizing previous task data.
We propose a novel exemplar-free CIL method to overcome these issues.
arXiv Detail & Related papers (2024-03-08T03:34:18Z) - Activate and Reject: Towards Safe Domain Generalization under Category
Shift [71.95548187205736]
We study a practical problem of Domain Generalization under Category Shift (DGCS)
It aims to simultaneously detect unknown-class samples and classify known-class samples in the target domains.
Compared to prior DG works, we face two new challenges: 1) how to learn the concept of unknown'' during training with only source known-class samples, and 2) how to adapt the source-trained model to unseen environments.
arXiv Detail & Related papers (2023-10-07T07:53:12Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Improved Region Proposal Network for Enhanced Few-Shot Object Detection [23.871860648919593]
Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches.
We develop a semi-supervised algorithm to detect and then utilize unlabeled novel objects as positive samples during the FSOD training stage.
Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception of the object detection model for large objects.
arXiv Detail & Related papers (2023-08-15T02:35:59Z) - Fast Hierarchical Learning for Few-Shot Object Detection [57.024072600597464]
Transfer learning approaches have recently achieved promising results on the few-shot detection task.
These approaches suffer from catastrophic forgetting'' issue due to finetuning of base detector.
We tackle the aforementioned issues in this work.
arXiv Detail & Related papers (2022-10-10T20:31:19Z) - Bridging Non Co-occurrence with Unlabeled In-the-wild Data for
Incremental Object Detection [56.22467011292147]
Several incremental learning methods are proposed to mitigate catastrophic forgetting for object detection.
Despite the effectiveness, these methods require co-occurrence of the unlabeled base classes in the training data of the novel classes.
We propose the use of unlabeled in-the-wild data to bridge the non-occurrence caused by the missing base classes during the training of additional novel classes.
arXiv Detail & Related papers (2021-10-28T10:57:25Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.