Meta Guided Metric Learner for Overcoming Class Confusion in Few-Shot
Road Object Detection
- URL: http://arxiv.org/abs/2110.15074v1
- Date: Thu, 28 Oct 2021 12:51:08 GMT
- Title: Meta Guided Metric Learner for Overcoming Class Confusion in Few-Shot
Road Object Detection
- Authors: Anay Majee and Anbumani Subramanian and Kshitij Agrawal
- Abstract summary: Few-Shot Object Detection techniques extend the knowledge from existing base object classes to learn novel road objects.
We introduce a novel Meta Guided Metric (MGML) to overcome class confusion in FSOD.
Our method outperforms State-of-the-Art (SoTA) approaches in FSOD on the India Driving Learner dataset.
- Score: 0.3867363075280544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Localization and recognition of less-occurring road objects have been a
challenge in autonomous driving applications due to the scarcity of data
samples. Few-Shot Object Detection techniques extend the knowledge from
existing base object classes to learn novel road objects given few training
examples. Popular techniques in FSOD adopt either meta or metric learning
techniques which are prone to class confusion and base class forgetting. In
this work, we introduce a novel Meta Guided Metric Learner (MGML) to overcome
class confusion in FSOD. We re-weight the features of the novel classes higher
than the base classes through a novel Squeeze and Excite module and encourage
the learning of truly discriminative class-specific features by applying an
Orthogonality Constraint to the meta learner. Our method outperforms
State-of-the-Art (SoTA) approaches in FSOD on the India Driving Dataset (IDD)
by upto 11 mAP points while suffering from the least class confusion of 20%
given only 10 examples of each novel road object. We further show similar
improvements on the few-shot splits of PASCAL VOC dataset where we outperform
SoTA approaches by upto 5.8 mAP accross all splits.
Related papers
- Semantic Enhanced Few-shot Object Detection [37.715912401900745]
We propose a fine-tuning based FSOD framework that utilizes semantic embeddings for better detection.
Our method allows each novel class to construct a compact feature space without being confused with similar base classes.
arXiv Detail & Related papers (2024-06-19T12:40:55Z) - Enhancing Visual Continual Learning with Language-Guided Supervision [76.38481740848434]
Continual learning aims to empower models to learn new tasks without forgetting previously acquired knowledge.
We argue that the scarce semantic information conveyed by the one-hot labels hampers the effective knowledge transfer across tasks.
Specifically, we use PLMs to generate semantic targets for each class, which are frozen and serve as supervision signals.
arXiv Detail & Related papers (2024-03-24T12:41:58Z) - Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization [63.66349334291372]
We propose a framework with Meta prompt and Instance Contrastive learning (MIC) schemes.
Firstly, we simulate a novel-class-emerging scenario to help the prompt that learns class and background prompts generalize to novel classes.
Secondly, we design an instance-level contrastive strategy to promote intra-class compactness and inter-class separation, which benefits generalization of the detector to novel class objects.
arXiv Detail & Related papers (2024-03-14T14:25:10Z) - MetaGCD: Learning to Continually Learn in Generalized Category Discovery [26.732455383707798]
We consider a real-world scenario where a model that is trained on pre-defined classes continually encounters unlabeled data.
The goal is to continually discover novel classes while maintaining the performance in known classes.
We propose an approach, called MetaGCD, to learn how to incrementally discover with less forgetting.
arXiv Detail & Related papers (2023-08-21T22:16:49Z) - Novel Class Discovery without Forgetting [72.52222295216062]
We identify and formulate a new, pragmatic problem setting of NCDwF: Novel Class Discovery without Forgetting.
We propose a machine learning model to incrementally discover novel categories of instances from unlabeled data.
We introduce experimental protocols based on CIFAR-10, CIFAR-100 and ImageNet-1000 to measure the trade-off between knowledge retention and novel class discovery.
arXiv Detail & Related papers (2022-07-21T17:54:36Z) - Class-incremental Novel Class Discovery [76.35226130521758]
We study the new task of class-incremental Novel Class Discovery (class-iNCD)
We propose a novel approach for class-iNCD which prevents forgetting of past information about the base classes.
Our experiments, conducted on three common benchmarks, demonstrate that our method significantly outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2022-07-18T13:49:27Z) - Few-Shot Object Detection via Association and DIscrimination [83.8472428718097]
Few-shot object detection via Association and DIscrimination builds up a discriminative feature space for each novel class with two integral steps.
Experiments on Pascal VOC and MS-COCO datasets demonstrate FADI achieves new SOTA performance, significantly improving the baseline in any shot/split by +18.7.
arXiv Detail & Related papers (2021-11-23T05:04:06Z) - Attention Guided Cosine Margin For Overcoming Class-Imbalance in
Few-Shot Road Object Detection [5.821060995749903]
Few-shot object detection (FSOD) localizes and classifies objects in an image given only a few data samples.
Recent trends in FSOD research show the adoption of metric and meta-learning techniques, which are prone to catastrophic forgetting and class confusion.
We introduce Attention Guided Cosine Margin (AGCM) that facilitates the creation of tighter and well separated class-specific feature clusters.
arXiv Detail & Related papers (2021-11-12T10:11:56Z) - Incremental Object Detection via Meta-Learning [77.55310507917012]
We propose a meta-learning approach that learns to reshape model gradients, such that information across incremental tasks is optimally shared.
In comparison to existing meta-learning methods, our approach is task-agnostic, allows incremental addition of new-classes and scales to high-capacity models for object detection.
arXiv Detail & Related papers (2020-03-17T13:40:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.