Overcoming Classifier Imbalance for Long-tail Object Detection with
Balanced Group Softmax
- URL: http://arxiv.org/abs/2006.10408v1
- Date: Thu, 18 Jun 2020 10:24:26 GMT
- Title: Overcoming Classifier Imbalance for Long-tail Object Detection with
Balanced Group Softmax
- Authors: Yu Li, Tao Wang, Bingyi Kang, Sheng Tang, Chunfeng Wang, Jintao Li,
Jiashi Feng
- Abstract summary: We provide the first systematic analysis on the underperformance of state-of-the-art models in front of long-tail distribution.
We propose a novel balanced group softmax (BAGS) module for balancing the classifiers within the detection frameworks through group-wise training.
Extensive experiments on the very recent long-tail large vocabulary object recognition benchmark LVIS show that our proposed BAGS significantly improves the performance of detectors.
- Score: 88.11979569564427
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Solving long-tail large vocabulary object detection with deep learning based
models is a challenging and demanding task, which is however under-explored.In
this work, we provide the first systematic analysis on the underperformance of
state-of-the-art models in front of long-tail distribution. We find existing
detection methods are unable to model few-shot classes when the dataset is
extremely skewed, which can result in classifier imbalance in terms of
parameter magnitude. Directly adapting long-tail classification models to
detection frameworks can not solve this problem due to the intrinsic difference
between detection and classification.In this work, we propose a novel balanced
group softmax (BAGS) module for balancing the classifiers within the detection
frameworks through group-wise training. It implicitly modulates the training
process for the head and tail classes and ensures they are both sufficiently
trained, without requiring any extra sampling for the instances from the tail
classes.Extensive experiments on the very recent long-tail large vocabulary
object recognition benchmark LVIS show that our proposed BAGS significantly
improves the performance of detectors with various backbones and frameworks on
both object detection and instance segmentation. It beats all state-of-the-art
methods transferred from long-tail image classification and establishes new
state-of-the-art.Code is available at
https://github.com/FishYuLi/BalancedGroupSoftmax.
Related papers
- Investigating Self-Supervised Methods for Label-Efficient Learning [27.029542823306866]
We study different self supervised pretext tasks, namely contrastive learning, clustering, and masked image modelling for their low-shot capabilities.
We introduce a framework involving both mask image modelling and clustering as pretext tasks, which performs better across all low-shot downstream tasks.
When testing the model on full scale datasets, we show performance gains in multi-class classification, multi-label classification and semantic segmentation.
arXiv Detail & Related papers (2024-06-25T10:56:03Z) - On Model Calibration for Long-Tailed Object Detection and Instance
Segmentation [56.82077636126353]
We propose NorCal, Normalized for long-tailed object detection and instance segmentation.
We show that separately handling the background class and normalizing the scores over classes for each proposal are keys to achieving superior performance.
arXiv Detail & Related papers (2021-07-05T17:57:20Z) - Distribution Alignment: A Unified Framework for Long-tail Visual
Recognition [52.36728157779307]
We propose a unified distribution alignment strategy for long-tail visual recognition.
We then introduce a generalized re-weight method in the two-stage learning to balance the class prior.
Our approach achieves the state-of-the-art results across all four recognition tasks with a simple and unified framework.
arXiv Detail & Related papers (2021-03-30T14:09:53Z) - The Devil is the Classifier: Investigating Long Tail Relation
Classification with Decoupling Analysis [36.298869931803836]
Long-tailed relation classification is a challenging problem as the head classes may dominate the training phase.
We propose a robust classifier with attentive relation routing, which assigns soft weights by automatically aggregating the relations.
arXiv Detail & Related papers (2020-09-15T12:47:00Z) - The Devil is in Classification: A Simple Framework for Long-tail Object
Detection and Instance Segmentation [93.17367076148348]
We investigate performance drop of the state-of-the-art two-stage instance segmentation model Mask R-CNN on the recent long-tail LVIS dataset.
We unveil that a major cause is the inaccurate classification of object proposals.
We propose a simple calibration framework to more effectively alleviate classification head bias with a bi-level class balanced sampling approach.
arXiv Detail & Related papers (2020-07-23T12:49:07Z) - UniT: Unified Knowledge Transfer for Any-shot Object Detection and
Segmentation [52.487469544343305]
Methods for object detection and segmentation rely on large scale instance-level annotations for training.
We propose an intuitive and unified semi-supervised model that is applicable to a range of supervision.
arXiv Detail & Related papers (2020-06-12T22:45:47Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.