Fugu-MT Paper Translation (Summary): Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax

Paper summary: Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax

arxiv url: http://arxiv.org/abs/2006.10408v1
Date: Thu, 18 Jun 2020 10:24:26 GMT
Status: Translation complete
Last updated in system: 2022-11-19 09:51:18.555943
Title: Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax
Title (reference translation): Overcoming Classifier Imbalance for Long-Tail Object Detection Using Balanced Group Softmax
Authors: Yu Li, Tao Wang, Bingyi Kang, Sheng Tang, Chunfeng Wang, Jintao Li, Jiashi Feng
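Abstract summary: This work presents the first systematic analysis of why state-of-the-art models underperform on long-tail distributions. It proposes a novel balanced group softmax (BAGS) module that balances the classifiers within detection frameworks through group-wise training. Extensive experiments on the recent long-tail, large-vocabulary object recognition benchmark LVIS show that the proposed BAGS significantly improves detector performance.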
Reference score (independently computed attention score): 88.11979569564427
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Solving long-tail large vocabulary object detection with deep learning based models is a challenging and demanding task, which is however under-explored. In this work, we provide the first systematic analysis on the underperformance of state-of-the-art models in front of long-tail distribution. We find existing detection methods are unable to model few-shot classes when the dataset is extremely skewed, which can result in classifier imbalance in terms of parameter magnitude. Directly adapting long-tail classification models to detection frameworks can not solve this problem due to the intrinsic difference between detection and classification. In this work, we propose a novel balanced group softmax (BAGS) module for balancing the classifiers within the detection frameworks through group-wise training. It implicitly modulates the training process for the head and tail classes and ensures they are both sufficiently trained, without requiring any extra sampling for the instances from the tail classes. Extensive experiments on the very recent long-tail large vocabulary object recognition benchmark LVIS show that our proposed BAGS significantly improves the performance of detectors with various backbones and frameworks on both object detection and instance segmentation. It beats all state-of-the-art methods transferred from long-tail image classification and establishes new state-of-the-art. Code is available at https://github.com/FishYuLi/BalancedGroupSoftmax.
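Abstract (reference translation): Long-tail, large-vocabulary object detection with deep-learning-based models is a difficult and under-explored task. This work presents the first systematic analysis of why state-of-the-art models underperform on long-tail distributions. Existing detection methods turn out to be unable to model few-shot classes when the dataset is extremely skewed, which can lead to classifier imbalance in terms of parameter magnitude. Because detection and classification differ intrinsically, directly applying long-tail classification models to detection frameworks does not solve this problem. The work proposes a novel balanced group softmax (BAGS) module that balances the classifiers within detection frameworks through group-wise training. It implicitly modulates the training process for head and tail classes and ensures that both are sufficiently trained, without requiring extra sampling of instances from the tail classes. Extensive experiments on the recent long-tail, large-vocabulary object recognition benchmark LVIS show that the proposed BAGS significantly improves detectors with various backbones and frameworks on both object detection and instance segmentation. It outperforms all state-of-the-art methods transferred from long-tail image classification and establishes a new state of the art. Code is available at https://github.com/FishYuLi/BalancedGroupSoftmax.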
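
The abstract describes group-wise training only at a high level. As a rough illustration of the idea, the sketch below bins classes by training-instance count and applies a separate softmax cross-entropy inside each bin. The bin edges, the per-group "others" slot, and the function names (`assign_groups`, `group_softmax_loss`) are all assumptions made for this example, not details taken from this page, and background handling is left out entirely. For the authors' actual implementation, see the repository linked in the abstract.

```python
import math


def assign_groups(instance_counts, bounds=(10, 100, 1000)):
    """Bin class indices by training-instance count.

    `bounds` are hypothetical bin edges chosen for illustration.
    Returns a dict mapping group id -> list of class indices.
    """
    groups = {}
    for cls, n in enumerate(instance_counts):
        gid = sum(n >= b for b in bounds)  # number of edges the count reaches
        groups.setdefault(gid, []).append(cls)
    return groups


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def group_softmax_loss(logits, others_logits, groups, target):
    """Sum of per-group cross-entropy losses for one sample.

    Each group gets an extra "others" slot (an assumption of this sketch).
    Inside the group that contains `target`, the label is the target class;
    in every other group, the label is that group's "others" slot.
    """
    loss = 0.0
    for gid, classes in groups.items():
        scores = [others_logits[gid]] + [logits[c] for c in classes]
        probs = softmax(scores)
        label = classes.index(target) + 1 if target in classes else 0
        loss -= math.log(probs[label])
    return loss


# Toy example: five classes with very different instance counts.
counts = [5, 50, 500, 5000, 8]
groups = assign_groups(counts)
logits = [0.0] * len(counts)
others = {gid: 0.0 for gid in groups}
loss = group_softmax_loss(logits, others, groups, target=4)
```

Because each softmax only normalizes over classes with similar frequency, a rare class competes with other rare classes rather than with frequent ones, which is one plausible reading of how group-wise training could balance head and tail classifiers without extra sampling.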


List of related papers