Fugu-MT 論文翻訳(概要): A systematic study of the foreground-background imbalance problem in deep learning for object detection

論文の概要: A systematic study of the foreground-background imbalance problem in deep learning for object detection

arxiv url: http://arxiv.org/abs/2306.16539v1
Date: Wed, 28 Jun 2023 20:27:11 GMT
ステータス: 翻訳完了
システム内更新日: 2023-06-30 15:46:53.677664
Title: A systematic study of the foreground-background imbalance problem in deep learning for object detection
Title（参考訳）: 物体検出のための深層学習における前景-背景不均衡問題の体系的研究
Authors: Hanxue Gu, Haoyu Dong, Nicholas Konz, Maciej A. Mazurowski
Abstract要約: 対象物検出におけるF-B不均衡問題の包括的解析と実験を行った。 F-B不均衡の異なる側面が検出性能に及ぼす影響を実験的に検討した。 F-Bの不均衡は検出性能を著しく低下させる可能性があると結論付けている。
参考スコア（独自算出の注目度）: 2.806890214136407
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The class imbalance problem in deep learning has been explored in several studies, but there has yet to be a systematic analysis of this phenomenon in object detection. Here, we present comprehensive analyses and experiments of the foreground-background (F-B) imbalance problem in object detection, which is very common and caused by small, infrequent objects of interest. We experimentally study the effects of different aspects of F-B imbalance (object size, number of objects, dataset size, object type) on detection performance. In addition, we also compare 9 leading methods for addressing this problem, including Faster-RCNN, SSD, OHEM, Libra-RCNN, Focal-Loss, GHM, PISA, YOLO-v3, and GFL with a range of datasets from different imaging domains. We conclude that (1) the F-B imbalance can indeed cause a significant drop in detection performance, (2) The detection performance is more affected by F-B imbalance when fewer training data are available, (3) in most cases, decreasing object size leads to larger performance drop than decreasing number of objects, given the same change in the ratio of object pixels to non-object pixels, (6) among all selected methods, Libra-RCNN and PISA demonstrate the best performance in addressing the issue of F-B imbalance. (7) When the training dataset size is large, the choice of method is not impactful (8) Soft-sampling methods, including focal-loss, GHM, and GFL, perform fairly well on average but are relatively unstable.
Abstract（参考訳）: 深層学習におけるクラス不均衡問題は、いくつかの研究で研究されているが、物体検出におけるこの現象の体系的な解析はまだ行われていない。本稿では,対象検出におけるフォアグラウンドバックグラウンド(f-b)不均衡問題の包括的解析と実験を行う。 F-B不均衡(オブジェクトサイズ,オブジェクト数,データセットサイズ,オブジェクトタイプ)の異なる側面が検出性能に及ぼす影響を実験的に検討した。さらに,Faster-RCNN,SSD,OHEM,Libra-RCNN,Focal-Loss,GHM,PISA,YOLO-v3,GFLの9つの主要な手法を,異なる画像領域のデータセットで比較した。 We conclude that (1) the F-B imbalance can indeed cause a significant drop in detection performance, (2) The detection performance is more affected by F-B imbalance when fewer training data are available, (3) in most cases, decreasing object size leads to larger performance drop than decreasing number of objects, given the same change in the ratio of object pixels to non-object pixels, (6) among all selected methods, Libra-RCNN and PISA demonstrate the best performance in addressing the issue of F-B imbalance. (7) トレーニングデータセットのサイズが大きい場合, 方法の選択は影響を受けない (8) フォーカスロス, GHM, GFLを含むソフトサンプリング手法は, 平均的にかなりよく動作するが, 比較的不安定である。

関連論文リスト

Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning [51.170479006249195]
本研究では,新しいデータセット,ベンチマーク,動的粗大な学習手法を提案する。提案するデータセットであるAI-TOD-Rは、すべてのオブジェクト指向オブジェクト検出データセットの中で最小のオブジェクトサイズを特徴としている。完全教師付きおよびラベル効率の両アプローチを含む,幅広い検出パラダイムにまたがるベンチマークを提案する。
論文参考訳（メタデータ） (2024-12-16T09:14:32Z)
DASSF: Dynamic-Attention Scale-Sequence Fusion for Aerial Object Detection [6.635903943457569]
元のYOLOアルゴリズムは、異なるスケールのターゲットを認識する能力の弱いため、全体的な検出精度が低い。本稿では,空中画像のターゲット検出のための動的アテンションスケール系列融合アルゴリズム(DASSF)を提案する。 DASSF法をYOLOv8nと比較すると,平均平均精度(mAP)は9.2%,2.4%増加した。
論文参考訳（メタデータ） (2024-06-18T05:26:44Z)
Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection [133.66006666465447]
現在のメトリクスはサイズに敏感で、大きなオブジェクトが集中し、小さなオブジェクトが無視される傾向があります。サイズに基づくバイアスは、追加のセマンティック情報なしでは不適切であるため、評価はサイズ不変であるべきだと論じる。我々は,この目標に適した最適化フレームワークを開発し,異なる大きさのオブジェクトの検出において,大幅な改善を実現した。
論文参考訳（メタデータ） (2024-05-16T03:01:06Z)
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies [0.5439020425818999]
本研究は, YOLOv5単段検出器を用いて, 前地上クラス不均衡問題に対処するベンチマークフレームワークを提案する。我々は,サンプリング,損失重み付け,データ強化という3つの確立した手法を精査した。比較分析の結果,2段階検出法では有効であるが,YOLOv5の性能向上には有効ではないことが明らかとなった。
論文参考訳（メタデータ） (2024-03-11T19:06:04Z)
Diffusion-Based Particle-DETR for BEV Perception [94.88305708174796]
Bird-Eye-View (BEV)は、自律走行車(AV)における視覚知覚のための最も広く使われているシーンの1つである。近年の拡散法は、視覚知覚のための不確実性モデリングに有望なアプローチを提供するが、BEVの広い範囲において、小さな物体を効果的に検出することができない。本稿では,BEVにおける拡散パラダイムと最先端の3Dオブジェクト検出器を組み合わせることで,この問題に対処する。
論文参考訳（メタデータ） (2023-12-18T09:52:14Z)
Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for Advanced Object Detection [55.2480439325792]
本稿では,LSKNetのバックボーンをDiffusionDetヘッドに統合したオブジェクト検出モデルの詳細な評価を行う。提案手法は平均精度(MAP)を約45.7%向上させる。この進歩は、提案された修正の有効性を強調し、航空画像解析の新しいベンチマークを設定する。
論文参考訳（メタデータ） (2023-11-21T19:49:13Z)
Rethinking Scale Imbalance in Semi-supervised Object Detection for Aerial Images [48.460468764544835]
本研究では,航空画像のためのS3OD学習パイプラインを提案する。 S3ODでは,3つの重要な要素,SAT(Size-aware Adaptive Thresholding),SLA(Size-re Balanced Label Assignment),TNL(Teacher-guided Negative Learning)が,非バイアス学習の保証のために提案されている。
論文参考訳（メタデータ） (2023-10-23T08:55:10Z)
ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object Detection [24.085715205081385]
我々は、オブジェクトのサイズを画像中の画素数、サイズ不均衡として、データセット内の特定のサイズのオブジェクトの過剰表現として表現する。本稿では,任意のオブジェクト検出モデル上にプラグインとして適用可能な,新しいARchitectUre-Agnostic BAlanced Loss (ARUBA)を提案する。
論文参考訳（メタデータ） (2022-10-10T11:28:16Z)
Towards Model Generalization for Monocular 3D Object Detection [57.25828870799331]
我々は,Mono3Dオブジェクト検出に有効な統合カメラ一般化パラダイム(CGP)を提案する。また,インスタンスレベルの拡張によりギャップを埋める2D-3D幾何一貫性オブジェクトスケーリング戦略(GCOS)を提案する。 DGMono3Dと呼ばれる手法は、評価された全てのデータセットに対して顕著な性能を達成し、SoTAの教師なしドメイン適応スキームを上回ります。
論文参考訳（メタデータ） (2022-05-23T23:05:07Z)
Scale-Equivalent Distillation for Semi-Supervised Object Detection [57.59525453301374]
近年のSemi-Supervised Object Detection (SS-OD) 法は主に自己学習に基づいており、教師モデルにより、ラベルなしデータを監視信号としてハードな擬似ラベルを生成する。実験結果から,これらの手法が直面する課題を分析した。本稿では,大規模オブジェクトサイズの分散とクラス不均衡に頑健な簡易かつ効果的なエンド・ツー・エンド知識蒸留フレームワークであるSED(Scale-Equivalent Distillation)を提案する。
論文参考訳（メタデータ） (2022-03-23T07:33:37Z)
Resolving Class Imbalance in Object Detection with Weighted Cross Entropy Losses [0.0]
物体検出はコンピュータビジョンにおいて重要なタスクであり、自律運転、監視、ロボット工学といった現実世界の多くのアプリケーションに役立っている。不均一なオブジェクトクラス分布を持つ特別なデータセットに関しては、検出器のパフォーマンスにはまだ制限がある。クロスエントロピー損失の重み付き変種を適用して,そのような問題を探索し,克服することを提案する。
論文参考訳（メタデータ） (2020-06-02T06:36:12Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。