Fugu-MT 論文翻訳(概要): Portable Active Learning for Object Detection

論文の概要: Portable Active Learning for Object Detection

arxiv url: http://arxiv.org/abs/2605.10349v1
Date: Mon, 11 May 2026 10:52:23 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.74496
Title: Portable Active Learning for Object Detection
Title（参考訳）: オブジェクト検出のためのポータブル能動学習
Authors: Rashi Sharma, Justin Timothy C. Bersamin, Karthikk Subramanian,
Abstract要約: Portable Active Learning (PAL) は、推論出力のみで動作する検出器に依存しない、ポータブルなフレームワークである。 PALは、クラスレベルのインスタンスの不確実性と画像レベルの多様性を組み合わせて、データ選択をガイドする。
参考スコア（独自算出の注目度）: 0.649970685896541
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Annotating bounding boxes is costly and limits the scalability of object detection. This challenge is compounded by the need to preserve high accuracy while minimizing manual effort in real-world applications. Prior active learning methods often depend on model features or modify detector internals and training schedules, increasing integration overhead. Moreover, they rarely jointly exploit the benefits of image-level signals, class-imbalance cues, and instance-level uncertainty for comprehensive selection. We present Portable Active Learning (PAL), a detector-agnostic, easily portable framework that operates solely on inference outputs. PAL combines class-wise instance uncertainty with image-level diversity to guide data selection. At each round, PAL trains lightweight class-specific logistic classifiers to distinguish true from false positives, producing entropy-based uncertainty scores for proposals. Candidate images are then refined using global image entropy, class diversity, and image similarity, yielding batches that are both informative and diverse. PAL requires no changes to model internals or training pipelines, ensuring broad compatibility across detectors. Extensive experiments on COCO, PASCAL VOC, and BDD100K demonstrate that PAL consistently improves label efficiency and detection accuracy compared to existing active learning baselines, making it a practical solution for scalable and cost-effective deployment of object detection in real-world settings.
Abstract（参考訳）: 境界ボックスのアノテーションはコストがかかり、オブジェクト検出のスケーラビリティが制限される。この課題は、現実世界のアプリケーションにおける手作業を最小限にしながら、高い精度を維持する必要性によって複雑化している。従来のアクティブな学習手法はモデルの特徴に依存したり、ディテクターの内部やトレーニングスケジュールを変更することがあり、統合のオーバーヘッドが増加する。さらに、画像レベルの信号、クラス不均衡の方法、総合的な選択のためのインスタンスレベルの不確実性の利点を共同で活用することは滅多にない。提案するPortable Active Learning (PAL, Portable Active Learning) は,推論出力のみで動作する,検出に依存しない,ポータブルなフレームワークである。 PALは、クラスレベルのインスタンスの不確実性と画像レベルの多様性を組み合わせて、データ選択をガイドする。各ラウンドでPALは、偽陽性と真を区別するために、クラス固有のロジスティック分類器を軽量に訓練し、提案に対するエントロピーベースの不確実性スコアを生成する。候補画像は、グローバルな画像エントロピー、クラス多様性、および画像類似性を用いて洗練され、情報的かつ多様なバッチが生成される。 PALはモデル内部やトレーニングパイプラインの変更を必要とせず、検出器間の広範な互換性を確保する。 COCO、PASCAL VOC、BDD100Kに関する大規模な実験は、PALが既存のアクティブラーニングベースラインと比較して、ラベル効率と検出精度を一貫して改善していることを示している。

論文の概要: Portable Active Learning for Object Detection

関連論文リスト