Fugu-MT 論文翻訳(概要): BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera

論文の概要: BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera

arxiv url: http://arxiv.org/abs/2604.11714v1
Date: Mon, 13 Apr 2026 16:50:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:16.695774
Title: BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera
Title（参考訳）: BEM:False-Positive Suppressionのための訓練不要バックグラウンド埋め込みメモリ
Authors: Junwoo Park, Jangho Lee, Sunho Lim,
Abstract要約: COCOライクなベンチマークは、インスタンス密度よりもカテゴリの多様性を強調しており、クラスごとの間隔で訓練された検出器は、密度の高いシングルクラスまたは少数クラスのシーンで苦労する。提案するバックグラウンド埋め込みメモリ(BEM, background Embedding Memory)は,事前学習した検知器に推論時にアタッチ可能な軽量でトレーニング不要で軽量なモジュールである。
参考スコア（独自算出の注目度）: 2.2721664313694205
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Pretrained detectors perform well on benchmarks but often suffer performance degradation in real-world deployments due to distribution gaps between training data and target environments. COCO-like benchmarks emphasize category diversity rather than instance density, causing detectors trained under per-class sparsity to struggle in dense, single- or few-class scenes such as surveillance and traffic monitoring. In fixed-camera environments, the quasi-static background provides a stable, label-free prior that can be exploited at inference to suppress spurious detections. To address the issue, we propose Background Embedding Memory (BEM), a lightweight, training-free, weight-frozen module that can be attached to pretrained detectors during inference. BEM estimates clean background embeddings, maintains a prototype memory, and re-scores detection logits with an inverse-similarity, rank-weighted penalty, effectively reducing false positives while maintaining recall. Empirically, background-frame cosine similarity correlates negatively with object count and positively with Precision-Confidence AUC (P-AUC), motivating its use as a training-free control signal. Across YOLO and RT-DETR families on LLVIP and simulated surveillance streams, BEM consistently reduces false positives while preserving real-time performance. Our code is available at https://github.com/Leo-Park1214/Background-Embedding-Memory.git
Abstract（参考訳）: 事前訓練された検出器は、ベンチマークでよく機能するが、トレーニングデータとターゲット環境の間の分散ギャップにより、実世界のデプロイでパフォーマンスが低下することが多い。 COCOライクなベンチマークは、インスタンス密度よりもカテゴリの多様性を強調しており、クラスごとの間隔で訓練された検出器は、監視や交通監視といった、密集したシングルクラスまたは少数クラスのシーンで苦労する。固定カメラ環境では、準静的な背景は安定なラベルなしの事前を提供し、推論時に利用でき、急激な検出を抑えることができる。この問題に対処するため,BEM(Background Embedding Memory)を提案する。 BEMはクリーンなバックグラウンド埋め込みを推定し、プロトタイプメモリを維持し、逆相似でランク重み付きペナルティで検出ロジットを再スコアし、リコールを維持しながら偽陽性を効果的に低減する。経験的に、背景フレームのコサイン類似性は、対象数と負の相関を持ち、精度信頼AUC(P-AUC)と正の相関を持ち、トレーニング不要な制御信号としての使用を動機付けている。 LLVIPおよびシミュレートされた監視ストリーム上のYOLOおよびRT-DETRファミリー全体で、BEMは、リアルタイムのパフォーマンスを維持しながら、常に偽陽性を減らす。私たちのコードはhttps://github.com/Leo-Park1214/Background-Embedding-Memory.gitで利用可能です。

論文の概要: BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera

関連論文リスト