Fugu-MT 論文翻訳(概要): Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging

論文の概要: Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging

arxiv url: http://arxiv.org/abs/2606.06536v1
Date: Wed, 03 Jun 2026 21:56:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-08 14:33:29.353288
Title: Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging
Title（参考訳）: UAV透過線イメージングによる絶縁体欠陥検出のためのアテンションガイドオートエンコーダフュージョン
Authors: Malak Allam, Khaled Shaban, Ali Hamdi,
Abstract要約: 本稿では,Attention-Guided AutoEncoder-Enhanced YOLOフレームワークであるAE-YOLOを提案する。このアーキテクチャは、FPN-PAN(Feature Pyramid Network-Path Aggregation Network)のネックに軽量なボトルネックオートエンコーダを統合する。 Insulator-Defect Detectionデータセットの実験によると、効率的なNetV2バックボーンを持つAE-YOLOは0.5で95.10%のmAP、96.40%の精度、93.80%のリコールを達成した。
参考スコア（独自算出の注目度）: 0.5097809301149341
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Automated defect detection in high-voltage transmission-line insulators remains challenging due to severe class imbalance, large scale variation, and the small spatial extent of defect instances in Unmanned Aerial Vehicle (UAV) imagery. To address these challenges, this paper proposes AE-YOLO, an Attention-Guided AutoEncoder-Enhanced YOLO framework for robust insulator defect detection. The architecture integrates lightweight bottleneck autoencoders within a Feature Pyramid Network-Path Aggregation Network (FPN-PAN) neck. This preserves anomaly-sensitive information during multi-scale feature fusion. Convolutional Block Attention Modules (CBAM) are used throughout the backbone, enhancing feature discrimination and suppressing background interference. The framework also introduces a variance-maximizing autoencoder regularization strategy, which encourages diverse, defect-discriminative latent representations. The network trains using a unified objective that combines focal loss, Complete IoU (CIoU) loss, and autoencoder regularization to address foreground-background imbalance and improve localization accuracy. During inference, Weighted Boxes Fusion (WBF) combines predictions from YOLOv8, YOLOv10, and YOLO11. An autoencoder-guided confidence boosting mechanism improves sensitivity to rare defect categories. Experiments on the Insulator-Defect Detection dataset show that AE-YOLO with an EfficientNetV2 backbone achieves 95.10 percent mAP at 0.5, 96.40 percent precision, and 93.80 percent recall. This performance surpasses the strongest YOLO-family baseline by 5.0 points in mAP at 0.5 and 6.7 points in recall. These results confirm the effectiveness and adaptability of the framework. The model is a practical and scalable solution for UAV-based transmission-line inspection and defect monitoring.
Abstract（参考訳）: 高電圧送電線絶縁体における欠陥の自動検出は, 高度不均衡, 大規模変動, 無人航空機(UAV)画像の空間的欠陥事例の少ないため, 依然として困難である。これらの課題に対処するため,本論文では,頑健な絶縁体欠陥検出のためのアテンションガイド付きオートエンコーダ拡張YOLOフレームワークであるAE-YOLOを提案する。このアーキテクチャは、FPN-PAN(Feature Pyramid Network-Path Aggregation Network)のネックに軽量なボトルネックオートエンコーダを統合する。これは、マルチスケールのフィーチャ融合中に異常に敏感な情報を保存する。 Convolutional Block Attention Modules (CBAM) は背骨全体に使用され、特徴の識別を高め、バックグラウンド干渉を抑制する。このフレームワークはまた、分散を最大化するオートエンコーダ正規化戦略を導入し、多様な欠陥を識別可能な潜在表現を奨励している。ネットワークは、焦点損失、完全IoU(CIoU)損失、およびフォアグラウンド・バックグラウンドの不均衡に対処するためのオートエンコーダ正規化を組み合わせ、ローカライズ精度を向上させる統一目的を用いて訓練する。推測中、Weighted Boxes Fusion (WBF)はYOLOv8、YOLOv10、YOLO11からの予測を組み合わせている。自己エンコーダ誘導型信頼促進機構は、まれな欠陥カテゴリに対する感度を向上させる。 Insulator-Defect Detectionデータセットの実験によると、効率的なNetV2バックボーンを持つAE-YOLOは0.5で95.10%のmAP、96.40%の精度、93.80%のリコールを達成した。このパフォーマンスは、最強のYOLOファミリーベースラインを0.5で5.0ポイント、リコールで6.7ポイント上回る。これらの結果は,フレームワークの有効性と適応性を確認した。このモデルは、UAVベースのトランスミッションラインインスペクションと欠陥監視のための実用的でスケーラブルなソリューションである。

論文の概要: Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging

関連論文リスト