Fugu-MT 論文翻訳(概要): SFFNet: Synergistic Feature Fusion Network With Dual-Domain Edge Enhancement for UAV Image Object Detection

論文の概要: SFFNet: Synergistic Feature Fusion Network With Dual-Domain Edge Enhancement for UAV Image Object Detection

arxiv url: http://arxiv.org/abs/2604.03176v1
Date: Fri, 03 Apr 2026 16:52:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 17:20:24.53985
Title: SFFNet: Synergistic Feature Fusion Network With Dual-Domain Edge Enhancement for UAV Image Object Detection
Title（参考訳）: SFFNet:UAV画像オブジェクト検出のためのデュアルドメインエッジ拡張による相乗的特徴融合ネットワーク
Authors: Wenfeng Zhang, Jun Ni, Yue Meng, Xiaodong Pei, Wei Hu, Qibing Qin, Lei Huang,
Abstract要約: 無人航空機(UAV)画像における物体検出は非常に難しい課題である。我々は,UAV画像の物体検出に適した2領域エッジ拡張を備えた相乗的特徴融合ネットワーク(SFFNet)を開発した。 2つの挑戦的な空中データセットの実験は、SFFNet-Xの卓越した性能を示している。
参考スコア（独自算出の注目度）: 25.963089047314963
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Object detection in unmanned aerial vehicle (UAV) images remains a highly challenging task, primarily caused by the complexity of background noise and the imbalance of target scales. Traditional methods easily struggle to effectively separate objects from intricate backgrounds and fail to fully leverage the rich multi-scale information contained within images. To address these issues, we have developed a synergistic feature fusion network (SFFNet) with dual-domain edge enhancement specifically tailored for object detection in UAV images. Firstly, the multi-scale dynamic dual-domain coupling (MDDC) module is designed. This component introduces a dual-driven edge extraction architecture that operates in both the frequency and spatial domains, enabling effective decoupling of multi-scale object edges from background noise. Secondly, to further enhance the representation capability of the model's neck in terms of both geometric and semantic information, a synergistic feature pyramid network (SFPN) is proposed. SFPN leverages linear deformable convolutions to adaptively capture irregular object shapes and establishes long-range contextual associations around targets through the designed wide-area perception module (WPM). Moreover, to adapt to the various applications or resource-constrained scenarios, six detectors of different scales (N/S/M/B/L/X) are designed. Experiments on two challenging aerial datasets (VisDrone and UAVDT) demonstrate the outstanding performance of SFFNet-X, achieving 36.8 AP and 20.6 AP, respectively. The lightweight models (N/S) also maintain a balance between detection accuracy and parameter efficiency. The code will be available at https://github.com/CQNU-ZhangLab/SFFNet.
Abstract（参考訳）: 無人航空機(UAV)画像における物体検出は、主に背景雑音の複雑さと目標スケールの不均衡に起因する非常に困難な課題である。従来の手法では、複雑な背景からオブジェクトを効果的に分離することは困難であり、画像に含まれるリッチなマルチスケール情報を完全に活用できない。これらの問題に対処するため,UAV画像の物体検出に適した2領域エッジ拡張を備えた相乗的特徴融合ネットワーク(SFFNet)を開発した。まず,マルチスケール動的二重ドメイン結合(MDDC)モジュールを設計する。このコンポーネントは、周波数領域と空間領域の両方で動作するデュアル駆動エッジ抽出アーキテクチャを導入し、背景雑音からマルチスケールオブジェクトエッジを効果的に分離することを可能にする。次に、幾何情報と意味情報の両方の観点からモデル首の表現能力を更に高めるために、相乗的特徴ピラミッドネットワーク(SFPN)を提案する。 SFPNは線形変形可能な畳み込みを利用して不規則な物体の形状を適応的に捕捉し、設計されたワイドエリア認識モジュール(WPM)を介してターゲット周辺の長距離コンテキスト関連を確立する。さらに、様々な応用や資源制約のシナリオに適応するために、異なるスケールの6つの検出器(N/S/M/B/L/X)が設計されている。 2つの挑戦的な航空データセット(VisDroneとUAVDT)の実験は、それぞれ36.8 APと20.6 APを達成したSFFNet-Xの優れた性能を示した。軽量モデル(N/S)は検出精度とパラメータ効率のバランスを保っている。コードはhttps://github.com/CQNU-ZhangLab/SFFNet.comから入手できる。

論文の概要: SFFNet: Synergistic Feature Fusion Network With Dual-Domain Edge Enhancement for UAV Image Object Detection

関連論文リスト