Fugu-MT 論文翻訳(概要): Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

論文の概要: Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

arxiv url: http://arxiv.org/abs/2106.01883v1
Date: Thu, 3 Jun 2021 14:29:19 GMT
ステータス: 翻訳完了
システム内更新日: 2021-06-04 12:24:52.338468
Title: Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence
Title（参考訳）: Kullback-Leibler Divergence を用いた回転物体検出のための高精度バウンディングボックスの学習
Authors: Xue Yang, Xiaojiang Yang, Jirui Yang, Qi Ming, Wentao Wang, Qi Tian, Junchi Yan
Abstract要約: 既存の回転物体検出器は、主に水平検出パラダイムから受け継がれている。本稿では,回転回帰損失の設計を帰納パラダイムから導出手法に変更することを目的としている。
参考スコア（独自算出の注目度）: 100.6913091147422
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Existing rotated object detectors are mostly inherited from the horizontal detection paradigm, as the latter has evolved into a well-developed area. However, these detectors are difficult to perform prominently in high-precision detection due to the limitation of current regression loss design, especially for objects with large aspect ratios. Taking the perspective that horizontal detection is a special case for rotated object detection, in this paper, we are motivated to change the design of rotation regression loss from induction paradigm to deduction methodology, in terms of the relation between rotation and horizontal detection. We show that one essential challenge is how to modulate the coupled parameters in the rotation regression loss, as such the estimated parameters can influence to each other during the dynamic joint optimization, in an adaptive and synergetic way. Specifically, we first convert the rotated bounding box into a 2-D Gaussian distribution, and then calculate the Kullback-Leibler Divergence (KLD) between the Gaussian distributions as the regression loss. By analyzing the gradient of each parameter, we show that KLD (and its derivatives) can dynamically adjust the parameter gradients according to the characteristics of the object. It will adjust the importance (gradient weight) of the angle parameter according to the aspect ratio. This mechanism can be vital for high-precision detection as a slight angle error would cause a serious accuracy drop for large aspect ratios objects. More importantly, we have proved that KLD is scale invariant. We further show that the KLD loss can be degenerated into the popular $l_{n}$-norm loss for horizontal detection. Experimental results on seven datasets using different detectors show its consistent superiority, and codes are available at https://github.com/yangxue0827/RotationDetection.
Abstract（参考訳）: 既存の回転物体検出器は、主に水平検出パラダイムから受け継がれており、後者は十分に発達した領域へと進化している。しかし、これらの検出器は、特にアスペクト比が大きい物体において、現在の回帰損失設計の限界により、高精度検出において顕著に実行するのは難しい。本稿では, 水平方向検出が回転物体検出の特別な場合であるという観点から, 回転物体検出と水平方向検出の関係の観点から, 回転回帰損失の設計を誘導パラダイムから導出手法に変更する動機付けを行う。本研究は, 動的ジョイント最適化において, 推定パラメータが相互に影響を及ぼすように, 回転回帰損失の結合パラメータを適応的, 相乗的に変調する方法が重要な課題であることを示す。具体的には、まず回転した有界箱を2次元ガウス分布に変換し、その後、ガウス分布間のKLDを回帰損失として計算する。各パラメータの勾配を解析することにより、KLD(とその誘導体)が対象の特性に応じてパラメータ勾配を動的に調整できることを示す。アスペクト比に応じて角度パラメータの重要性(漸進的な重み)を調整する。この機構は、わずかな角度誤差が大きなアスペクト比オブジェクトに対して深刻な精度低下を引き起こすため、高精度検出に不可欠である。さらに、KLDがスケール不変であることが証明された。さらに、KLD損失は、水平検出のために人気のある$l_{n}$-norm損失に分解可能であることを示す。異なる検出器を用いた7つのデータセットの実験結果は、一貫性のある優位性を示しており、コードはhttps://github.com/yangxue0827/rotationdetectionで利用可能である。

論文の概要: Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

関連論文リスト