Fugu-MT 論文翻訳(概要): Residual Learning for Image Point Descriptors

論文の概要: Residual Learning for Image Point Descriptors

arxiv url: http://arxiv.org/abs/2312.15471v1
Date: Sun, 24 Dec 2023 12:51:30 GMT
ステータス: 翻訳完了
システム内更新日: 2023-12-27 18:05:34.982309
Title: Residual Learning for Image Point Descriptors
Title（参考訳）: 画像ポイント記述子の残差学習
Authors: Rashik Shrestha, Ajad Chhatkuli, Menelaos Kanakis, Luc Van Gool
Abstract要約: 本稿では,手作りの検出器とディスクリプタを用いて局所画像記述子を学習する,非常にシンプルで効果的な手法を提案する。我々は,手書き記述子にすでに存在する知識を活用することで,最終記述子を最適化する。本手法は,非微分関数を用いたアンサンブル学習や学習に応用できる可能性がある。
参考スコア（独自算出の注目度）: 56.917951170421894
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Local image feature descriptors have had a tremendous impact on the development and application of computer vision methods. It is therefore unsurprising that significant efforts are being made for learning-based image point descriptors. However, the advantage of learned methods over handcrafted methods in real applications is subtle and more nuanced than expected. Moreover, handcrafted descriptors such as SIFT and SURF still perform better point localization in Structure-from-Motion (SfM) compared to many learned counterparts. In this paper, we propose a very simple and effective approach to learning local image descriptors by using a hand-crafted detector and descriptor. Specifically, we choose to learn only the descriptors, supported by handcrafted descriptors while discarding the point localization head. We optimize the final descriptor by leveraging the knowledge already present in the handcrafted descriptor. Such an approach of optimization allows us to discard learning knowledge already present in non-differentiable functions such as the hand-crafted descriptors and only learn the residual knowledge in the main network branch. This offers 50X convergence speed compared to the standard baseline architecture of SuperPoint while at inference the combined descriptor provides superior performance over the learned and hand-crafted descriptors. This is done with minor increase in the computations over the baseline learned descriptor. Our approach has potential applications in ensemble learning and learning with non-differentiable functions. We perform experiments in matching, camera localization and Structure-from-Motion in order to showcase the advantages of our approach.
Abstract（参考訳）: ローカル画像特徴ディスクリプタは,コンピュータビジョン手法の開発と応用に大きな影響を与えてきた。したがって、学習ベースの画像ポイント記述子に多大な努力が払われていることは驚きではない。しかし、実アプリケーションにおける手作りの手法よりも学習方法の利点は、予想以上に微妙で曖昧である。さらに、SIFT や SURF などの手書き記述子は、多くの学習済み言語と比較して、Structure-from-Motion (SfM) においてより優れたポイントローカライゼーションを行う。本稿では,手作りの検出器とディスクリプタを用いて,局所画像記述子を学習する非常に単純かつ効果的な手法を提案する。具体的には、ポイントローカライゼーションヘッドを破棄しながら手作りのディスクリプタでサポートされているディスクリプタのみを学習する。我々は手書き記述子にすでに存在する知識を活用して最終記述子を最適化する。このような最適化のアプローチにより、手作り記述子のような非微分可能関数にすでに存在する学習知識を破棄し、メインネットワークブランチで残っている知識のみを学習できる。これにより、SuperPointの標準ベースラインアーキテクチャと比較して50倍の収束速度が得られ、推論時に複合ディスクリプタは学習および手書きのディスクリプタよりも優れたパフォーマンスを提供する。これはベースライン学習ディスクリプタ上での計算量の増加によって実現される。本手法は,非微分関数を用いた学習と学習に応用できる可能性がある。提案手法の利点を示すため,マッチング,カメラのローカライゼーション,Structure-from-Motion実験を行った。

論文の概要: Residual Learning for Image Point Descriptors

関連論文リスト