Fugu-MT 論文翻訳(概要): Deep Hough Transform for Semantic Line Detection

論文の概要: Deep Hough Transform for Semantic Line Detection

arxiv url: http://arxiv.org/abs/2003.04676v4
Date: Sat, 1 May 2021 17:46:25 GMT
ステータス: 翻訳完了
システム内更新日: 2022-12-24 21:22:43.742353
Title: Deep Hough Transform for Semantic Line Detection
Title（参考訳）: 意味線検出のためのディープハフ変換
Authors: Kai Zhao, Qi Han, Chang-Bin Zhang, Jun Xu, and Ming-Ming Cheng
Abstract要約: 自然の場面で意味のある線構造、つまり意味的な線を検知する基本的なタスクに焦点をあてる。従来の手法は線の性質を無視し、準最適性能をもたらす。行検出のためのワンショットエンドツーエンド学習フレームワークを提案する。
参考スコア（独自算出の注目度）: 70.28969017874587
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We focus on a fundamental task of detecting meaningful line structures, a.k.a. semantic line, in natural scenes. Many previous methods regard this problem as a special case of object detection and adjust existing object detectors for semantic line detection. However, these methods neglect the inherent characteristics of lines, leading to sub-optimal performance. Lines enjoy much simpler geometric property than complex objects and thus can be compactly parameterized by a few arguments. To better exploit the property of lines, in this paper, we incorporate the classical Hough transform technique into deeply learned representations and propose a one-shot end-to-end learning framework for line detection. By parameterizing lines with slopes and biases, we perform Hough transform to translate deep representations into the parametric domain, in which we perform line detection. Specifically, we aggregate features along candidate lines on the feature map plane and then assign the aggregated features to corresponding locations in the parametric domain. Consequently, the problem of detecting semantic lines in the spatial domain is transformed into spotting individual points in the parametric domain, making the post-processing steps, i.e. non-maximal suppression, more efficient. Furthermore, our method makes it easy to extract contextual line features eg features along lines close to a specific line, that are critical for accurate line detection. In addition to the proposed method, we design an evaluation metric to assess the quality of line detection and construct a large scale dataset for the line detection task. Experimental results on our proposed dataset and another public dataset demonstrate the advantages of our method over previous state-of-the-art alternatives.
Abstract（参考訳）: 自然場面において意味のある線構造、すなわち意味線を検出する基本的なタスクに焦点を当てる。従来の多くの手法では、この問題をオブジェクト検出の特別な場合とみなし、既存のオブジェクト検出をセマンティックライン検出のために調整している。しかし、これらの手法は線の性質を無視し、準最適性能をもたらす。直線は複素対象よりもはるかに単純な幾何学的性質を持ち、従っていくつかの引数によってコンパクトにパラメータ化できる。本稿では,線の性質をよりよく活用するために,古典的ハフ変換手法を深く学習された表現に取り入れ,一対一で線検出を行う学習フレームワークを提案する。傾斜や偏りのある線をパラメータ化することで、深部表現をパラメトリック領域に変換するハフ変換を行い、線の検出を行う。具体的には、フィーチャーマッププレーンの候補ラインに沿って機能を集約し、パラメトリックドメイン内の対応する場所に集約した機能を割り当てます。これにより、空間領域における意味線を検出する問題は、パラメトリック領域内の個々の点を検知し、処理後のステップ、すなわち非最大抑制をより効率的にする。さらに,本手法は,行の正確な検出に欠かせない,特定の行に近い線に沿って,文脈的特徴,例えば特徴を抽出しやすくする。提案手法に加えて,線検出の質を評価するための評価指標を設計し,線検出タスクのための大規模データセットを構築する。提案するデータセットおよび他の公開データセットにおける実験結果は、従来の最先端の代替案よりも優れた方法を示している。

論文の概要: Deep Hough Transform for Semantic Line Detection

関連論文リスト