Fugu-MT 論文翻訳(概要): UnRectDepthNet: Self-Supervised Monocular Depth Estimation using a Generic Framework for Handling Common Camera Distortion Models

論文の概要: UnRectDepthNet: Self-Supervised Monocular Depth Estimation using a Generic Framework for Handling Common Camera Distortion Models

arxiv url: http://arxiv.org/abs/2007.06676v3
Date: Tue, 6 Oct 2020 19:41:35 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-10 23:51:56.136141
Title: UnRectDepthNet: Self-Supervised Monocular Depth Estimation using a Generic Framework for Handling Common Camera Distortion Models
Title（参考訳）: UnRectDepthNet:共通カメラ歪みモデル処理のためのジェネリックフレームワークを用いた自己監督単眼深度推定
Authors: Varun Ravi Kumar, Senthil Yogamani, Markus Bach, Christian Witt, Stefan Milz and Patrick Mader
Abstract要約: 本研究では,未修正単眼ビデオから深度,ユークリッド距離,および視覚計測を推定するための,汎用的な規模対応型自己教師パイプラインを提案する。提案アルゴリズムは,KITTI修正データセットでさらに評価され,最先端の結果が得られた。
参考スコア（独自算出の注目度）: 8.484676769284578
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In classical computer vision, rectification is an integral part of multi-view depth estimation. It typically includes epipolar rectification and lens distortion correction. This process simplifies the depth estimation significantly, and thus it has been adopted in CNN approaches. However, rectification has several side effects, including a reduced field of view (FOV), resampling distortion, and sensitivity to calibration errors. The effects are particularly pronounced in case of significant distortion (e.g., wide-angle fisheye cameras). In this paper, we propose a generic scale-aware self-supervised pipeline for estimating depth, euclidean distance, and visual odometry from unrectified monocular videos. We demonstrate a similar level of precision on the unrectified KITTI dataset with barrel distortion comparable to the rectified KITTI dataset. The intuition being that the rectification step can be implicitly absorbed within the CNN model, which learns the distortion model without increasing complexity. Our approach does not suffer from a reduced field of view and avoids computational costs for rectification at inference time. To further illustrate the general applicability of the proposed framework, we apply it to wide-angle fisheye cameras with 190$^\circ$ horizontal field of view. The training framework UnRectDepthNet takes in the camera distortion model as an argument and adapts projection and unprojection functions accordingly. The proposed algorithm is evaluated further on the KITTI rectified dataset, and we achieve state-of-the-art results that improve upon our previous work FisheyeDistanceNet. Qualitative results on a distorted test scene video sequence indicate excellent performance https://youtu.be/K6pbx3bU4Ss.
Abstract（参考訳）: 古典的コンピュータビジョンでは、整流は多視点深度推定の不可欠な部分である。典型的には、エピポーラ補正とレンズ歪み補正を含む。このプロセスは深さ推定を大幅に単純化し、cnnのアプローチで採用されている。しかし、修正には、視野の縮小(FOV)、再サンプリング歪み、校正誤差に対する感度など、いくつかの副作用がある。この効果は特に大きな歪み(広角魚眼カメラなど)の場合には顕著に発音される。本稿では,非再現単眼映像から深さ,ユークリッド距離,視覚オドメトリを推定する汎用的なスケールアウェア自己教師付きパイプラインを提案する。補正されたKITTIデータセットと同等のバレル歪みを有する未修正KITTIデータセットに対して、同様の精度を示す。補正ステップがCNNモデルに暗黙的に吸収されるという直感は、複雑さを増すことなく歪みモデルを学習する。提案手法は視野の縮小に悩まされず,推論時の修正に要する計算コストを回避する。提案フレームワークの汎用性をさらに説明するために,190$^\circ$水平視野を持つ広角魚眼カメラに適用する。トレーニングフレームワークUnRectDepthNetは、カメラ歪みモデルを引数として取り込んで、プロジェクションとアンプロジェクション機能を適用する。提案アルゴリズムはkitti正規化データセット上でさらに評価され,これまでのfisheye distancenetの成果を改良した最新結果が得られる。歪んだテストシーンのビデオシーケンスの質的な結果は、優れたパフォーマンス https://youtu.be/K6pbx3bU4Ssを示している。

論文の概要: UnRectDepthNet: Self-Supervised Monocular Depth Estimation using a Generic Framework for Handling Common Camera Distortion Models

関連論文リスト