Fugu-MT 論文翻訳(概要): KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D

論文の概要: KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D

arxiv url: http://arxiv.org/abs/2109.13410v1
Date: Tue, 28 Sep 2021 00:41:29 GMT
ステータス: 翻訳完了
システム内更新日: 2021-09-29 14:46:08.989394
Title: KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D
Title（参考訳）: KITTI-360:2次元・3次元都市景観理解のための新しいデータセットとベンチマーク
Authors: Yiyi Liao, Jun Xie, Andreas Geiger
Abstract要約: KITTI-360は、よりリッチな入力モダリティ、包括的なセマンティックインスタンスアノテーション、正確なローカライゼーションを含む郊外の運転データセットである。その結果,150k以上のセマンティクスとインスタンスのアノテート画像と1Bのアノテート3Dポイントが得られた。我々は、同じデータセット上のコンピュータビジョン、グラフィックス、ロボット工学の問題を含む、モバイル知覚に関連するいくつかのタスクのベンチマークとベースラインを構築した。
参考スコア（独自算出の注目度）: 67.50776195828242
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: For the last few decades, several major subfields of artificial intelligence including computer vision, graphics, and robotics have progressed largely independently from each other. Recently, however, the community has realized that progress towards robust intelligent systems such as self-driving cars requires a concerted effort across the different fields. This motivated us to develop KITTI-360, successor of the popular KITTI dataset. KITTI-360 is a suburban driving dataset which comprises richer input modalities, comprehensive semantic instance annotations and accurate localization to facilitate research at the intersection of vision, graphics and robotics. For efficient annotation, we created a tool to label 3D scenes with bounding primitives and developed a model that transfers this information into the 2D image domain, resulting in over 150k semantic and instance annotated images and 1B annotated 3D points. Moreover, we established benchmarks and baselines for several tasks relevant to mobile perception, encompassing problems from computer vision, graphics, and robotics on the same dataset. KITTI-360 will enable progress at the intersection of these research areas and thus contributing towards solving one of our grand challenges: the development of fully autonomous self-driving systems.
Abstract（参考訳）: 過去数十年間、コンピュータビジョン、グラフィックス、ロボット工学など、人工知能のいくつかの主要なサブフィールドは互いに独立して進化してきた。しかし、近年、自動運転車のような堅牢なインテリジェントシステムへの進歩には、さまざまな分野にわたる協調的な努力が必要だとコミュニティは認識している。これにより、一般的なKITTIデータセットの後継であるKITTI-360を開発する動機となった。 KITTI-360は、視覚、グラフィックス、ロボット工学の交差点での研究を容易にするために、より豊富な入力モダリティ、包括的なセマンティックインスタンスアノテーション、正確なローカライゼーションを含む郊外の運転データセットである。そこで我々は,3次元シーンを境界プリミティブでラベル付けするツールを開発し,その情報を2次元画像領域に転送し,150k以上のセマンティクスとインスタンスの注釈付き画像と1Bの注釈付き3Dポイントを生成するモデルを開発した。さらに,同じデータセット上で,コンピュータビジョンやグラフィックス,ロボット工学の問題を包含して,モバイル知覚に関連するいくつかのタスクのベンチマークとベースラインを構築した。 kitti-360は、これらの研究分野の交差点での進歩を可能にし、私たちの大きな課題である完全自動運転システムの開発に寄与します。

論文の概要: KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D

関連論文リスト