Fugu-MT Paper Translation (Summary): LineMarkNet: Line Landmark Detection for Valet Parking

Paper summary: LineMarkNet: Line Landmark Detection for Valet Parking

arxiv url: http://arxiv.org/abs/2309.10475v1
Date: Tue, 19 Sep 2023 09:43:29 GMT
Status: Translation complete
System update date: 2023-09-20 15:14:00.230016
Title: LineMarkNet: Line Landmark Detection for Valet Parking
Authors: Zizhang Wu, Fan Wang, Yuanzhu Gan, Tianhao Xu, Weiwei Sun and Rui Tang
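Abstract summary: We develop a deep network (LineMarkNet) that detects line landmarks from surround-view cameras. We then use a multi-task decoder to detect multiple line landmarks. Experimental results show that our framework achieves improved performance compared with several line detection methods.
Reference score (attention score computed independently by this site): 18.448476027679213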
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We aim for accurate and efficient line landmark detection for valet parking, a long-standing yet unsolved problem in autonomous driving. To this end, we present a deep line landmark detection system whose modules are carefully designed to be lightweight. Specifically, we first empirically design four general line landmarks, comprising three physical lines and one novel mental line, which are effective for valet parking. We then develop a deep network (LineMarkNet) to detect line landmarks from surround-view cameras. Using the pre-calibrated homography, we fuse context from four separate cameras into a unified bird's-eye-view (BEV) space; specifically, we fuse the surround-view features and the BEV features. We then employ a multi-task decoder to detect multiple line landmarks: we apply a center-based strategy for the object detection task, and design a graph transformer that enhances the vision transformer with hierarchical-level graph reasoning for the semantic segmentation task. Finally, we parameterize the detected line landmarks (e.g., in slope-intercept form), and a novel filtering backend incorporates temporal and multi-view consistency to achieve smooth and stable detection. Moreover, we annotate a large-scale dataset to validate our method. Experimental results show that our framework achieves improved performance compared with several line detection methods, and that the multi-task network runs real-time line landmark detection on the Qualcomm 820A platform while maintaining superior accuracy.
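
The parameterization and temporal smoothing steps mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the filtering backend works, so the code below swaps in two generic techniques as stand-ins: an ordinary least-squares fit to obtain slope-intercept parameters, and an exponential moving average across frames. The names `fit_slope_intercept`, `LineSmoother`, and `alpha` are hypothetical and are not taken from LineMarkNet.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]
Line = Tuple[float, float]  # (slope, intercept) for y = slope * x + intercept


def fit_slope_intercept(points: List[Point]) -> Line:
    """Ordinary least-squares fit of y = slope * x + intercept to 2D points."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0.0:
        # All x values coincide: the slope-intercept form cannot represent this line.
        raise ValueError("vertical line: slope-intercept form is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


class LineSmoother:
    """Exponential moving average over per-frame line parameters.

    Assumption: this is a generic stand-in for temporal filtering,
    not the filtering backend described in the paper.
    """

    def __init__(self, alpha: float = 0.3) -> None:
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha
        self.state: Optional[Line] = None

    def update(self, line: Line) -> Line:
        """Blend the new observation into the running estimate and return it."""
        if self.state is None:
            self.state = line  # first frame: take the observation as-is
        else:
            a = self.alpha
            self.state = (
                a * line[0] + (1.0 - a) * self.state[0],
                a * line[1] + (1.0 - a) * self.state[1],
            )
        return self.state
```

In use, each frame's detected points would be passed to `fit_slope_intercept` and the result fed to `LineSmoother.update`; a smaller `alpha` gives a steadier but slower-reacting estimate. Near-vertical lines are poorly conditioned in slope-intercept form, so a practical system would likely need a different parameterization for them.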

Related papers

TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder [66.22997415145467]
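This paper proposes a joint completion and detection framework that improves detection features in sparse regions. Specifically, it proposes TransBridge, a novel transformer-based up-sampling block. The results show that the framework consistently improves end-to-end 3D object detection, with mean average precision (mAP) improvements ranging from 0.7 to 1.5 across methods.
Paper reference translation (metadata) (2025-12-12T00:08:03Z)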
RoadPainter: Points Are Ideal Navigators for Topology transformER [10.179711440042123]
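Topology reasoning aims to provide an accurate understanding of road scenes so that autonomous systems can identify safe and efficient routes. We propose RoadPainter, an innovative approach for detecting and reasoning about the topology of lane centerlines from multi-view images.
Paper reference translation (metadata) (2024-07-22T03:23:35Z)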
Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection [9.633565294243173]
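We show that a combination of specific architectural changes improves accuracy and temporal stability. We analyze the use of a spatial transformer network trained alongside the landmark detector in an unsupervised manner. We show that modifying the output head of the landmark predictor to estimate landmarks in a canonical 3D space further improves accuracy.
Paper reference translation (metadata) (2024-05-30T14:54:26Z)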
DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients [105.25109274550607]
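Line segments are increasingly used in vision tasks. Traditional line detectors based on image gradients are very fast and accurate, but lack robustness on noisy images and in challenging conditions. We propose combining traditional and learned approaches to get the best of both worlds.
Paper reference translation (metadata) (2022-12-15T12:36:49Z)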
RCLane: Relay Chain Prediction for Lane Detection [76.62424079494285]
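This paper proposes a lane detection method based on relay chain prediction. The strategy establishes a new state of the art on four major benchmarks: TuSimple, CULane, CurveLanes, and LLAMAS.
Paper reference translation (metadata) (2022-07-19T16:48:39Z)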
From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery [37.78933209094847]
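This paper proposes a novel paradigm for unsupervised learning of object landmark detectors. We validate the method's effectiveness on challenging datasets including LS3D, BBCPose, Human3.6M, and PennAction.
Paper reference translation (metadata) (2022-05-31T15:44:29Z)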
SOLD2: Self-supervised Occlusion-aware Line Description and Detection [95.8719432775724]
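We introduce the first joint detection and description of line segments in a single deep network. Because our method does not require annotated line labels, it can generalize to any dataset. We evaluate our approach against previous line detection and description methods on several multi-view datasets.
Paper reference translation (metadata) (2021-04-07T19:27:17Z)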
Pretrained equivariant features improve unsupervised landmark discovery [69.02115180674885]
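We formulate a two-stage unsupervised approach that overcomes this challenge by first learning powerful pixel-based features. The method produces state-of-the-art results on several challenging landmark detection datasets.
Paper reference translation (metadata) (2021-04-07T05:42:11Z)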
Topo-boundary: A Benchmark Dataset on Topological Road-boundary Detection Using Aerial Images for Autonomous Driving [11.576868193291997]
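We propose Topo-boundary, a new benchmark dataset for offline topological road-boundary detection. The dataset contains 21,556 four-channel aerial images of size 1000 × 1000. Using the dataset, we implement and evaluate three segmentation-based baselines and five graph-based baselines.
Paper reference translation (metadata) (2021-03-31T14:42:00Z)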
Deep Hough Transform for Semantic Line Detection [70.28969017874587]
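We focus on the fundamental task of detecting meaningful line structures, i.e., semantic lines, in natural scenes. Previous methods neglect the inherent properties of lines, leading to sub-optimal performance. We propose a one-shot end-to-end learning framework for line detection.
Paper reference translation (metadata) (2020-03-10T13:08:42Z)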
Road Curb Detection and Localization with Monocular Forward-view Vehicle Camera [74.45649274085447]
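We propose a robust method for estimating 3D parameters using a calibrated monocular camera fitted with a fisheye lens. Our approach estimates the vehicle-to-curb distance in real time with more than 90% accuracy.
Paper reference translation (metadata) (2020-02-28T00:24:18Z)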

The related-papers list is generated automatically from the titles and abstracts of papers on this site.

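The operator of this site makes no guarantee regarding the quality of this site (including all information and translations) and accepts no responsibility for any consequences arising from the use of this site (including all information and translations).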