Fugu-MT Paper Translation (Abstract): Semi-supervised Learning from Street-View Images and OpenStreetMap for Automatic Building Height Estimation

arxiv url: http://arxiv.org/abs/2307.02574v1
Date: Wed, 5 Jul 2023 18:16:30 GMT
Status: Translation complete
Last updated in system: 2023-07-07 16:23:54.115920
Title: Semi-supervised Learning from Street-View Images and OpenStreetMap for Automatic Building Height Estimation
Title (reference translation): Automatic Building Height Estimation via Semi-supervised Learning from Street-View Images and OpenStreetMap
Authors: Hao Li, Zhendong Yuan, Gabriel Dax, Gefei Kong, Hongchao Fan, Alexander Zipf, Martin Werner
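Abstract summary: This paper proposes a semi-supervised learning (SSL) method that automatically estimates building height from Mapillary SVI and OpenStreetMap data. The proposed method yields a clear performance boost in building height estimation, with a Mean Absolute Error (MAE) of around 2.1 m. The preliminary results are promising and motivate future work on scaling up the proposed method based on low-cost VGI data.
Reference score (independently calculated attention score): 59.6553058160943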
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Accurate building height estimation is key to the automatic derivation of 3D city models from emerging big geospatial data, including Volunteered Geographical Information (VGI). However, an automatic solution for large-scale building height estimation based on low-cost VGI data is currently missing. The fast development of VGI data platforms, especially OpenStreetMap (OSM) and crowdsourced street-view images (SVI), offers a stimulating opportunity to fill this research gap. In this work, we propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OSM data to generate low-cost and open-source 3D city modeling in LoD1. The proposed method consists of three parts: first, we propose an SSL schema with the option of setting a different ratio of "pseudo label" during the supervised regression; second, we extract multi-level morphometric features from OSM data (i.e., buildings and streets) for the purpose of inferring building height; last, we design a building floor estimation workflow with a pre-trained facade object detection network to generate "pseudo label" from SVI and assign it to the corresponding OSM building footprint. In a case study, we validate the proposed SSL method in the city of Heidelberg, Germany and evaluate the model performance against the reference data of building heights. Based on three different regression models, namely Random Forest (RF), Support Vector Machine (SVM), and Convolutional Neural Network (CNN), the SSL method leads to a clear performance boost in estimating building heights with a Mean Absolute Error (MAE) around 2.1 meters, which is competitive with state-of-the-art approaches. The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data, with possibilities even in regions and areas with diverse data quality and availability.
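Abstract (reference translation): Accurate building height estimation is important for automatically deriving 3D city models from big geospatial data, including Volunteered Geographical Information (VGI). However, an automatic solution for large-scale building height estimation based on low-cost VGI data is currently lacking. The development of VGI data platforms, especially OpenStreetMap (OSM) and crowdsourced street-view images (SVI), offers a stimulating opportunity to fill this research gap. This work proposes a semi-supervised learning method that automatically estimates building height from Mapillary SVI and OSM data to produce low-cost, open-source 3D city modeling in LoD1. The proposed method consists of three parts: first, we propose an SSL schema with the option of setting a different ratio of "pseudo label" during the supervised regression; second, we extract multi-level morphometric features from OSM data (i.e., buildings and streets) for the purpose of inferring building height; last, we design a building floor estimation workflow with a pre-trained facade object detection network to generate "pseudo label" from SVI and assign it to the corresponding OSM building footprint. In this study, we validate the SSL method in the city of Heidelberg, Germany, and evaluate model performance against reference data of building heights. Based on three different regression models, namely Random Forest (RF), Support Vector Machine (SVM), and Convolutional Neural Network (CNN), the SSL method leads to a clear performance improvement in estimating building heights, with a Mean Absolute Error (MAE) of about 2.1 meters. The preliminary results are promising and motivate future work on scaling up the proposed method based on low-cost VGI data, including in regions and areas with varying data quality and availability.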
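The abstract describes turning floor counts detected in street-view images into "pseudo label" heights, mixing them with reference labels at a configurable ratio, and scoring predictions with MAE. The following is a minimal Python sketch of those three steps, not the authors' implementation. The 3.0 m storey height, the function names, and the reading of the ratio as "fraction of available pseudo-labeled samples added to the training set" are all assumptions made for illustration; the paper may define them differently.

```python
import random
from typing import List, Sequence, Tuple

# Assumed average storey height in meters (hypothetical, not from the paper).
ASSUMED_FLOOR_HEIGHT_M = 3.0

# One training sample: (feature vector, building height in meters).
Sample = Tuple[List[float], float]


def floors_to_height(floor_count: int,
                     floor_height_m: float = ASSUMED_FLOOR_HEIGHT_M) -> float:
    """Convert an estimated floor count into a pseudo-label height."""
    if floor_count < 0:
        raise ValueError("floor_count must be non-negative")
    return floor_count * floor_height_m


def mix_training_set(labeled: Sequence[Sample],
                     pseudo: Sequence[Sample],
                     pseudo_ratio: float,
                     seed: int = 0) -> List[Sample]:
    """Return all labeled samples plus a random subset of pseudo-labeled ones.

    pseudo_ratio is interpreted here (an assumption) as the fraction of the
    available pseudo-labeled samples to include, between 0.0 and 1.0.
    """
    if not 0.0 <= pseudo_ratio <= 1.0:
        raise ValueError("pseudo_ratio must be in [0, 1]")
    rng = random.Random(seed)  # seeded so the subset is reproducible
    n_pseudo = int(pseudo_ratio * len(pseudo))
    return list(labeled) + rng.sample(list(pseudo), n_pseudo)


def mean_absolute_error(y_true: Sequence[float],
                        y_pred: Sequence[float]) -> float:
    """Mean of |true - predicted| over all samples."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


if __name__ == "__main__":
    # Toy feature vectors stand in for OSM morphometric features.
    labeled = [([120.0, 44.0], 9.5), ([300.0, 70.0], 15.0)]
    pseudo = [([f, f / 3.0], floors_to_height(n))
              for f, n in [(100.0, 2), (150.0, 3), (220.0, 4), (400.0, 6)]]
    train = mix_training_set(labeled, pseudo, pseudo_ratio=0.5)
    print(len(train), "training samples")
    print(mean_absolute_error([10.0, 12.0], [11.0, 9.0]))
```

Any regressor (for example RF, SVM, or a CNN, as named in the abstract) could then be fitted on the mixed set and compared across different ratio values using the MAE helper.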

Related papers

Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method [17.492721759864505]
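Large-scale floor-count data are rarely available in cadastral and 3D city databases. This work proposes an end-to-end deep learning framework that estimates floor counts directly from street-level images. The proposed classification-regression network reaches 81.2% accuracy and predicts 97.9% of buildings within +/-1 floor.
Paper reference translation (metadata) (2025-05-23T15:27:46Z)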
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection [65.84604846389624]
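We propose PointOBB-v3, a stronger single point-supervised oriented object detection (OOD) framework. It generates pseudo rotated boxes without additional priors and supports an end-to-end paradigm. The method achieves a 3.56% accuracy improvement over previous state-of-the-art methods.
Paper reference translation (metadata) (2025-01-23T18:18:15Z)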
Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment [6.606615641354963]
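The pre-training and fine-tuning paradigm has revolutionized satellite remote sensing applications. We construct a large-scale ALS point cloud dataset and evaluate its impact on downstream applications. The results show that pre-trained models significantly outperform their from-scratch counterparts across downstream tasks.
Paper reference translation (metadata) (2025-01-09T09:21:09Z)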
OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances [11.085165252259042]
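OSMLoc is a brain-inspired single-image visual localization method with semantic and geometric guidance to improve accuracy, robustness, and generalization ability. To validate the proposed OSMLoc, we collect a worldwide cross-area and cross-condition (CC) benchmark and conduct extensive evaluations.
Paper reference translation (metadata) (2024-11-13T14:59:00Z)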
OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
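We propose a framework that uses a set of learnable queries to predict occupied locations and classes simultaneously. OPUS incorporates a suite of non-trivial strategies to enhance model performance. The lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at 2x FPS, while the heaviest model surpasses prior results by 6.1 RayIoU.
Paper reference translation (metadata) (2024-09-14T07:44:22Z)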
Fine-Grained Building Function Recognition from Street-View Images via Geometry-Aware Semi-Supervised Learning [18.432786227782803]
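We propose a geometry-aware semi-supervised framework for fine-grained building function recognition. It exploits geometric relationships among multi-source data to improve the accuracy of pseudo labels in semi-supervised learning. The proposed method shows superior performance in fine-grained building function recognition.
Paper reference translation (metadata) (2024-08-18T12:48:48Z)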
Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation [32.30055363306321]
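This work proposes a paradigm that seamlessly unifies different tasks and datasets related to human pose and shape. The formulation centers on the ability to query any arbitrary point of the human volume at both training and test time. It can naturally exploit differently annotated data sources, including meshes, 2D/3D skeletons, and dense pose, without converting between them.
Paper reference translation (metadata) (2024-07-10T10:44:18Z)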
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations [55.022519020409405]
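This paper builds MMScan, a multi-modal 3D scene dataset and benchmark with hierarchical grounded language annotations. The resulting multi-modal 3D dataset contains 1.4M meta-annotated captions on 109k objects and 7.7k regions, as well as over 3.04M diverse samples for 3D visual grounding and question-answering benchmarks.
Paper reference translation (metadata) (2024-06-13T17:59:30Z)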
Optimization Efficient Open-World Visual Region Recognition [55.76437190434433]
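RegionSpot integrates position-aware localization knowledge from a localization foundation model with semantic information from a ViL model. In open-world object recognition experiments, RegionSpot achieves significant performance gains over prior alternatives.
Paper reference translation (metadata) (2023-11-02T16:31:49Z)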
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training [58.07391711548269]
Masked Voxel Jigsaw and Reconstruction (MV-JAR) method for LiDAR-based self-supervised pre-training.
Paper reference translation (metadata) (2023-03-23T17:59:02Z)
CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds [55.44204039410225]
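This paper proposes CAGroup3D, a novel two-stage fully sparse 3D object detection framework. The method first generates high-quality 3D proposals by applying a class-aware local grouping strategy to voxels on object surfaces. To recover the features of voxels missed because of incorrect voxel-wise segmentation, a fully sparse convolutional RoI pooling module is built.
Paper reference translation (metadata) (2022-10-09T13:38:48Z)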
Stereo Neural Vernier Caliper [57.187088191829886]
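We propose a new object-centric framework for learning-based stereo 3D object detection. It addresses the problem of how to predict a refined update from an initial 3D cuboid estimate. The proposed approach achieves state-of-the-art performance on the KITTI benchmark.
Paper reference translation (metadata) (2022-03-21T14:36:07Z)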
H3D: Benchmark on Semantic Segmentation of High-Resolution 3D Point Clouds and textured Meshes from UAV LiDAR and Multi-View-Stereo [4.263987603222371]
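This paper introduces a 3D dataset that is unique in three ways. H3D refers to Hessigheim, a municipality in Germany. The dataset is intended both to promote research in the field of 3D data analysis and to evaluate and rank new approaches.
Paper reference translation (metadata) (2021-02-10T09:33:48Z)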
Height estimation from single aerial images using a deep ordinal regression network [12.991266182762597]
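We address the ambiguous and unsolved problem of height estimation from a single image. Following the success of deep learning, especially deep convolutional neural networks (CNNs), several studies have proposed estimating height information from a single aerial image. This paper proposes dividing height values into spacing-increasing intervals and transforming the regression problem into an ordinal regression problem.
Paper reference translation (metadata) (2020-06-04T12:03:51Z)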

The related-paper list is generated automatically from the titles and abstracts of papers on this site.

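The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any outcome arising from the use of this site (including all information and translations).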