Fugu-MT 論文翻訳(概要): In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels

論文の概要: In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels

arxiv url: http://arxiv.org/abs/2603.14309v1
Date: Sun, 15 Mar 2026 09:58:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.735919
Title: In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels
Title（参考訳）: 手動ラベルのない深層学習を用いたTLS点雲からの現地3次元小麦頭部のセグメンテーション
Authors: Tomislav Medic, Liangliang Nan,
Abstract要約: 本研究では,地上レーザー走査(TLS)点群から直接,フィールド内小麦頭部のインスタンス・セグメンテーションの課題に取り組む。そこで本研究では,新しい2段階パイプラインを提案する。最初のステージでは、3Dから2Dのマルチビュープロジェクション、ゼロショット2Dオブジェクト中心のセグメンテーションのためのグラウンドドSAMパイプライン、マルチビューラベル融合を使用する。第2段階では、これらの最初の提案をノイズの多い擬似ラベルとして使用し、教師付き3Dパノプティカルスタイルのセグメンテーションニューラルネットワークをトレーニングする。
参考スコア（独自算出の注目度）: 8.605087464884146
License: http://creativecommons.org/licenses/by/4.0/
Abstract: 3D instance segmentation for laser scanning (LiDAR) point clouds remains a challenge in many remote sensing-related domains. Successful solutions typically rely on supervised deep learning and manual annotations, and consequently focus on objects that can be well delineated through visual inspection and manual labeling of point clouds. However, for tasks with more complex and cluttered scenes, such as in-field plant phenotyping in agriculture, such approaches are often infeasible. In this study, we tackle the task of in-field wheat head instance segmentation directly from terrestrial laser scanning (TLS) point clouds. To address the problem and circumvent the need for manual annotations, we propose a novel two-stage pipeline. To obtain the initial 3D instance proposals, the first stage uses 3D-to-2D multi-view projections, the Grounded SAM pipeline for zero-shot 2D object-centric segmentation, and multi-view label fusion. The second stage uses these initial proposals as noisy pseudo-labels to train a supervised 3D panoptic-style segmentation neural network. Our results demonstrate the feasibility of the proposed approach and show performance improvementsrelative to Wheat3DGS, a recent alternative solution for in-field wheat head instance segmentation without manual 3D annotations based on multi-view RGB images and 3D Gaussian Splatting, showcasing TLS as a competitive sensing alternative. Moreover, the results show that both stages of the proposed pipeline can deliver usable 3D instance segmentation without manual annotations, indicating promising, low-effort transferability to other comparable TLS-based point cloud segmentation tasks.
Abstract（参考訳）: レーザースキャン(LiDAR)ポイントクラウドのための3Dインスタンスセグメンテーションは、多くのリモートセンシング関連ドメインにおいて依然として課題である。成功しているソリューションは一般的に、教師付きディープラーニングと手動アノテーションに依存しているため、視覚的なインスペクションやポイントクラウドのマニュアルラベリングを通じて、適切に記述可能なオブジェクトにフォーカスする。しかし、農業における植物の表現型化のような、より複雑で散らかった場面では、このようなアプローチは不可能であることが多い。本研究では,地球外レーザースキャン(TLS)点雲から直接,フィールド内小麦頭部インスタンスのセグメンテーションの課題に取り組む。そこで我々は,この問題に対処し,手動アノテーションの必要性を回避するために,新しい2段階パイプラインを提案する。最初の3Dインスタンスの提案を得るために、第1ステージでは3Dから2Dのマルチビュープロジェクション、ゼロショット2Dオブジェクト中心のセグメンテーションのためのグラウンドドSAMパイプライン、マルチビューラベル融合を使用する。第2段階では、これらの最初の提案をノイズの多い擬似ラベルとして使用し、教師付き3Dパノプティカルスタイルのセグメンテーションニューラルネットワークをトレーニングする。 Wheat3DGSは,多視点RGB画像と3次元ガウススプラッティングに基づく手動3Dアノテーションを使わずに,近年のフィールド内小麦頭インスタンスセグメンテーションの代替ソリューションであり,TLSを競争力のある代替手段として示している。さらに,提案したパイプラインのいずれのステージでも,手動のアノテーションを使わずに利用可能な3Dインスタンスセグメンテーションが実現可能であることを示す。

論文の概要: In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels

関連論文リスト