Fugu-MT 論文翻訳(概要): ForestMamba: Sparse Mamba with Geometry-guided Queries for 3D Forest Point Cloud Segmentation

論文の概要: ForestMamba: Sparse Mamba with Geometry-guided Queries for 3D Forest Point Cloud Segmentation

arxiv url: http://arxiv.org/abs/2606.01549v1
Date: Mon, 01 Jun 2026 01:49:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.861046
Title: ForestMamba: Sparse Mamba with Geometry-guided Queries for 3D Forest Point Cloud Segmentation
Title（参考訳）: ForestMamba:3Dフォレストポイントクラウドセグメンテーションのための幾何学誘導クェリ付きスパースマンバ
Authors: Trung Thanh Nguyen, Tuan-Anh Vu, Duc Viet Le, Yasutomo Kawanishi, Takahiro Komamizu, Ichiro Ide, Teja Kattenborn,
Abstract要約: ForestMambaは、フォレスト固有の事前情報を特徴符号化、クエリ生成、クエリ改善に組み込む構造認識手法である。 7つの森林地域での実験では、フォレストマンバは、両方のセグメンテーションタスクにおいて、既存のベースラインを一貫して上回っていることが示されている。
参考スコア（独自算出の注目度）: 9.432335594963956
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AI-based semantic and instance segmentation of terrestrial and drone LiDAR point clouds is emerging as a transformative approach for converting the complex 3D structure of forests into actionable information for forest monitoring and biodiversity assessment. However, forest LiDAR scenes remain highly challenging due to their large data volumes, irregular sampling density, overlapping and complex canopy structure, and geographic variability. Existing methods based on sparse convolutions or Transformers achieve promising results, but suffer from two key limitations: Quadratic complexity of attention scales poorly to large forest scenes, and Generic context modeling does not exploit forest structural priors, limiting tree separation in complex regions. To address these challenges, we propose ForestMamba, a structure-aware method that incorporates forest-specific priors into feature encoding, query generation, and query refinement, while replacing quadratic attention with linear-time state-space modeling. First, we introduce a sparse encoder with vertical-priority slab serialization that organizes sparse voxels into vertically coherent sequences for efficient long-range context modeling. Second, we propose a geometry-guided query initialization strategy based on an on-the-fly multi-scale Canopy Height Model (CHM), where canopy maxima provide ecologically meaningful query seeds, supplemented by Farthest Point Sampling (FPS) to cover understory trees. Third, we design a Mamba-based query decoder that combines local kNN voxel aggregation with a spatial dual-path Mamba for query refinement with linear computational complexity. Extensive experiments across seven forest regions demonstrate that ForestMamba consistently outperforms existing baselines in both segmentation tasks, while achieving 3 times faster inference and 2.3 times lower GPU memory than Transformer-based methods.
Abstract（参考訳）: AIに基づく地球とドローンのLiDAR点雲のセマンティックスとインスタンスセグメンテーションは、森林の複雑な3D構造を森林監視と生物多様性評価のための実行可能な情報に変換するための転換的アプローチとして現れつつある。しかし、森林のLiDARのシーンは、大きなデータ量、不規則なサンプリング密度、重なり合う複雑な天蓋構造、地理的変動のため、非常に困難なままである。既存のスパース・コンボリューション(英語版)やトランスフォーマー(英語版)に基づく手法は、有望な結果をもたらすが、2つの重要な制限を被る: 注意の二次的複雑さは、大きな森林のシーンに不十分にスケールし、ジェネリック・コンテクスト・モデリングは、複雑な地域での樹木の分離を制限する、森の構造的先行を活用しない。このような課題に対処するため,フォレストマンバは,2次注意を線形時間状態空間モデリングに置き換えつつ,森林固有の先行情報を特徴符号化,クエリ生成,クエリ改善に組み込む構造認識手法である。まず,スパルスボクセルを垂直コヒーレントなシーケンスに整理し,より効率的な長距離コンテキストモデリングを行うスパルスエンコーダを提案する。第2に,本研究では,Farthest Point Sampling (FPS) によって補足された,エコロジー的に意味のあるクエリシードを提供する,オンザフライマルチスケールのキャノピーハイトモデル(CHM)に基づく,幾何学誘導型クエリ初期化戦略を提案する。第3に,局所的なkNNボクセルアグリゲーションと空間的二重パスのMambaを組み合わせたクエリデコーダを設計し,クエリリファインメントと線形計算複雑性を両立させる。 7つの森林地域にわたる大規模な実験により、フォレストマンバは両方のセグメンテーションタスクにおいて既存のベースラインを一貫して上回り、トランスフォーマーベースの手法よりも推論が3倍、GPUメモリが2.3倍低いことが示されている。

論文の概要: ForestMamba: Sparse Mamba with Geometry-guided Queries for 3D Forest Point Cloud Segmentation

関連論文リスト