Fugu-MT 論文翻訳(概要): GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer

論文の概要: GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer

arxiv url: http://arxiv.org/abs/2605.24243v1
Date: Fri, 22 May 2026 21:42:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:17.77041
Title: GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer
Title（参考訳）: GIBLy: アーキテクチャに依存しない幾何学的誘導バイアス層による3次元セマンティックセマンティックセマンティックセグメンテーションの改善
Authors: Diogo Lavado, Alessandra Micheletti, Clàudia Soares,
Abstract要約: 3Dシーン理解では、ディープラーニングモデルは、基本的な幾何学的構造を捉えるために、大きなモデルと広範な訓練に依存している。 GIBLyは3次元セグメンテーションパイプラインにプリエントを統合する軽量な帰納的幾何バイアス層である。複数の3次元セマンティックセグメンテーションベンチマークにまたがるアプローチを検証し、一貫した性能向上を示す。
参考スコア（独自算出の注目度）: 41.99844472131922
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In 3D scene understanding, deep learning models rely on large models and extensive training to capture basic geometric structures that are present in the 3D data. However, existing methods lack explicit mechanisms to incorporate geometric information, such as learnable primitive shapes, often necessitating large models and more training data which in turn increases cost and can limit generalization. We introduce GIBLy, a lightweight geometric inductive bias layer that integrates learnable geometric priors into 3D segmentation pipelines. GIBLy enhances existing architectures -- whether MLP-based, convolution-based, or transformer-based -- by providing features aligned with simple geometric shapes (and thus human-interpretable) that improve segmentation performance with minimal computational overhead. We validate our approach across multiple 3D semantic segmentation benchmarks, demonstrating consistent performance gains, including up to +11.5% mIoU on TS40K with PTV3, while adding only 58K extra parameters. Our results highlight the benefit of explicitly encoding geometric structure to support accurate and efficient 3D scene understanding, with a lightweight add-on layer
Abstract（参考訳）: 3Dシーン理解において、ディープラーニングモデルは、3Dデータに存在する基本的な幾何学的構造を捉えるために、大きなモデルと広範な訓練に依存している。しかし、既存の手法では、学習可能な原始形状のような幾何学的情報を組み込むための明確なメカニズムが欠如しており、しばしば大きなモデルや、コストを増大させ、一般化を制限する訓練データを必要とする。 GIBLyは、学習可能な幾何学的先行要素を3次元セグメント化パイプラインに統合する軽量な幾何学的帰納バイアス層である。 GIBLyは既存のアーキテクチャ(MLPベース、畳み込みベース、トランスフォーマーベース)を強化し、計算オーバーヘッドを最小限に抑えてセグメンテーション性能を改善する単純な幾何学的形状(従って人間解釈可能)に整列した機能を提供する。我々は、複数の3Dセマンティックセグメンテーションベンチマークにまたがってアプローチを検証し、TS40KとPTV3で+11.5% mIoUまでの性能向上を示した。本研究は,3次元シーン理解を高精度かつ効率的に支援するための幾何学的構造を,軽量なアドオン層で明示的に符号化する利点を強調した。

論文の概要: GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer

関連論文リスト