Fugu-MT 論文翻訳(概要): PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding

論文の概要: PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding

arxiv url: http://arxiv.org/abs/2604.04933v1
Date: Mon, 06 Apr 2026 17:59:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:19.340301
Title: PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding
Title（参考訳）: PointTPA:3次元シーン理解のための動的ネットワークパラメータ適応
Authors: Siyuan Liu, Chaoqun Zheng, Xin Zhou, Tianrui Feng, Dingkang Liang, Xiang Bai,
Abstract要約: 本研究では,シーンレベルの点群に対する入力対応ネットワークパラメータを生成するテスト時間適応フレームワークを提案する。 PointTPAはScanNetの検証で78.4%のmIoUを達成した。
参考スコア（独自算出の注目度）: 51.98954431091292
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Scene-level point cloud understanding remains challenging due to diverse geometries, imbalanced category distributions, and highly varied spatial layouts. Existing methods improve object-level performance but rely on static network parameters during inference, limiting their adaptability to dynamic scene data. We propose PointTPA, a Test-time Parameter Adaptation framework that generates input-aware network parameters for scene-level point clouds. PointTPA adopts a Serialization-based Neighborhood Grouping (SNG) to form locally coherent patches and a Dynamic Parameter Projector (DPP) to produce patch-wise adaptive weights, enabling the backbone to adjust its behavior according to scene-specific variations while maintaining a low parameter overhead. Integrated into the PTv3 structure, PointTPA demonstrates strong parameter efficiency by introducing two lightweight modules of less than 2% of the backbone's parameters. Despite this minimal parameter overhead, PointTPA achieves 78.4% mIoU on ScanNet validation, surpassing existing parameter-efficient fine-tuning (PEFT) methods across multiple benchmarks, highlighting the efficacy of our test-time dynamic network parameter adaptation mechanism in enhancing 3D scene understanding. The code is available at https://github.com/H-EmbodVis/PointTPA.
Abstract（参考訳）: 多様な地形、不均衡なカテゴリー分布、非常に多様な空間配置のため、シーンレベルの雲の理解は依然として困難である。既存の手法ではオブジェクトレベルのパフォーマンスが向上するが、推論中に静的なネットワークパラメータに依存するため、動的なシーンデータへの適応性が制限される。本研究では,シーンレベルの点群に対する入力対応ネットワークパラメータを生成するテスト時間パラメータ適応フレームワークであるPointTPAを提案する。 PointTPAは、SNG(Serialization-based Neighborhood Grouping)を採用し、局所的コヒーレントパッチと動的パラメータプロジェクタ(DPP)を作成してパッチワイドな重みを発生させる。 PTv3構造に統合されたPointTPAは、バックボーンのパラメータの2%未満の2つの軽量モジュールを導入することで、強力なパラメータ効率を示す。この最小パラメータオーバーヘッドにもかかわらず、PointTPAはScanNetのバリデーションにおいて78.4% mIoUを達成し、既存のパラメータ効率の微調整(PEFT)メソッドを複数のベンチマークで上回り、3Dシーン理解の強化におけるテストタイム動的ネットワークパラメータ適応機構の有効性を強調した。コードはhttps://github.com/H-EmbodVis/PointTPAで入手できる。

論文の概要: PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding

関連論文リスト