Fugu-MT 論文翻訳(概要): Efficient Point Cloud Processing with High-Dimensional Positional Encoding and Non-Local MLPs

論文の概要: Efficient Point Cloud Processing with High-Dimensional Positional Encoding and Non-Local MLPs

arxiv url: http://arxiv.org/abs/2603.04099v1
Date: Wed, 04 Mar 2026 14:12:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-05 21:29:15.340657
Title: Efficient Point Cloud Processing with High-Dimensional Positional Encoding and Non-Local MLPs
Title（参考訳）: 高次元位置符号化と非局所MLPを用いた効率的なポイントクラウド処理
Authors: Yanmei Zou, Hongshan Yu, Yaonan Wang, Zhengeng Yang, Xieyuanli Chen, Kailun Yang, Naveed Akhtar,
Abstract要約: ポイントクラウド処理におけるモジュラー特徴抽出のための2段階の抽象化・改善(ABSREF)ビューを開発する。位置情報を明示的に活用するためのHPE(High-stage Positional)モジュールを提案する。 ABSREFの視点では、関係における局所的な集約を再考し、時間を要する局所的な操作を置き換えることを提案する。
参考スコア（独自算出の注目度）: 68.55902504866422
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-Layer Perceptron (MLP) models are the foundation of contemporary point cloud processing. However, their complex network architectures obscure the source of their strength and limit the application of these models. In this article, we develop a two-stage abstraction and refinement (ABS-REF) view for modular feature extraction in point cloud processing. This view elucidates that whereas the early models focused on ABS stages, the more recent techniques devise sophisticated REF stages to attain performance advantages. Then, we propose a High-dimensional Positional Encoding (HPE) module to explicitly utilize intrinsic positional information, extending the ``positional encoding'' concept from Transformer literature. HPE can be readily deployed in MLP-based architectures and is compatible with transformer-based methods. Within our ABS-REF view, we rethink local aggregation in MLP-based methods and propose replacing time-consuming local MLP operations, which are used to capture local relationships among neighbors. Instead, we use non-local MLPs for efficient non-local information updates, combined with the proposed HPE for effective local information representation. We leverage our modules to develop HPENets, a suite of MLP networks that follow the ABS-REF paradigm, incorporating a scalable HPE-based REF stage. Extensive experiments on seven public datasets across four different tasks show that HPENets deliver a strong balance between efficiency and effectiveness. Notably, HPENet surpasses PointNeXt, a strong MLP-based counterpart, by 1.1% mAcc, 4.0% mIoU, 1.8% mIoU, and 0.2% Cls. mIoU, with only 50.0%, 21.5%, 23.1%, 44.4% of FLOPs on ScanObjectNN, S3DIS, ScanNet, and ShapeNetPart, respectively. Source code is available at https://github.com/zouyanmei/HPENet_v2.git.
Abstract（参考訳）: MLP(Multi-Layer Perceptron)モデルは、現代のクラウド処理の基礎である。しかし、それらの複雑なネットワークアーキテクチャは、その強みの源を曖昧にし、これらのモデルの適用を制限する。本稿では、ポイントクラウド処理におけるモジュラー特徴抽出のための2段階の抽象化・改善(ABS-REF)ビューを開発する。この見解は、初期のモデルはABSステージに焦点を当てていたが、より最近の技術は、パフォーマンス上の優位性を得るために洗練されたREFステージを考案した。そこで,本研究では,「位置符号化」の概念をトランスフォーマー文学から拡張し,内在的位置情報を明示的に活用する高次元位置符号化(HPE)モジュールを提案する。 HPEはMPPベースのアーキテクチャで容易にデプロイでき、トランスフォーマーベースのメソッドと互換性がある。 ABS-REF ビューでは,ローカルアグリゲーションを MLP ベースの手法で再考し,近隣住民のローカルな関係を捉えるために使用される,時間を要するローカルな MLP 操作の置き換えを提案する。代わりに、効率的なローカル情報更新に非ローカルMPPを使用し、効率的なローカル情報表現に提案されたHPEと組み合わせる。 ABS-REFパラダイムに従い、スケーラブルなHPEベースのREFステージを組み込んだMLPネットワークスイートであるHPENetsを開発するために、当社のモジュールを活用している。 4つの異なるタスクにわたる7つの公開データセットに関する大規模な実験は、HPENetsが効率と有効性の間に強いバランスを提供することを示している。特に、HPENetは強力なMLPベースのPointNeXtを1.1% mAcc、4.0% mIoU、1.8% mIoU、0.2% Clsで上回っている。 mIoUは、ScanObjectNN、S3DIS、ScanNet、ShapeNetPartのFLOPのわずか50.0%、21.5%、23.1%、44.4%である。ソースコードはhttps://github.com/zouyanmei/HPENet_v2.gitで入手できる。

論文の概要: Efficient Point Cloud Processing with High-Dimensional Positional Encoding and Non-Local MLPs

関連論文リスト