Fugu-MT Paper Translation (Abstract): Bidirectional Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression

Paper overview: Bidirectional Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression

arxiv url: http://arxiv.org/abs/2509.14591v2
Date: Sun, 02 Nov 2025 05:01:45 GMT
Status: Translation complete
System update date: 2025-11-04 16:14:22.24314
Title: Bidirectional Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression
Authors: Xuan Deng, Xingtao Wang, Xiandong Meng, Longguang Wang, Tiange Zhang, Xiaopeng Fan, Debin Zhao
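Abstract summary: The paper proposes a Bidirectional Feature-aligned Motion Transformation (Bi-FMT) framework that implicitly models motion in the feature space. Bi-FMT aligns features across both past and future frames to produce temporally consistent latent representations. Bi-FMT is shown to surpass D-DPCC and AdaDPCC in both compression efficiency and runtime.
Reference score (independently computed attention score): 97.66080040613726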
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Efficient dynamic point cloud compression (DPCC) critically depends on accurate motion estimation and compensation. However, the inherently irregular structure and substantial local variations of point clouds make this task highly challenging. Existing approaches typically rely on explicit motion estimation, whose encoded motion vectors often fail to capture complex dynamics and inadequately exploit temporal correlations. To address these limitations, we propose a Bidirectional Feature-aligned Motion Transformation (Bi-FMT) framework that implicitly models motion in the feature space. Bi-FMT aligns features across both past and future frames to produce temporally consistent latent representations, which serve as predictive context in a conditional coding pipeline, forming a unified "Motion + Conditional" representation. Built upon this bidirectional feature alignment, we introduce a Cross-Transformer Refinement (CTR) module at the decoder side to adaptively refine locally aligned features. By modeling cross-frame dependencies with vector attention, CTR enhances local consistency and restores fine-grained spatial details that are often lost during motion alignment. Moreover, we design a Random Access (RA) reference strategy that treats the bidirectionally aligned features as conditional context, enabling frame-level parallel compression and eliminating sequential encoding. Extensive experiments demonstrate that Bi-FMT surpasses D-DPCC and AdaDPCC in both compression efficiency and runtime, achieving BD-Rate reductions of 20% (D1) and 9.4% (D1), respectively.
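The abstract reports its gains as BD-Rate reductions on the D1 metric. As a rough illustration of what such a figure means, the sketch below follows the usual Bjøntegaard procedure: fit a cubic to log-bitrate as a function of quality for each codec, average the gap between the two curves over the shared quality range, and convert that gap to a percentage. The function names and all rate/quality values are hypothetical and chosen only for illustration; they are not taken from the paper, and the paper's own evaluation scripts may differ.

```python
import math

def lagrange_eval(xs, ys, x):
    """Evaluate the unique polynomial through the points (xs, ys) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total

def integrate_cubic(xs, ys, lo, hi):
    """Integrate the cubic through four points; Simpson's rule is exact for cubics."""
    mid = 0.5 * (lo + hi)
    f = lambda x: lagrange_eval(xs, ys, x)
    return (hi - lo) / 6.0 * (f(lo) + 4.0 * f(mid) + f(hi))

def bd_rate(anchor, test):
    """BD-Rate in percent for two lists of four (bitrate, quality) points.

    With exactly four points per curve, the cubic least-squares fit used by
    the Bjontegaard method reduces to interpolation. Negative results mean
    the test codec needs fewer bits than the anchor at equal quality.
    """
    a_q = [q for _, q in anchor]
    a_r = [math.log(r) for r, _ in anchor]
    t_q = [q for _, q in test]
    t_r = [math.log(r) for r, _ in test]
    lo = max(min(a_q), min(t_q))  # overlapping quality interval
    hi = min(max(a_q), max(t_q))
    avg_gap = (integrate_cubic(t_q, t_r, lo, hi)
               - integrate_cubic(a_q, a_r, lo, hi)) / (hi - lo)
    return (math.exp(avg_gap) - 1.0) * 100.0

# Hypothetical (bits per point, D1 PSNR in dB) pairs, NOT results from the paper.
anchor_points = [(0.05, 62.0), (0.10, 66.0), (0.20, 70.0), (0.40, 73.0)]
# Made-up test codec that spends 10% fewer bits at every quality level.
test_points = [(0.9 * r, q) for r, q in anchor_points]

print(f"BD-Rate: {bd_rate(anchor_points, test_points):.2f}%")
```

Because the made-up test curve uses exactly 10% fewer bits at every quality level, the computed BD-Rate comes out at about -10%, which is a convenient sanity check for the routine.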

Related papers