Fugu-MT 論文翻訳(概要): HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction

論文の概要: HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction

arxiv url: http://arxiv.org/abs/2511.10211v1
Date: Fri, 14 Nov 2025 01:39:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-14 22:53:22.748069
Title: HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction
Title（参考訳）: HeatV2X: 効率的な配向と相互作用によるスケーラブルな異種協調知覚
Authors: Yueran Zhao, Zhang Zhang, Chao Sun, Tianze Wang, Chao Yue, Nuoran Li,
Abstract要約: V2X(Van-to-Everything)コラボレーティブ・インセプションは、トランスミッションを通じて、単一車両の限界を超えて知覚を拡大する。既存のフレームワークは、(1)参加エージェントは本質的にマルチモーダルで異種であり、(2)新しいエージェントに対応するためには、協調フレームワークはスケーラブルでなければならない。スケーラブルな協調フレームワークであるHeatV2Xを提案する。
参考スコア（独自算出の注目度）: 7.171380055232685
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vehicle-to-Everything (V2X) collaborative perception extends sensing beyond single vehicle limits through transmission. However, as more agents participate, existing frameworks face two key challenges: (1) the participating agents are inherently multi-modal and heterogeneous, and (2) the collaborative framework must be scalable to accommodate new agents. The former requires effective cross-agent feature alignment to mitigate heterogeneity loss, while the latter renders full-parameter training impractical, highlighting the importance of scalable adaptation. To address these issues, we propose Heterogeneous Adaptation (HeatV2X), a scalable collaborative framework. We first train a high-performance agent based on heterogeneous graph attention as the foundation for collaborative learning. Then, we design Local Heterogeneous Fine-Tuning and Global Collaborative Fine-Tuning to achieve effective alignment and interaction among heterogeneous agents. The former efficiently extracts modality-specific differences using Hetero-Aware Adapters, while the latter employs the Multi-Cognitive Adapter to enhance cross-agent collaboration and fully exploit the fusion potential. These designs enable substantial performance improvement of the collaborative framework with minimal training cost. We evaluate our approach on the OPV2V-H and DAIR-V2X datasets. Experimental results demonstrate that our method achieves superior perception performance with significantly reduced training overhead, outperforming existing state-of-the-art approaches. Our implementation will be released soon.
Abstract（参考訳）: V2X(Van-to-Everything)コラボレーティブ・インセプションは、トランスミッションを通じて、単一車両の限界を超えて知覚を拡大する。しかし,(1)参加するエージェントは本質的にマルチモーダルで異種であり,(2)新しいエージェントに対応するためには協調的なフレームワークがスケーラブルでなければならない。前者は不均一性損失を軽減するために効果的なクロスエージェント機能アライメントを必要とし、後者はフルパラメータトレーニングを非現実的に実施し、スケーラブルな適応の重要性を強調している。これらの問題に対処するために、スケーラブルな協調フレームワークであるHeatV2X(Heterogeneous Adaptation)を提案する。まず,協調学習の基礎として,異種グラフの注意に基づく高性能エージェントを訓練する。そこで我々は,異種エージェント間の効果的なアライメントと相互作用を実現するために,局所異種ファインチューニングとグローバルコラボレーティブファインチューニングを設計した。前者はHetero-Aware Adaptersを用いて効率よくモダリティ特異的な差異を抽出し、後者はMulti-Cognitive Adapterを用いてクロスエージェントコラボレーションを強化し、融合ポテンシャルを完全に活用する。これらの設計は、最小のトレーニングコストで協調フレームワークのパフォーマンスを大幅に向上させる。我々は,OPV2V-HとDAIR-V2Xデータセットに対するアプローチを評価した。実験により,本手法は訓練のオーバーヘッドを大幅に低減し,既存の最先端手法よりも優れた知覚性能が得られることが示された。私たちの実装はまもなくリリースされるでしょう。

論文の概要: HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction

関連論文リスト