Fugu-MT 論文翻訳(概要): ModTrack: Sensor-Agnostic Multi-View Tracking via Identity-Informed PHD Filtering with Covariance Propagation

論文の概要: ModTrack: Sensor-Agnostic Multi-View Tracking via Identity-Informed PHD Filtering with Covariance Propagation

arxiv url: http://arxiv.org/abs/2603.15812v1
Date: Mon, 16 Mar 2026 18:43:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:06.948969
Title: ModTrack: Sensor-Agnostic Multi-View Tracking via Identity-Informed PHD Filtering with Covariance Propagation
Title（参考訳）: ModTrack:Covariance Propagationを用いたアイデンティティインフォームドPHDフィルタによるセンサ非依存多視点トラッキング
Authors: Aditya Iyer, Jack Roberts, Nora Ayanian,
Abstract要約: MV-MOT(Multi-View Multi-Object Tracking)は、複数のセンサで観測されるオブジェクトの一貫したアイデンティティをローカライズし、維持することを目的としている。最近のエンドツーエンドアプローチでは、2D Birdのアイビュー表現とアイデンティティアソシエーションを共同で学習し、高いトラッキング精度を実現することでこの問題に対処している。本稿では,モジュール型MV-MOTシステムであるModTrackを提案する。
参考スコア（独自算出の注目度）: 1.0314567485992163
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Multi-View Multi-Object Tracking (MV-MOT) aims to localize and maintain consistent identities of objects observed by multiple sensors. This task is challenging, as viewpoint changes and occlusion disrupt identity consistency across views and time. Recent end-to-end approaches address this by jointly learning 2D Bird's Eye View (BEV) representations and identity associations, achieving high tracking accuracy. However, these methods offer no principled uncertainty accounting and remain tightly coupled to their training configuration, limiting generalization across sensor layouts, modalities, or datasets without retraining. We propose ModTrack, a modular MV-MOT system that matches end-to-end performance while providing cross-modal, sensor-agnostic generalization and traceable uncertainty. ModTrack confines learning methods to just the \textit{Detection and Feature Extraction} stage of the MV-MOT pipeline, performing all fusion, association, and tracking with closed-form analytical methods. Our design reduces each sensor's output to calibrated position-covariance pairs $(\mathbf{z}, R)$; cross-view clustering and precision-weighted fusion then yield unified estimates $(\hat{\mathbf{z}}, \hat{R})$ for identity assignment and temporal tracking. A feedback-coupled, identity-informed Gaussian Mixture Probability Hypothesis Density (GM-PHD) filter with HMM motion modes uses these fused estimates to maintain identities under missed detections and heavy occlusion. ModTrack achieves 95.5 IDF1 and 91.4 MOTA on \textit{WildTrack}, surpassing all prior modular methods by over 21 points and rivaling the state-of-the-art end-to-end methods while providing deployment flexibility they cannot. Specifically, the same tracker core transfers unchanged to \textit{MultiviewX} and \textit{RadarScenes}, with only perception-module replacement required to extend to new domains and sensor modalities.
Abstract（参考訳）: MV-MOT(Multi-View Multi-Object Tracking)は、複数のセンサで観測されるオブジェクトの一貫したアイデンティティをローカライズし、維持することを目的としている。視点の変化と隠蔽が、ビューと時間の間でアイデンティティの一貫性を損なうため、このタスクは難しい。最近のエンドツーエンドアプローチでは、2D Bird's Eye View(BEV)表現とアイデンティティ関連を共同で学習し、高いトラッキング精度を実現する。しかし、これらの手法は原則化された不確実性会計を提供しておらず、トレーニング設定に密結合し、センサレイアウト、モダリティ、データセットを再トレーニングすることなく一般化を制限する。本稿では,モジュール型MV-MOTシステムであるModTrackを提案する。 ModTrack は MV-MOT パイプラインの \textit{Detection and Feature extract} ステージのみに学習手法を限定し、クローズドフォーム解析手法で全ての融合、関連付け、追跡を行う。我々の設計では、各センサの出力を校正された位置共分散ペア$(\mathbf{z}, R)$; クロスビュークラスタリングと精度重み付き融合に還元し、アイデンティティ割り当てと時間追跡のために統一された推定$(\hat{\mathbf{z}}, \hat{R})$を生成する。 HMMモーションモード付きフィードバック結合型ガウス混合確率仮説密度(GM-PHD)フィルタは、これらの融合推定値を用いて、欠落検出および重閉塞下での同一性を維持する。 ModTrack は 95.5 IDF1 と 91.4 MOTA を \textit{WildTrack} 上で達成し、以前のすべてのモジュールメソッドを21ポイント以上越え、最先端のエンドツーエンドメソッドと競合し、デプロイの柔軟性を提供する。具体的には、同じトラッカーコアが \textit{MultiviewX} と \textit{RadarScenes} に変化せず、新しいドメインやセンサーのモダリティに拡張するためには、認識モジュールの置き換えしか必要とされない。

論文の概要: ModTrack: Sensor-Agnostic Multi-View Tracking via Identity-Informed PHD Filtering with Covariance Propagation

関連論文リスト